Tutorialspoint

April Learning Carnival is here, Use code FEST10 for an extra 10% off

Apache Hive for Data Engineers (Hands On)

person icon Bigdata Engineer

3.9

Apache Hive for Data Engineers (Hands On)

Learn everything about Apache Hive a modern, data warehouse.

updated on icon Updated on Apr, 2024

language icon Language - English

person icon Bigdata Engineer

English [CC]

category icon Databases,Apache

Lectures -92

Resources -1

Duration -6 hours

3.9

price-loader

30-days Money-Back Guarantee

Training 5 or more people ?

Get your team access to 10000+ top Tutorials Point courses anytime, anywhere.

Course Description

The Apache Hive data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Structure can be projected onto data already in storage. A command-line tool and JDBC driver are provided to connect users to Hive.

One of the most valuable technology skills is the ability to analyze huge data sets, and this course is specifically designed to bring you up to speed on one of the best technologies for this task, Apache Hive! The top technology companies like Google, Facebook, Netflix, Airbnb, Amazon, NASA, and more are all using Apache Hive!

Built on top of Apache Hadoop, Hive provides the following features:

  • Tools to enable easy access to data via SQL, thus enabling data warehousing tasks such as extract/transform/load (ETL), reporting, and data analysis.

  • A mechanism to impose structure on a variety of data formats

  • Access to files stored either directly in Apache HDFS™ or in other data storage systems such as Apache HBase™

  • Query execution via Apache Tez™, Apache Spark™, or MapReduce

  • Procedural language with HPL-SQL

  • Sub-second query retrieval via Hive LLAP, Apache YARN and Apache Slider.

Hive provides standard SQL functionality, including many of the later SQL:2003, SQL:2011, and SQL:2016 features for analytics.
Hive's SQL can also be extended with user code via user defined functions (UDFs), user defined aggregates (UDAFs), and user defined table functions (UDTFs).

There is not a single "Hive format" in which data must be stored. Hive comes with built in connectors for comma and tab-separated values (CSV/TSV) text files, Apache Parquet™, Apache ORC™, and other formats. Users can extend Hive with connectors for other formats. Please see File Formats and Hive SerDe in the Developer Guide for details.

Hive is not designed for online transaction processing (OLTP) workloads. It is best used for traditional data warehousing tasks.

Hive is designed to maximize scalability (scale out with more machines added dynamically to the Hadoop cluster), performance, extensibility, fault-tolerance, and loose-coupling with its input formats.

Goals

What will you learn in this course:

  • Why Hive is necessary for Data Engineer
  • The goal of this course is to help you become familiar with Apache Hive bits and bytes
  • Learn A to Z of Apache HIVE (From Basic to Advance level).
  • Hands on Experience on Apache Hive and Real-time Use Case

Prerequisites

What are the prerequisites for this course?

  • Basic Knowledge of Hadoop
  • Basic Knowledge of SQL and Database
  • Desktop or Laptop with Ubuntu Operating System and Minimum 8 GB RAM is recommended
Apache Hive for Data Engineers (Hands On)

Curriculum

Check out the detailed breakdown of what’s inside the course

Introduction
8 Lectures
  • play icon Introduction to Course 04:47 04:47
  • play icon Introduction to Apache Hive 01:51 01:51
  • play icon Hive Architecture 05:05 05:05
  • play icon How a Hive query flows through the system. 03:16 03:16
  • play icon (Optional) Introduction to Big Data 03:42 03:42
  • play icon (Optional) What is Hadoop 07:13 07:13
  • play icon Hive Features 02:28 02:28
  • play icon Hive Limitation 00:58 00:58
Installing Apache Hive on Ubuntu (Linux) Machine
2 Lectures
Tutorialspoint
Hive Data Model
4 Lectures
Tutorialspoint
Hive Data Types
3 Lectures
Tutorialspoint
HIVE Data Definition Language.
18 Lectures
Tutorialspoint
HIVE Data Manipulation Language
5 Lectures
Tutorialspoint
Hive View, Metastore, Partitions, and Bucketing
11 Lectures
Tutorialspoint
Hive Built-In Functions
3 Lectures
Tutorialspoint
Built-in Operators
4 Lectures
Tutorialspoint
Hive Join
5 Lectures
Tutorialspoint
Frequently Asked Interview Question and Answers
8 Lectures
Tutorialspoint
Hands On Projects (2 Projects)
19 Lectures
Tutorialspoint
Working with XML and JSON
2 Lectures
Tutorialspoint

Instructor Details

Bigdata Engineer

Bigdata Engineer

e


Course Certificate

Use your certificate to make a career change or to advance in your current career.

sample Tutorialspoint certificate

Our students work
with the Best

Feedbacks

D

Dipak Panda

e

Sounds like reading the slides. Need to be more elaborative and use more diagrams.

Related Video Courses

View More

Annual Membership

Become a valued member of Tutorials Point and enjoy unlimited access to our vast library of top-rated Video Courses

Subscribe now
Annual Membership

Online Certifications

Master prominent technologies at full length and become a valued certified professional.

Explore Now
Online Certifications

Talk to us

1800-202-0515