Apache Druid for Data Engineers (Hands-On)
Learn everything about Apache Druid a modern real-time analytics database.
Lectures -43
Resources -3
Duration -2 hours
30-days Money-Back Guarantee
Get your team access to 10000+ top Tutorials Point courses anytime, anywhere.
Course Description
Druid is a high-performance, real-time analytics database that delivers sub-second queries on streaming and batch data at scale and under load.
Apache Druid is a real-time analytics database designed for fast slice-and-dice analytics ("OLAP" queries) on large data sets. Most often, Druid powers use cases where real-time ingestion, fast query performance, and high uptime are important.
Druid is commonly used as the database backend for GUIs of analytical applications, or for highly-concurrent APIs that need fast aggregations. Druid works best with event-oriented data.
One of the most valuable technology skills is the ability to Real-time analytics databases handle analytics on large amounts of data by optimizing resources to enable compute-heavy workloads, and this course is specifically designed to bring you up to speed on one of the best technologies for this task, Apache Duid! The top technology companies like Google, Facebook, Netflix, Airbnb, Amazon, NASA, and more are all using Apache Druid!
Apache Druid Essentials: Unleashing Real-time Analytics and Scalable Data Exploration
Unlock the potential of real-time analytics and scalable data exploration with our comprehensive Apache Druid Essentials course. In this dynamic program, participants will delve into the world of Apache Druid, an open-source, high-performance analytics database designed for fast query response and seamless scalability.
Key Learning Objectives:
Introduction to Course
Real-time Analytics Databases
What is Apache Druid?
Key Features of Druid
Technology
Use cases
When to use Druid
When not to use Druid
List of Company using Apache Druid
Installation of Apache Druid
Start up Druid services
Open the web console
Load data
Query data
Overview of the Druid Web Console
Architecture of Druid
Druid Servers
External Dependencies
Storage Design
Datasources and Segments
Segment Identifiers
Segments
Introduction to Segments
Segment File Structure
Data Loading in Druid
Load Data from Local Files
Load Data from URI
Load Data from Kafka (Prerequisite Introduction to Kafka)
Installing Single Node Kafka Cluster
Change the following to avoid Zookeeper Issue conflict
Load Data from Kafka
Query Data Explain Plan
Aggregate data with rollup
Frequently Asked Questions
Goals
What will you learn in this course:
- Understanding of basic architecture of Apache Druid
- Installing and Configuring Apache Druid
- Apache Druid Design, Ingestion, Data management, Querying
- Frequently asked Questions
Prerequisites
What are the prerequisites for this course?
- Basic knowledge of SQL is appreciated but if you don't have any knowledge on Database management its fine.
- Linux as Operating System Required
- 8 GB RAM is required
Curriculum
Check out the detailed breakdown of what’s inside the course
Introduction
9 Lectures
- Introduction to Course 08:57 08:57
- Real-time Analytics Databases 01:21 01:21
- What is Apache Druid? 02:15 02:15
- Key Features of Druid 05:18 05:18
- Technology 09:57 09:57
- Use cases 10:32 10:32
- When to use Druid 02:01 02:01
- When not to use Druid 01:21 01:21
- List of Company using Apache Druid 03:44 03:44
Installation of Apache Druid
7 Lectures
Architecture of Druid
5 Lectures
Storage Design
2 Lectures
Segments
2 Lectures
Data Loading in Druid
9 Lectures
Frequently Asked Questions
9 Lectures
Instructor Details
Bigdata Engineer
eCourse Certificate
Use your certificate to make a career change or to advance in your current career.
Our students work
with the Best
Related Video Courses
View MoreAnnual Membership
Become a valued member of Tutorials Point and enjoy unlimited access to our vast library of top-rated Video Courses
Subscribe nowOnline Certifications
Master prominent technologies at full length and become a valued certified professional.
Explore Now