Tutorialspoint

#May Motivation Use code MAY10 for extra 10% off

Hadoop Administration: An easy way to become a Hadoop Admin

person icon Shrikant Ahire

4.1

Hadoop Administration: An easy way to become a Hadoop Admin

Hadoop Administration: Online Training for Beginners to Professional

updated on icon Updated on May, 2024

language icon Language - English

person icon Shrikant Ahire

category icon IT & Software,Network & Security,Hadoop

Lectures -24

Resources -29

Duration -8 hours

4.1

price-loader

30-days Money-Back Guarantee

Training 5 or more people ?

Get your team access to 10000+ top Tutorials Point courses anytime, anywhere.

Course Description

Module 0: Giveaways

  • Linux / UNIX Course
  • 100 Solved Queries of Hadoop Administration Day to Day-to-Day Activities.
  • Guidelines to create an AWS account.

Module 1: Introduction to Hadoop Administration

  • Understanding Big Data
  • Common big data domain scenarios
  • Analyze Limitations of Traditional Solutions
  • Roles and Responsibility
  • Case Studies

Module 2: Hadoop Architecture And MapReduce

  • Introduction to Hadoop
  • Hadoop Architecture
  • Difference between Hadoop 1.x, Hadoop 2.x and Hadoop 3.x
  • Hadoop 1.x Ecosystem tools and Core System
  • Hadoop 2.x Ecosystem tools and Core System
  • HDFS File System
  1.  Introduction of NameNode, DataNode, and Secondary NameNode
  2.  Anatomy of Write and Read
  3.  Replication Pipeline
  • YARN Framework
  1.  Role and function of YARN in Hadoop
  2.  Mapreduce Theory

§ Cluster testing using MapReduce Code in the YARN Environment

Module 3: Cluster Planning

  • Types of Rack
  • General principle of selecting CPU Memory and hardware
  • Understand Hardware Consideration
  • Machines requirement as per the daemons
  • Learn Best Practices for selecting hardware

Know the network Consideration

Module 4: Hadoop Cluster Administration, Backup, Recovery and Maintenance

  • SafeMode
  • Decommissioning, Commissioning, and Re-Commissioning of Node
  • Trash Functionality
  • Distcp
  • Rack Awareness
  • HDFS / Hadoop Balancer

Module 5: Managing Resources and Scheduling

  • Scheduler: Explanation and demo
  1.  Capacity Scheduler

Module 6: HDFS Federation and High Availability

  • Understand the YARN framework
  • Understand the Federation
  • Understand High Availability
  • High Availability Implementation Using Quorum Journal Manager

Module 7: Cloudera Setup and Performance Tuning

  • Cloudera Distribution Hadoop
  • Cloudera Features
  • Cloudera Manager Editions
  • Cloudera Manager Web UI
  • CDH Installation

Module 8: Security

  • Basics of Hadoop Platform Security
  • Securing the Platform
  • Understand Kerberos

Configuring Kerberos on Cloudera Hadoop Cluster using LDAP authentication

Who this course is for:

  • Linux / Unix Administrator, Data analysts, and database administrators who are curious about the Hadoop Administration part and how it relates to their work.
  • Hadoop Developers and Java Developers who want to be a Hadoop Administrator.
  • Software engineers and programmers who want to understand the administration of the larger Hadoop ecosystem.

Goals

What will you learn in this course:

  • Create a Hadoop Single node cluster on VM-Ware.

  • Create a Hadoop Multi-node cluster on the AWS platform and know how to submit jobs on the Hadoop Cluster.

  • Learn to plan the Hadoop Cluster.

  • Learn to Commission, Decommission, and Recommission machines

  • Learn to take back-up from cluster using Distcp Command, recover and maintain Hadoop Cluster.

  • Learn how to enable a capacity scheduler in the Hadoop Cluster.

  • Enable NameNode High availability configuration on Hadoop Cluster.

  • Learn to install Hadoop using Cloudera Manager and other administrative activities

  • Enable Kerberos security on Cloudera Hadoop Cluster using LDAP connection with Active Directory.

  • How to Monitor a Hadoop Cluster

Prerequisites

What are the prerequisites for this course?

  • It is great if a student knows Linux commands but if not, he/she can learn the commands from the "Linux Commands" pdf which I am giving as a giveaway.

  • Students should have an AWS account. If a student does not have one, then the student can create an account using the "Guidelines to Create AWS Free Tier Account" PDF which I am giving as a part of the giveaway.

  • To create a Single node cluster on VM-Ware students must have a configuration that can support VM-Ware of 4 GB RAM, 20 GB HDD, and 2 CPU.

  • Students need head phones to listen to audio clearly.

  • You will need access to a PC running 64-bit Windows, MacOS, or Linux with an Internet connection.

Hadoop Administration: An easy way to become a Hadoop Admin

Curriculum

Check out the detailed breakdown of what’s inside the course

Mod0-GiveAways
2 Lectures
  • play icon GiveAways
  • play icon ShriHadoopSoftware
Mod1-Introduction of Hadoop Administration
1 Lectures
Tutorialspoint
Mod2-Hadoop Architecture And MapReduce
5 Lectures
Tutorialspoint
Mod3-Hadoop Cluster Planning And Management
2 Lectures
Tutorialspoint
Mod4-Hadoop Cluster Administration, Backup, Recovery And Maintenance
6 Lectures
Tutorialspoint
Mod5-Managing Resources And Scheduling
2 Lectures
Tutorialspoint
Mod6-HDFS Federation and High Availability
2 Lectures
Tutorialspoint
Mod7-Cloudera Setup And Performance Tuning
2 Lectures
Tutorialspoint
Mod8-Security : Kerberos
2 Lectures
Tutorialspoint

Instructor Details

user profile image

Shrikant Ahire

e


Course Certificate

Use your certificate to make a career change or to advance in your current career.

sample Tutorialspoint certificate

Our students work
with the Best

Related Video Courses

View More

Annual Membership

Become a valued member of Tutorials Point and enjoy unlimited access to our vast library of top-rated Video Courses

Subscribe now
Annual Membership

Online Certifications

Master prominent technologies at full length and become a valued certified professional.

Explore Now
Online Certifications

Talk to us

1800-202-0515