UrbanPro
true

Hadoop Development Training

LIVE
60 Hours

Course offered by Uplatz

0 review

1. Introduction

1.1 Big Data Introduction

What is Big Data

Data Analytics

Bigdata Challenges

Technologies supported by big data

1.2 Hadoop Introduction

What is Hadoop?

History of Hadoop

Basic Concepts

Future of Hadoop

The Hadoop Distributed File System

Anatomy of a Hadoop Cluster

Breakthroughs of Hadoop

Hadoop Distributions:

Apache Hadoop

Cloudera Hadoop

Horton Networks Hadoop

MapR Hadoop

2. Hadoop Daemon Processes

Name Node

DataNode

Secondary Name Node/High Availability

Job Tracker/Resource Manager

Task Tracker/Node Manager

3. HDFS (Hadoop Distributed File System)

Blocks and Input Splits

Data Replication

www.uplatz.com

Leading Marketplace for IT and Certification Courses

Hadoop Rack Awareness

Cluster Architecture and Block Placement

Accessing HDFS

JAVA Approach

CLI Approach

4. Hadoop Installation Modes and HDFS

Local Mode

Pseudo-distributed Mode

Fully distributed mode

Pseudo Mode installation and configurations

HDFS basic file operations

5. Hadoop Developer Tasks

5.1 Writing a MapReduce Program

Basic API Concepts

The Driver Class

The Mapper Class

The Reducer Class

The Combiner Class

The Partitioner Class

Examining a Sample MapReduce Program with several examples

Hadoop's Streaming API

Examining a Sample MapReduce Program with several examples

Running your MapReduce program on Hadoop 1.0

Running your MapReduce Program on Hadoop 2.0

5.2 Performing several Hadoop jobs

Sequence Files

Record Reader

Record Writer

Role of Reporter

Output Collector

www.uplatz.com

Leading Marketplace for IT and Certification Courses

Processing XML files

Counters

Directly Accessing HDFS

ToolRunner

Using The Distributed Cache

5.3 Advanced MapReduce Programming

A Recap of the MapReduce Flow

The Secondary Sort

Customized Input Formats and Output Formats

Map-Side Joins

Reduce-Side Joins

5.4 Practical Development Tips and Techniques

Strategies for Debugging MapReduce Code

Testing MapReduce Code Locally by Using LocalJobRunner

Testing with MRUnit

Writing and Viewing Log Files

Retrieving Job Information with Counters

Reusing Objects

5.5 Data Input and Output

Creating Custom Writable and Writable-Comparable Implementations

Saving Binary Data Using SequenceFile and Avro Data Files

Issues to Consider When Using File Compression

5.6 Tuning for Performance in MapReduce

Reducing network traffic with Combiner, Partitioner classes

Reducing the amount of input data using compression

Reusing the JVM

Running with speculative execution

Input Formatters

Output Formatters

Schedulers

www.uplatz.com

Leading Marketplace for IT and Certification Courses

FIFO schedulers

FAIR Schedulers

CAPACITY Schedulers

5.7 YARN

What is YARN

How YARN Works

Advantages of YARN

6. Hadoop Ecosystems

6.1 PIG

PIG concepts

Install and configure PIG on a cluster

PIG Vs MapReduce and SQL

PIG Vs HIVE

Write sample PIG Latin scripts

Modes of running PIG

Programming in Eclipse

Running as Java program

PIG UDFs

PIG Macros

Accessing Hive from PIG

6.2 HIVE

Hive concepts

Hive architecture

Installing and configuring HIVE

Managed tables and external tables

Partitioned tables

Bucketed tables

Complex data types

Joins in HIVE

Multiple ways of inserting data in HIVE tables

CTAS, views, alter tables

www.uplatz.com

Leading Marketplace for IT and Certification Courses

User-defined functions in HIVE

Hive UDF

Hive UDAF

Hive UDTF

6.3 SQOOP

SQOOP concepts

SQOOP architecture

Install and configure SQOOP

Connecting to RDBMS

Internal mechanism of import/export

Import data from Oracle/Mysql to HIVE

Export data to Oracle/Mysql

Other SQOOP commands

6.4 HBASE

HBASE concepts

ZOOKEEPER concepts

HBASE and Region server architecture

File storage architecture

NoSQL vs SQL

Defining Schema and basic operations

DDLs

DMLs

HBASE use cases

Access data stored in HBASE using clients like CLI, and Java

Map Reduce client to access the HBASE data

HBASE admin tasks

6.5 OOZIE

OOZIE concepts

OOZIE architecture

Workflow engine

Job coordinator

Install and configuring OOZIE

www.uplatz.com

Leading Marketplace for IT and Certification Courses

HPDL and XML for creating Workflows

Nodes in OOZIE

Action nodes

Control nodes

Accessing OOZIE jobs through CLI, and web console

Develop sample workflows in OOZIE on various Hadoop distributions

Run HDFS file operations

Run MapReduce programs

Run PIG scripts

Run HIVE jobs

Run SQOOP Imports/Exports

6.6 FLUME

FLUME Concepts

FLUME architecture

Installation and configurations

Executing FLUME jobs

6.7 IMPALA

What is Impala

How Impala Works

Impala Vs Hive

Impala's shortcomings

Impala Hands-on

6.8 ZOOKEEPER

ZOOKEEPER Concepts

Zookeeper as a service

Zookeeper in production

7. Integrations

Mapreduce and HIVE integration

Mapreduce and HBASE integration

www.uplatz.com

Leading Marketplace for IT and Certification Courses

Java and HIVE integration

HIVE - HBASE Integration

SAS – HADOOP

8. Spark

Introduction to Scala

Functional Programming in Scala

Working with Spark RDDs

9. Hadoop Administrative Tasks:

Setup Hadoop cluster: Apache, Cloudera and VMware

Install and configure Apache Hadoop on a multi-node cluster

Install and configure Cloudera Hadoop distribution in fully distributed mode

Install and configure different ecosystems

Basic Administrative tasks

10. Course Deliverables

Workshop style coaching

Interactive approach

Course material

Hands-on practice exercises for each topic

Quiz at the end of each major topic

Tips and techniques on Cloudera Certification Examination

Linux concepts and basic commands

On-Demand Services

Mock interviews for each individual will be conducted on a need basis

SQL basics on need basis

Core Java concepts on need basis

Resume preparation and guidance

Interview questions

About the Trainer

Avg Rating

0 Reviews

0 Students

225 Courses

Uplatz

Masters

10 years of industry experience

Students also enrolled in these courses

LIVE
346 reviews
40 Hours
30,000 Group Class (max 8)
550,000 1-on-1 Class

Course offered by Amit Raj

237 reviews
LIVE
9 reviews
50 Hours

Course offered by Prinshu Verma

6 reviews
LIVE
346 reviews
30 Hours
24,000 Group Class (max 6)
45,000 1-on-1 Class

Course offered by Amit Raj

237 reviews

Tutor has not setup batch timings yet. Book a Demo to talk to the Tutor.

Different batches available for this Course

No Reviews yet!

Reply to 's review

Enter your reply*

1500/1500

Please enter your reply

Your reply should contain a minimum of 10 characters

Your reply has been successfully submitted.

Certified

The Certified badge indicates that the Tutor has received good amount of positive feedback from Students.

Different batches available for this Course

tickYou have successfully registered

Hadoop Development Training by Uplatz

Uplatz Trainer picture
LIVE

Class
starts in

00

Days

01

Hour

01

Min

01

Sec

Select One

Register Now

Do you want to Register for this Free class?

Yes, Register No, not right now

Tell us a little more about yourself

Hadoop Development Training by Uplatz

Uplatz Trainer picture
LIVE

Class
starts in

00

Days

01

Hour

01

Min

01

Sec

Please enter Student name

Please enter your email address.

Please enter phone number.

Verify Your Mobile Number

Please verify your Mobile Number to book this free class.

Update

Please enter 10 digit phone number.

Please enter your phone number.

Please Enter a valid Mobile Number

This number is already in use.

Resend

Please enter OTP.

Or, give a missed call and get your number verified

080-66-0844-42

This website uses cookies

We use cookies to improve user experience. Choose what cookies you allow us to use. You can read more about our Cookie Policy in our Privacy Policy

Accept All
Decline All

UrbanPro.com is India's largest network of most trusted tutors and institutes. Over 55 lakh students rely on UrbanPro.com, to fulfill their learning requirements across 1,000+ categories. Using UrbanPro.com, parents, and students can compare multiple Tutors and Institutes and choose the one that best suits their requirements. More than 7.5 lakh verified Tutors and Institutes are helping millions of students every day and growing their tutoring business on UrbanPro.com. Whether you are looking for a tutor to learn mathematics, a German language trainer to brush up your German language skills or an institute to upgrade your IT skills, we have got the best selection of Tutors and Training Institutes for you. Read more