Hadoop Training

Jayanagar, Bangalore

16,000


About the Course


Apache Hadoop, the open-source data management software that helps organizations analyze massive volumes of structured and unstructured data, is a very hot topic across the tech industry. Used by big-name websites such as eBay, Facebook, and Yahoo, Hadoop is tagged by many as one of the most desired tech skills for 2012 and the coming years, along with cloud computing.

Topics Covered

VirtualBox/VMware

Basics
Installations
Backups
Snapshots

Linux

Basics
Installations
Commands


Hadoop
Why Hadoop?
Scaling
Distributed Framework
Hadoop vs. RDBMS
Brief history of Hadoop

Setting up Hadoop
Pseudo mode
Cluster mode
IPv6
Installation of Java and Hadoop
Configuration of Hadoop
Hadoop Processes (NN, SNN, JT, DN, TT)
Temporary directory
UI
Common errors when running a Hadoop cluster, and their solutions

HDFS: Hadoop Distributed File System
HDFS Design and Architecture
HDFS Concepts
Interacting with HDFS using the command line
Interacting with HDFS using Java APIs
Data flow
Blocks
Replica

Hadoop Processes
Name node
Secondary name node
Job tracker
Task tracker
Data node

MapReduce
Developing a MapReduce Application
Phases in the MapReduce Framework
MapReduce Input and Output Formats
Advanced Concepts
Sample Applications
Combiner
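
To make the phases and the combiner listed above concrete, here is a minimal sketch in plain Python. It only simulates the flow on in-memory lists; it does not use the Hadoop API, and the function names (map_phase, combine, shuffle, reduce_phase, word_count) are hypothetical names chosen for this illustration.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Mapper: emit (word, 1) for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def combine(pairs):
    """Combiner: pre-sum counts from one input split before the shuffle."""
    local = defaultdict(int)
    for word, count in pairs:
        local[word] += count
    return list(local.items())


def shuffle(pairs):
    """Shuffle/sort: group all values by key, with keys in sorted order."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return sorted(groups.items())


def reduce_phase(key, values):
    """Reducer: sum the partial counts for one word."""
    return key, sum(values)


def word_count(splits):
    """Run the phases over a list of input splits (each a list of lines)."""
    combined = []
    for split in splits:
        mapped = chain.from_iterable(map_phase(line) for line in split)
        combined.extend(combine(mapped))
    return dict(reduce_phase(k, vs) for k, vs in shuffle(combined))


if __name__ == "__main__":
    splits = [["big data big cluster"], ["data node", "big data"]]
    print(word_count(splits))
```

The combiner reduces how many pairs cross the shuffle: the first split sends one ("big", 2) pair instead of two ("big", 1) pairs, and the final counts are unchanged because addition is associative and commutative.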

Joining data sets in MapReduce jobs
Map-side join
Reduce-side join
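
The difference between the two join styles above can be sketched in plain Python. This is a simulation under stated assumptions, not Hadoop code: the customer and order records are made-up sample data, the function names are hypothetical, and the map-side variant shown is the common "small table held in memory" form rather than a merge of pre-sorted inputs.

```python
from collections import defaultdict


def reduce_side_join(customers, orders):
    """Tag records by source, group them by key, then join per key."""
    groups = defaultdict(lambda: {"customer": [], "order": []})
    for cust_id, name in customers:      # "map" over the first data set
        groups[cust_id]["customer"].append(name)
    for cust_id, item in orders:         # "map" over the second data set
        groups[cust_id]["order"].append(item)
    joined = []
    for cust_id in sorted(groups):       # one "reduce" call per key
        for name in groups[cust_id]["customer"]:
            for item in groups[cust_id]["order"]:
                joined.append((cust_id, name, item))
    return joined


def map_side_join(customers, orders):
    """Hold the small data set in memory and join while mapping the large one."""
    lookup = dict(customers)             # small side, available to every mapper
    return [(cid, lookup[cid], item) for cid, item in orders if cid in lookup]


if __name__ == "__main__":
    customers = [(1, "Asha"), (2, "Ravi")]
    orders = [(1, "book"), (2, "pen"), (1, "lamp"), (3, "cup")]
    print(reduce_side_join(customers, orders))
    print(map_side_join(customers, orders))
```

The reduce-side version works for two large inputs but moves every record through the shuffle; the map-side version skips the shuffle entirely but assumes one input is small enough to fit in memory.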

MapReduce customization
Hadoop Programming Languages:

HIVE
Introduction
Installation and Configuration
Interacting with HDFS using HIVE
MapReduce Programs through HIVE
HIVE Commands
Loading, Filtering, Grouping…
Data types, Operators…
Joins, Groups…
Sample programs in HIVE

PIG
Basics
Installation and Configurations
Commands…

NoSQL Database Concepts

Introduction
The Motivation for Hadoop
Problems with traditional large-scale systems
Requirements for a new approach

Hadoop: Basic Concepts
An Overview of Hadoop
The Hadoop Distributed File System
Hands-On Exercise
How MapReduce Works
Hands-On Exercise
Anatomy of a Hadoop Cluster
Other Hadoop Ecosystem Components

Writing a MapReduce Program
The MapReduce Flow
Examining a Sample MapReduce Program
Basic MapReduce API Concepts
The Driver Code
The Mapper
The Reducer
Hadoop’s Streaming API
Using Eclipse for Rapid Development
Hands-on exercise
The New MapReduce API

Common MapReduce Algorithms
Sorting and Searching
Indexing
Machine Learning With Mahout
Term Frequency – Inverse Document Frequency
Word Co-Occurrence
Hands-On Exercise

Using HBase
What is HBase?

HBase Architecture
HBase API

Managing large data sets with HBase
Using HBase in Hadoop applications
Integrating Hive with HBase
Sqoop Exports and Imports
Hands-on exercise

Who should attend

Java and non-Java professionals, career changers, professionals working in other languages, and domain experts who want to pursue a career as a Data Scientist or Data Architect.

Pre-requisites

Very basic knowledge of Java

What you need to bring

Laptop

Key Takeaways

The ability to work as a Hadoop Developer / Administrator
100% Money Back Guarantee

About the Trainer

Multiple


All our trainers are Cisco certified and well versed in teaching.

Course Id: 15357