This course is best suited for those who want to gain Big Data & Hadoop skills to:
- Store & process huge data sets
- Build, deploy and run MapReduce applications on Hadoop clusters
- Importing & exporting data from various sources
- Develop distributed code using the Java/Python programming language
- Doing analytics using MapReduce, Pig, Hive, HBase, Mahout & other tools
Course Outline
| Introduction to Hadoop and the Hadoop Ecosystem |
|---|
|
| Hadoop Architecture and HDFS |
|
| Deep dive into HDFS |
|
| Deep dive into YARN & Map Reduce |
|
| Working with Hive |
|
| Introduction to Pig |
|
| Zookeeper |
|
| NoSQL & HBase |
|
| Flume, Sqoop, Oozie |
|
| Mahout |
|