BigData Hadoop Essentials

Koramangala 1st Block, Bangalore

9,990

1 Interested

About the Course


Big Data means a data explosion. To handle it, the industry today needs professionals with a mixed profile: people who can set up the environment as well as work with it and troubleshoot issues. After taking feedback from industry, our expert panel designed a course that prepares highly skilled, employable professionals.

Our training approach is simple: we pick scenarios derived from tasks commonly required in the workplace, and we build sound fundamentals alongside critical skills.

Topics Covered

Understanding Big Data
Understanding Big Data
    - 3V (Volume-Variety-Velocity) characteristics
    - Structured and Unstructured Data
    - Application and use cases of Big Data
Limitations of traditional large Scale systems
How a distributed way of computing is superior (cost and scale)
Opportunities and challenges with Big Data
HDFS (The Hadoop Distributed File System)
HDFS Overview and Architecture
    - Deployment Architecture
    - Name Node, Data Node and Checkpoint Node (aka Secondary Name Node)
    - Safe mode
    - Configuration files
    - HDFS Data Flows (Read vs Write)

How does HDFS address fault tolerance?
    - CRC checksum
    - Data replication
    - Rack awareness and Block placement policy
    - Small files problem

HDFS Interfaces
    - Command Line Interface
        - File System
        - Administrative
    - Web Interface


MapReduce - 1 (Theoretical Concepts)
MapReduce overview
    - Functional Programming paradigms
    - How to think in a MapReduce way?

MapReduce Architecture
    - Legacy MR vs Next Generation MapReduce (aka YARN/MRv2)
    - Slots vs Containers
    - Schedulers
    - Shuffling, Sorting
    - Hadoop Data Types
    - Input and Output Formats
    - Input Splits
    - Partitioning (Hash Partitioner vs Custom Partitioner)
    - Configuration files
    - Distributed Cache

MR Algorithm and Data Flow
    - Word Count

Alternatives to MR
    - BSP (Bulk Synchronous Parallel)
    - Ad hoc querying
    - Graph Computing Engines
MapReduce - 2 (Practice)
Developing, debugging and deploying MR programs
    - Standalone mode (in Eclipse)
    - Pseudo-distributed mode (as in the Big Data VM)
    - Fully distributed mode (as in Production)

MR API
    - Old and the new MR API
    - Java Client API
    - Hadoop data types and custom Writables/WritableComparables
    - Different input and output formats
    - Saving Binary Data using SequenceFiles and Avro Files

Hadoop Streaming (developing and debugging non-Java MR programs in Ruby and Python)

Optimization techniques
    - Speculative execution
    - Combiners
    - JVM Reuse
    - Compression

MR algorithms (Non-graph)
    - Sorting
    - Term Frequency – Inverse Document Frequency
    - Student Database
    - Max Temperature
    - Different ways of joining data
    - Word Co-Occurrence
MR algorithms (Graph)
    - PageRank
    - Inverted Index
Higher Level Abstractions for MR (Pig)
Introduction and Architecture
Different Modes of executing Pig constructs
Data Types
Dynamic invokers
Pig streaming
Macros
Pig Latin language constructs (LOAD, STORE, DUMP, SPLIT, etc.)
User Defined Functions
Use Cases
Higher Level Abstractions for MR (Hive)
Introduction and Architecture
Different Modes of executing Hive queries
Metastore Implementations
HiveQL (DDL & DML Operations)
External vs Managed Tables
Views
Partitions & Buckets
User Defined Functions
Transformations using non-Java languages
Use Cases

Comparison of Pig and Hive
NoSQL Databases - 1 (Theoretical Concepts)
NoSQL Concepts
    - Review of RDBMS
    - Need for NoSQL
    - Brewer's CAP Theorem
    - ACID vs BASE
    - Schema on Read vs. Schema on Write
    - Different levels of consistency
    - Bloom filters

Different types of NoSQL databases
    - Key Value
    - Columnar
    - Document
    - Graph

Columnar database concepts

NoSQL Databases - 2 (Practice)
HBase Architecture
    - Master and the Region Server
    - Catalog tables (ROOT and META)
    - Major and Minor compaction
    - Configuration files
    - HBase vs Cassandra

Interfaces to HBase (for DDL and DML operations)
    - Java API
        - Client API
        - Filters
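
To give a flavour of the "Word Count" exercise listed under MR Algorithm and Data Flow, here is a minimal sketch in plain Python. It only simulates the map, shuffle/sort, and reduce phases in memory; it does not use Hadoop, and the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) are illustrative assumptions rather than part of any Hadoop API or of the course material.

```python
from collections import defaultdict


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle/sort: group values by key and order the keys,
    roughly what the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(sorted(grouped.items()))


def reduce_phase(word, counts):
    """Reducer: sum all the 1s emitted for a single word."""
    return word, sum(counts)


def word_count(lines):
    """Run the three phases in sequence over an in-memory list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(word, counts) for word, counts in grouped.items())


if __name__ == "__main__":
    print(word_count(["big data big", "data hadoop"]))
```

On a real cluster the mapper and reducer would run as separate tasks on different nodes, and the grouping step would be handled by the framework rather than by a local dictionary.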

Who should attend

Developers, architects, and administrators with a passion to learn and enhance their skills to become market-ready.

Pre-requisites

Anyone with an understanding of XML and basic familiarity with Linux, Unix, Windows, or Mac OS.

What you need to bring

Notepad and Pen.

Key Takeaways

Understand Big Data challenges and the frameworks that address them.

In-depth understanding of Hadoop and the Hadoop ecosystem.
Hands-on, practical understanding of recent Hadoop versions and commercial distributions such as Cloudera and Hortonworks.

Date and Time

Not decided yet.

About the Trainer

Niranjan

Masters in Computer Application


Certified consultant and mentor with 15 years of experience in Big Data Hadoop, MongoDB, Cassandra, and SOA.
Course Id: 15285