Signup as a Tutor

As a tutor you can connect with more than a million students and grow your network.

Apache Spark

No Reviews Yet

Course type: Online Instructor led Course

Platform: Zoom

Course ID: 43944

Course type: Online Instructor led Course

Platform: Zoom

Students Interested 0 (Seats Left 0)

No Reviews Yet

About the Course

This course will train you to -



  • Use the core Spark APIs to operate on data

  • Articulate and implement typical use cases for Spark

  • Build data pipelines and query large data sets using Spark SQL and DataFrames

  • Analyze Spark jobs using the administration UIs inside Databricks

  • Create Structured Streaming jobs

  • Work with relational data using the GraphFrames APIs

  • Understand how a Machine Learning pipeline works

  • Understand the basics of Spark's internals


What will you learn in Spark?



  • Spark Overview

  • In-depth discussion of Spark SQL and DataFrames, including:


    • The DataFrames/Datasets API

    • Spark SQL

    • Data Aggregation

    • Column Operations

    • The Functions API: date/time, string manipulation, aggregation

    • Joins & Broadcasting

    • User Defined Functions

    • Caching and caching storage levels

    • Use of the Spark UI to analyze behavior and performance


  • In-depth discussion of Spark internals


    • Cluster Architecture

    • The Catalyst query optimizer

    • The Tungsten in-memory data format

    • How Spark schedules and executes jobs and tasks

    • Shuffling, shuffle files, and performance

    • How various data sources are partitioned

    • How Spark handles data reads and writes


  • Spark Structured Streaming


    • Sources and sinks

    • Structured Streaming APIs

    • Windowing & Aggregation

    • Checkpointing & Watermarking

    • Reliability and Fault Tolerance

    • Kafka Integration


  • Overview of Sparkâ??s MLlib Pipeline API for Machine Learning


    • Transformer/Estimator/Pipeline API

    • Perform feature preprocessing

    • Evaluate and apply ML models


  • Graph processing with GraphFrames


    • Transforming DataFrames into a graph

    • Perform graph analysis, including Label Propagation, PageRank, and ShortestPaths.


Date and Time

Not decided yet.

About the Trainer

Biswanath Banerjee picture

3 Avg Rating

8 Reviews

11 Students

7 Courses

Biswanath Banerjee

BTech IIT Roorkee

20 years of enriched technology experience in organizations - IBM, MBT (now Tech Mahindra), Tavant Technologies, Network Programs etc.
Highly experienced technical leader. Expert in Big Data Hadoop, PIG, Flume, Scoop Hive, Zookeeper, Apache Spark, Scala, Java stack
Solid experience in architecting solutions and programming experience in Scala, Python and core Java.
Design and development of Hadoop clusters from 60 to 800 nodes clusters, MapReduce jobs, PIG, Spark streaming data.
Undertaken and executed projects in Apache Spark and Scala.
Corporate Trainer in Big Data Hadoop, Scala and Spark, Agile Project Management, Core Java, Design Patterns in Java, Big Data architecture
Education: BTech (Mech) IIT Roorkee
Certifications: IBM Certified Senior Project Manager
Cloudera Certified Hadoop Developer
Certified Big Data Hadoop and Spark developer
Skills and expertise:
TECHNICAL EXPERTISE
Technology Stack: Core Java | Design patterns | Datastructures
Big Data technologies: Hadoop |MapReduce| Spark | Scala | HBase | Pig | Hive | Zookeeper | Scoop| Flume
Others:Java/J2EE | Python | DJango | Spark | Scala | MongoDB | NodeJS | Spring | Framework | MS Project | Rational Tools | CVS | Agile Scrum
@Corporate Trainer
Trained and mentored more than 500 people on Hadoop and Spark projects.
Apache Spark Developer Training at GIFAmpTechnologies
- 5 days developer training
- Spark, Spark Streaming and Spark SQL,Scala
- Real world use case project
- 12 people batch
Advanced Hadoop Data training at Genisys, Bangalore
-5 days developer training
-Advanced topics like YARN, Hadoop Federation, Spark
-Map/Reduce, HDFS, Hive, Pig Hands on
-Real world use case project
-20 people batch
Hadoop Data Training at Tavant Technologies, Bangalore
- 3 days Hadoop training
- Map/Reduce, HDFS, Hive, Pig Hands on
- 15 people batch
Other trainings
- 5 days Hadoop Developer training at GifAmp Technologies, Pune
- Hadoop Developer training at MindTree, Bangalore,
- Hadoop Administration training at NucleusMind Technologies, Noida
- Hadoop Developer public trainings at Idea labs
- Hadoop Developer training at L&T
- Apache Spark and Scala Training at Hexaware
- 5 days Hadoop and Spark training at a leading Enginering college in Kolkata

Reviews

No reviews currently Be the First to Review

Discussions

Students Interested 0 (Seats Left 0)

Post your requirement and let us connect you with best possible matches for Big Data Training Post your requirement now

Enquire

Submit your enquiry for Apache Spark

Please enter valid question or comment

Please enter your name.

Please enter valid Phone Number

Please enter the Pin Code.

Please check the fields again.

By submitting, you agree to our Terms of use and Privacy Policy

Connect With Biswanath

You have reached a limit!

We only allow 20 Tutor contacts under a category. Please send us an email at support@urbanpro.com for contacting more Tutors.

You Already have an UrbanPro Account

Please Login to continue

Please Enter valid Email or Phone Number

Please Enter your Password

Please enter the OTP sent to your registered mobile number.

Please Enter valid Password or OTP

Forgot Password? Resend OTP OTP Sent

Sorry, we were not able to find a user with that username and password.

We have sent you an OTP to your register email address and registered number. Please enter OTP as Password to continue

Further Information Received

Thank you for providing more information about your requirement. You will hear back soon from the trainer

UrbanPro.com is India's largest network of most trusted tutors and institutes. Over 25 lakh students rely on UrbanPro.com, to fulfill their learning requirements across 1,000+ categories. Using UrbanPro.com, parents, and students can compare multiple Tutors and Institutes and choose the one that best suits their requirements. More than 6.5 lakh verified Tutors and Institutes are helping millions of students every day and growing their tutoring business on UrbanPro.com. Whether you are looking for a tutor to learn mathematics, a German language trainer to brush up your German language skills or an institute to upgrade your IT skills, we have got the best selection of Tutors and Training Institutes for you. Read more