1,093 Student Reviews
Objectives: With 18.5 years of experience as a Technical Lead and Architect, I am passionate about leveraging my expertise in Databricks, Data Build Tool (dbt), Spark, Confluent Kafka, Data Lake, Lakehouse and cloud solutions to drive innovation and efficiency. With a solid background in data architecture and IT infrastructure, I aim to contribute to the vision of your company by providing robust technical solutions that align with strategic business goals. My goal is to enhance data-driven decision-making processes, optimize big data pipelines and implement secure and scalable cloud architectures that propel the organization forward in the ever-evolving technical landscape.

Certification & Achievements:
1. Confluent Certified Administrator for Apache Kafka (expiry: July 2026). Credential: Amit Raj, Confluent.
2. Databricks Certified Data Engineer Professional (expiry: Aug 2026). Credential: Amit Raj, Databricks Badges.
3. Databricks Accredited Lakehouse Fundamentals (expiry: June 2025). Credential: Amit Raj, Databricks Badges.
4. Microsoft Certified: Azure Administrator Associate (AZ-104). Microsoft certification ID: 1100039942. Expires on: August 3, 2025. Credentials: AmitRaj-8869, Microsoft Learn.
5. Microsoft Certified: Azure Security Engineer Associate (AZ-500). Microsoft certification ID: 1100039942. Expires on: August 6, 2025. Credentials: AmitRaj-8869, Microsoft Learn.
6. Recognition certificate from Fidelity for designing global solutions for data exchange.
7. Achievement medal from DIB (client) in appreciation of designing the event-based enterprise architecture and contributing to EventHub.

SUMMARY
• 18.5+ years of overall experience in application design, development and deployment of Hadoop ecosystem/Java/J2EE systems, with good exposure to enterprise architecture.
• 9.2 years of relevant experience in Big Data technologies, working with multiple clients and gaining domain knowledge.
• Experienced in Cassandra data modelling, cluster setup and data management.
• Experienced in working with Spark SQL, Spark Structured Streaming and MLlib to process and analyse data.
• Experienced in designing solutions using Spark Streaming and Kafka Streams for payment gateway/point-of-sale events.
• Individual contribution (Kafka Architect): delivered the UAT and PROD Kafka clusters within the timeline using Cloudera 6.x, CSP 2.0.
• Implemented a unified data platform to gather data from different sources using Kafka producers and consumers in Scala and Java.
• Solid background in object-oriented analysis & design, UML and various design patterns.
• Worked with Azure cloud (Blob, EventHub), Kubernetes and Docker with Spark, Scala, Schema Registry and Avro schemas on a home security application for Honeywell.
• Implemented KSQL, KTable and KStream using Confluent Kafka along with Kafka Connect.
• Hands-on with Databricks: Databricks clusters, Data Lakehouse, Delta Lake, DBFS; explore, analyze, clean, transform and load data using Databricks.
• Experience with Azure: Azure Synapse Analytics, ADLS, ADF, Cosmos DB, Azure Functions, Stream Analytics, Power BI.
• Experience with SQL and NoSQL databases including MySQL, Oracle, Cassandra, PostgreSQL and Bigtable.
• Experience building and optimizing big data pipelines.
• Experience with Azure DevOps, CI/CD pipelines, Kubernetes and Docker.
• Motivated Technical Architect with 5 years of progressive experience.
• Experience with AWS (EC2, S3).
• Experience with Snowflake to design a data lake and load data from multiple sources into the Snowflake database.
• Effectively manages assignments and team members.
• Dedicated to self-development to provide expectation-exceeding service. Customer-focused, successfully contributing to company profits by improving team efficiency and productivity.
• Utilizes excellent organizational skills to enhance efficiency and lead teams to achieve outstanding delivery.

SKILLS
Database architecture, Database architecture development, Data Architecture, Big Data ETL, Technical solution development, Azure data solutions, Data insight provision, Technical guidance, IT Architecture, Technical solutions, Big data frameworks

Technical Skills: Hortonworks 2.5, Cloudera 5/6, Apache Hadoop 2/3, Spark 2/3, Apache Kafka, Confluent Kafka, Hive 2/3, Impala, Sqoop, Oozie, ZooKeeper, Snowflake, Data Build Tool (dbt), HBase, Apache Cassandra/DataStax Cassandra, Databricks, Azure Cloud, AWS cloud, Talend, Airflow, etc.
Programming Languages: Python, Scala & Java
Other Tools: Kibana, Logstash, Elasticsearch (ELK).

PROJECT UNDERTAKEN:
Project: Implementation of Data Warehouse and reporting platform
Roles: Databricks Architect & Engineer
Team: 12 members
Technical Skills: Azure Cloud, Azure Data Factory (ADF), ADLS, Databricks, Spark 3.x, Python, Scala 2.15, DB2, Oracle 12g, Azure SQL

My Contribution
Databricks Infrastructure Solution:
- Configured unified data access control using Unity Catalog for the E1 & BY systems: granted specific sets of permissions, such as read-only or write-only, to specific groups of users on one or more Delta tables, down to the row or column level, where those tables can contain Personally Identifiable Information (PII).
- Provided data governance from a centralized place: administer access to the data (TAI) and audit that access.
- Applied data lineage for E1 & BY tables with look-up tables using Unity Catalog.
- Implemented Data sharing protocol to apply secure data sharing downstream using Unity Catalog and
- Designed the Unity Catalog architecture so that it can be linked to multiple Databricks workspaces: DEV, UAT and PROD environments.
- Created the metastore for Unity Catalog.
- Applied Unity Catalog user management for the TAI Lakehouse project: users, groups and service principals, and the permissions they hold.
- Configured Databricks clusters with Spark 3.x for DEV, UAT & PROD for the TAI E1 & BY systems.
- Designed and applied a medallion architecture: set up a Data Lakehouse with Bronze, Silver and Gold storage layers using Azure Data Lake Gen2.

Azure Cloud Infra and Security:
- Installed the self-hosted integration runtime for DB2 on DEV, UAT & PROD and for the Oracle on-prem cluster on the source system.
- Installed the Azure virtual network managed IR on DEV, UAT & PROD.
- Installed the Db2 connector on DEV, UAT & PROD.
- Created linked services lnk_BY_Azure_SQL, lnk_E1_Azure_SQL, lnk_Db2_E1.
- Installed and configured Azure Key Vault and added all credentials for Azure SQL, ADLS, Databricks, users, global users and linked services to it on DEV, UAT & PROD.
- Created 3 nodes for the DEV cluster and 5 nodes for the PROD cluster to migrate data.
- Set up and configured Azure Active Directory to provide team access policies for the Databricks cluster, Azure Data Factory, Azure SQL and the Azure Data Lakehouse.
- Coordinated with the TAI client and the Microsoft support team to resolve throughput issues.

As Azure & Databricks Data Engineer:
- Developed the most critical data ingestion pipelines using Azure Data Factory (ADF) for E1, migrating 12.8 TB across 120 tables from Db2 to the ADLS RAW layer as Parquet files. Many of the large tables hold 2-4 TB of data with 400 to 800 million records.
- Built initial and incremental migration pipelines for both the E1 and BY sources with a watermark based on Julian date & time.
- Designed the Audit table (process log) and Control table (system) to drive dynamic pipelines and capture audit information for master and child pipelines.
- Designed the architecture solution to handle deletes for PKSNAPSHOT on E1 & BY.
- Built a dynamic delete pipeline using ADF (load PKTBL) and Databricks PySpark, running at Daily, Weekly, OnDemand and Yearly frequency, to delete records from the target (Analytics layer, i.e. Gold layer) based on the source system delete column and delete table.
- Build transformation using Databricks Spark with Scala for E1 to
- Applied a transformation with a lookup table and transformed to the Silver layer.
- Built transformations into the Analytics layer (Gold) using Databricks Spark & Scala.
- Implemented UPSERT using Spark Structured Streaming at 5-minute intervals on the Analytics layer.
- Designed the pipeline architecture for master and child pipelines with distinct activity ID, pipeline ID and master pipeline ID, each with a different pipeline run ID, to ensure a smooth transition audit.
- Built logic in PySpark on Databricks, applied on DEV, UAT & PROD, that checks a counter for whether the master pipeline is IN PROGRESS or not, so that pipeline executions do not overlap.
- Passed pipeline parameters to insert or update the Audit/Control tables using Databricks PySpark.
- Monitored performance in DEV & PROD and worked with the team to reduce run time.
- Milestone: achieve a 10-minute SLA for incremental load on E1 & BY (end-to-end completion time).
- Milestone: achieved 1.53.45 hrs to load 400 million records (2.3 TB) to RAW as Parquet files using the ADF pipeline.
- Interact with Azure DevOps engineer to build a CI/CD pipeline for DEV, UAT & PROD with
- Developed a pipeline as a POC using Databricks Workflows, compared the cost with Azure Pipeline, and presented it to the client.

My Contribution to Past Project:
Project: Data Exchange (Security Framework)
Roles: Technical Lead & Architect, Confluent KStream & KSQL
Client: Fidelity & Westpac
Team: 9 members
Technical Skills: Azure DevOps, JDK 19.0, Confluent Kafka, KStream, KSQL, Azure Databricks, DBFS, Delta Lake, Azure Data Factory, ADLS Gen2, Confluent Schema Registry, AES algorithm, hash algorithm, Kubernetes cluster (AKS).
We at Edunexten teach digital marketing, data science, RPA, and web design and development courses in Bangalore. Learn all of the above courses from industry experts with live projects and 100% placement assistance. Register now for a free demo lecture.
NewTechWays provides training in advanced courses like Big Data (Hadoop, Spark, Kafka), Analytics, Databases, Cloud and Software Architecture. We provide both corporate and classroom training. Our training is suitable for experienced people who have prior knowledge of programming with Java or Python and a familiarity with databases, SQL and Linux. We spend about 50% of the time on hands-on exercises. NewTechWays was started in 2017 by Anurag Yadav, who has 17 years of work experience in software product development. He has worked in various leadership and technical roles in many leading software companies. He holds a B.Tech degree from IIT BHU Varanasi.
VAIDEHI SOFTWARE TECHNOLOGIES PRIVATE LIMITED, popularly known as VAIDEHI SOFTWARE, was established in the heart of Bangalore, India's Silicon Valley, under the leadership of eminent personalities from both industry and academia, with a vision to contribute to the IT sector by focusing on IT education and research, entrepreneurship and innovation. Our company is registered with the Ministry of Corporate Affairs, Government of India, and will soon be accredited by many international standards organizations. At present we offer the following services: 1) Research & Development Services 2) Corporate Software Training Services 3) Consulting & Outsourcing
I am working as a Big Data DevOps Engineer and have 7 years of overall IT experience. Skills: 1) Hadoop Administration 2) Cloudera Distribution 3) Ansible 4) Automation 5) Hadoop Ecosystem 6) Linux Administration
I have 7 years of experience in Big Data using Spark and Machine Learning using R. Big Data Projects: 1. Data Mart Creation, 2. Data Stream Processing, 3. Spark Machine Learning, etc. Machine Learning: 1. Marketing Mix Modeling, 2. Linear Optimization problems, 3. Pricing Guidance, etc.
I am a Data Engineer. I love teaching, and I am an aspiring Data Engineer and Data Scientist, interested in solving problems at scale using data engineering, data analysis, machine learning and AI.
Our institute has a mission that encapsulates the great promise to improve the quality of life for all by providing a platform to learn. We are a renowned training institute brand providing top-notch training for the IT industry. We support students in learning and enhancing technical skills related to the IT segment. We work closely with IT organizations so that our students are always up to date, which in turn helps them build a bright career. We offer a wide range of courses specially designed to support today's as well as future industry demands. We make use of the latest technologies to better serve our students and provide them with valuable information. Our team meticulously designs the syllabus for all our courses based on recent developments and requirements in the industry. Our syllabus and training materials are constantly reviewed and updated to help you stay competitive. This is supported by our regular webinars and online conferences, which help students gain more exposure.
Why iCress

Focus, no distractions
Focus away from distractions and accelerate your learning. One week to focus on just one thing: learning Big Data. Like-minded people, experienced instructors, a comprehensive curriculum, and an amazing place.

Mentorship
Our experienced instructors provide guidance in hands-on labs and POCs and help with interview readiness. Instructors will guide you to think of the best way to solve a problem.

Learn by doing
Learn by doing, move fast and break things. We want you to code as much as possible and make mistakes in a safe environment. You will practice everything we explain in our project-based hands-on training.

Collaborative environment
You will work in teams and pairs. You will have to explain what you do and how, rather than just doing it. Explaining is a great way to learn and consolidate new skills.

Class Materials
Each participant in our class receives a well-prepared, comprehensive set of day-wise materials, including course notes and all the class examples.
TECHNICAL SKILLS
Cloudera Distribution for Hadoop (CDH), MapReduce, HDFS, YARN, Hive, Pig, Sqoop, Storm, Spark, Scala, Elasticsearch, Kibana, Parquet, Flume, AWS
Core Java
Linux, UNIX, Windows
Oracle, MySQL
Eclipse
Teradata, Base SAS
Waterfall, Agile

Responsibilities:
Work on a Hadoop cluster with a current size of 56 nodes and 896 terabytes of capacity.
Write MapReduce jobs, HiveQL, Pig and Spark.
Import data using Sqoop into Hive and HBase from an existing SQL Server.
Support code/design analysis, strategy development and project planning.
Create reports for the BI team using Sqoop to export data into HDFS and Hive.
Develop multiple MapReduce jobs in Java for data cleaning and preprocessing.
Take part in requirement analysis, design and development.
Export and import data into HDFS, HBase and Hive using Sqoop.
Create Hive tables, load them with data and write Hive queries that run internally as MapReduce jobs.
Work closely with the business and analytics teams in gathering the system requirements.
Load and transform large sets of structured and semi structu
I am a working professional and conduct training in my organization to mentor junior colleagues.
I am working as a Data Engineer, with knowledge of Spark, Scala, Azure, PySpark, Hive and HBase. I have been giving online tuition for 2 years. I also hold a Databricks certification on Spark.
I have around 15 years of experience in Business Intelligence, having worked in MNCs across different domains in different countries.
I am a Software Developer with 5 years of experience. I scored 90% in SSC, 95% in Intermediate and 69% in B.Tech. I am passionate about teaching, helping students with their subjects and guiding them along the path I have traversed. I believe teaching is all about empathy: being in a student's shoes and explaining things the way they would see the world, rather than being monotonous or too technical with the subject. I am good with English, Maths, Physics and Chemistry, and my special skills lie in the computer science domain.
Dallas Technologies is a leading career-building training institute in Bangalore, offering courses in Software Testing, IBM Mainframes, Cloud Computing, Data Warehousing, Business Intelligence, iPhone Development, Linux Administration, Java/J2EE, VLSI, and more. We also train fresh talent and place it with our clients.
I have 12 years of IT experience, including MySQL, MongoDB, Big Data, Hadoop, HBase, Hive, MapR, Elasticsearch, Solr, Linux, shell scripting, and JavaScript. Strong knowledge of databases and scripting.
UNICOM started its operations in the greater hub of Kolkata, providing coaching related to computer software.
I have 14 years of work experience in IT, including 4.6 years in the Big Data stack, Analytics, Data Science, and AI. Strong work experience in technical, techno-functional, and managerial roles in Big Data and Data Science projects and products. Strong experience with the Hadoop ecosystem (HDFS, MapReduce, HiveQL, Spark), creating next-generation data warehouses and data marts for analytics. Strong work experience with the programming languages Java, Python, Scala, and R in Data Engineering and Data Science projects. Strong experience in using Python and R for Data Science and Machine Learning for predictive analytics: data exploration, feature engineering, training, evaluation, statistical methods, and various data mining algorithms. Strong experience in analytics and visualization using Tableau, SAP Analytics Cloud, and Einstein Analytics. Strong work experience in different cloud environments, such as AWS (Redshift, S3, EBS-EC2, EFS EC2, EMR, Kinesis, Elasticsearch) and Google Cloud, for data storage and analytics. Hands-on experience with deep learning libraries such as TensorFlow and Deep Learning Pipelines for Spark (Databricks). Strong understanding of the SEMMA and CRISP-DM data mining lifecycles. Strong experience in leading ETL projects using Informatica, SSIS, SAP HANA, and SAP BW. Strong prior J2EE experience in building enterprise applications. Experience in project management and product management: business and requirement analysis, translating requirements into design, project planning, effort estimation, execution, project monitoring, and closure. Leading and managing onshore and offshore activities. Experience in managing requirements and teams in the Agile model. Maintaining project-level metrics and reports to share with the customer and senior management. Strong interpersonal, collaboration, and problem-solving skills. Strong understanding of the SDLC, test life cycle, and product lifecycle. Experience in client management, proposal creation, and POCs.
Strong domain knowledge across domains such as Banking & Finance, Travel, and Supply Chain.
I am an engineer with a degree in Electronics and Communication. I have been giving home tuition since 2016. My key skills are Hadoop, Sqoop, Hive, Hue, HBase, and Spark.
Browse hundreds of experienced Big Data tutors across Bangalore. Compare profiles, teaching styles, reviews, and class timings to find the one that fits your goals, whether it's Apache Spark, Hadoop, Scala,
Select your preferred tutor and book a free demo session. Experience their teaching style, ask questions, and understand the class flow before you commit.
Once you're satisfied, make the payment securely through UrbanPro and start your Big Data journey! Learn at your own pace, online or in person, and track your progress easily.
Find the best Big Data Tutor Training
You can browse the list of the best Big Data tutors on UrbanPro.com. You can even book a free demo class to decide which tutor to start classes with.
The fee charged varies between online and offline classes. Generally you get the best quality at the lowest cost in the online classes, as the best tutors don’t like to travel to the Student’s location.
It definitely helps to join Big Data Training near me in Global Technology Park, Bellandur, Bangalore, as you get the desired motivation from a Teacher to learn. If you need personal attention and if your budget allows, select 1-1 Class. If you need peer interaction or have budget constraints, select a Group Class.
UrbanPro has a list of best Big Data Training
Several blogs provided valuable insights and updates on Hadoop and the broader big data ecosystem....
Upgrad is one of the reputed online learning platforms. Upgrad provides a PGD-DS Data Science course...
Starting your own startup in the field of Big Data can be an exciting and rewarding venture, but it...
Hi Priya, since you have B Com and MBA Finance better to go with SAP FICO. Whether online or classroom...
It's a combination of tools available in the Hadoop stack. There is no cookie-cutter approach when it comes...
A SQL join is a Structured Query Language (SQL) instruction to combine data from two sets of data (e.g. two tables). Before we dive into the details of...
Looking for a Business Intelligence (BI) solution for your company can be intimidating. BI uses its own special terminology and the database design concepts...
Macros are little programs that run within Excel and help automate common repetitive tasks. Macros are one of Excel's most powerful, yet underutilized...
Prototype Design Pattern:
• Prototype pattern refers to creating duplicate object while keeping performance in mind.
• This pattern involves...
What is a Piggybank? Piggybank is a jar containing a collection of user-contributed UDFs that is released along with Pig. These are not included...