BIG DATA ANALYTICS USING HADOOP & SPARK

Our courses are open to individuals from various backgrounds and help them propel their careers into analytics.

COURSE ID – DBAL-105     DURATION – 48 Hours

 
Program Objective
In this Big Data training, attendees will gain a detailed, practical skill set in Hadoop, including its core and ecosystem components. The course uses a case-study approach to teach the various tools, keeps the training industry-relevant throughout, and blends analytics with technology. Candidates are awarded the Big Data Analytics using Hadoop & Spark Certification on successful completion of the projects provided as part of the training.
After the completion of this course, you will be able to:

  • Work with the concepts of HDFS and the MapReduce framework.
  • Learn data loading techniques using Sqoop and Flume.
  • Perform data analytics using Pig, Hive, Impala, HBase, and YARN.
  • Implement Spark applications on YARN (Hadoop).
  • Stream data using the Spark Streaming API.
  • Analyze the Hive and Spark SQL architectures.
  • Implement Spark SQL queries to perform several computations.
  • Learn large-scale data processing.
  • Complete end-to-end Big Data projects.
Who Should Take This Course?
This course is a foundation for anyone who aspires to enter the field of big data and keep abreast of the latest developments in fast, efficient processing of ever-growing data using Hadoop, Spark, and related projects.
This course is ideal for professionals such as data scientists, analytics professionals, IT/software professionals, BI/ETL/EDW professionals, statisticians, and big data enthusiasts.
Prerequisites
There are no prerequisites for this course. Knowledge of Linux, SQL, and any programming language, along with some exposure to data analytics, would be an advantage. Beginners are highly recommended to first complete the course “Data Science using R- Python” or “Data Science using SAS”.
Modules & Topics

Introduction

  • Hadoop: Introduction to Hadoop & Ecosystem

Hadoop

  • Hadoop core components: HDFS
  • Hadoop core components: MapReduce (YARN)
  • Hadoop Data Analysis Tools: Pig
  • Hadoop Data Analysis Tools: Hive
  • Hadoop Data Analysis Tools: Impala
  • Hadoop Data Analysis Tools: HBase (NoSQL Database)
  • Hadoop: Introduction to other Apache Projects

Spark

  • Spark: Introduction
  • Spark: Spark in Practice
  • Spark: Spark Meets Hive

Need help choosing which course to enroll in?

Talk to us today so we can understand your requirements in detail and help you achieve your analytics training goals!