Certified Big Data Expert

Extensive Big Data training on Hadoop core and its latest components

Did you know that in the next 3 years more than half of the world's data will move to Hadoop? No wonder the McKinsey Global Institute estimates a shortage of 1.7 million Big Data professionals over the next 3 years.


Given this widening gap between demand and supply, this Big Data Hadoop training helps IT/ITES professionals bag lucrative opportunities and boost their careers by gaining sought-after Big Data Analytics skills. Attendees gain practical skills in Hadoop in detail, including its core and latest components such as HDFS, MapReduce, Pig, Hive, Impala, HBase, Jasper, Sqoop, Flume, Oozie, ZooKeeper, Spark and Storm. For extensive hands-on practice, candidates in both the online and classroom Hadoop training get access to the virtual lab and to several assignments and projects for Big Data certification. At the end of the program, candidates are awarded the Big Data Certification on successful completion of the projects provided as part of the training. Optionally, candidates can also appear for the Cloudera or Hortonworks Big Data Hadoop certification after this course.


This is a completely industry-relevant Big Data Analytics training and a great blend of analytics and technology, which makes it apt for aspirants who want to develop Big Data Analytics skills and get a head start in Big Data!


Course duration: 100 hours (at least 50 hours of live training + practice and self-study, with ~8 hours of weekly self-study)

Who should do this course?

IT/ITES, Business Intelligence and Database professionals, and computer science (or any other circuit branch) graduates who want to get into a Big Data Analytics/Developer role.

Enroll in this course

Combo Deals!

Learn more, save more.
See our combo offers here.

Course Duration: 100 hours
Classes: 20
Tools: Cloudera Hadoop quick start (CDH 5.4.1) and Apache Hadoop
Learning Mode: Live Training
Next Batch: 22nd January, 2017

  • Introduction and relevance
  • Uses of Big Data analytics in various industries such as Telecom, E-commerce, Finance and Insurance
  • Problems with Traditional Large-Scale Systems
  • Motivation for Hadoop
  • Different types of projects by Apache
  • Role of projects in the Hadoop Ecosystem
  • Key technology foundations required for Big Data
  • Limitations and Solutions of existing Data Analytics Architecture
  • Comparison of traditional data management systems with Big Data management systems
  • Evaluate key framework requirements for Big Data analytics
  • Hadoop Ecosystem & Hadoop 2.x core components
  • Explain the relevance of real-time data
  • Explain how to use big and real-time data as a Business planning tool
  • Quick tour of Java
  • Quick tour of Linux commands
  • Introduction to Cloudera VM/Cloudera Manager (Apache Ambari)/Download & usage instructions
  • Hadoop Master-Slave Architecture
  • The Hadoop Distributed File System - data storage
  • Explain different types of cluster setups (fully distributed, pseudo-distributed, etc.)
  • Hadoop Cluster set up - Installation
  • Hadoop 2.x Cluster Architecture
  • A Typical enterprise cluster – Hadoop Cluster Modes
  • HDFS Overview & Data storage in HDFS
  • Get data into Hadoop from a local machine and vice versa (Data Loading Techniques)
  • MapReduce Overview (Traditional way Vs. MapReduce way)
  • Concept of Mapper & Reducer
  • Understanding MapReduce program skeleton
  • Running a MapReduce job from the command line/Eclipse
  • Develop a MapReduce program in Java
  • Develop a MapReduce program with the streaming API
  • Test and debug a MapReduce program at design time
  • How Partitioners and Reducers Work Together
  • Writing Custom Partitioners; Data Input and Output
  • Creating Custom Writable and Writable Comparable Implementations
  • Integrating Hadoop into an existing Enterprise
  • Loading Data from an RDBMS into HDFS by Using Sqoop
  • Managing Real-Time Data Using Flume
  • Accessing HDFS from Legacy Systems with FuseDFS and HttpFS
  • Introduction to Talend (community system)
  • Data loading to HDFS using Talend
  • Introduction to Hadoop Data Analysis Tools
  • Introduction to Pig - MapReduce vs. Pig, Pig Use Cases
  • Pig Latin Program & Execution
  • Pig Latin: Relational Operators, File Loaders, Group Operator, COGROUP Operator, Joins and COGROUP, Union, Diagnostic Operators, Pig UDF
  • Use Pig to automate the design and implementation of MapReduce applications
  • Data Analysis using Pig
  • Introduction to Hive - Hive vs. Pig - Hive Use Cases
  • Discuss the Hive data storage principle
  • Explain the file formats and record formats supported by the Hive environment
  • Perform operations with data in Hive
  • Hive QL: Joining Tables, Dynamic Partitioning, Custom MapReduce Scripts
  • Hive Script, Hive UDF
  • Introduction to Impala & Architecture
  • How Impala executes queries and its importance
  • Hive vs. Pig vs. Impala
  • Extending Impala with User Defined Functions
  • Improving Impala performance
  • Introduction to NoSQL Databases and HBase
  • HBase vs. RDBMS, HBase Components, HBase Architecture
  • HBase Cluster Deployment
  • Introduction to the role of R in the Hadoop Ecosystem
  • Introduction to Jasper Reports & creating reports by integrating with Hadoop
  • Role of Kafka & Avro in real projects
  • Introduction to ZooKeeper - ZooKeeper Data Model, ZooKeeper Service
  • Introduction to Oozie - Analyze workflow design and management using Oozie
  • Design and implement an Oozie Workflow
  • Introduction to Storm
  • Introduction to Spark
  • Final project including integration of various key components
  • Follow-up session: Tips and tricks for projects, certification and interviews
Data storage using HDFS
This case study gives practical experience in storing and managing different types of data (structured, semi-structured and unstructured), both compressed and uncompressed.
Processing data using MapReduce
This case study gives practical experience in understanding and developing MapReduce programs in Java and R, and in running streaming jobs from the terminal and Eclipse.
Data integration using Sqoop & Flume
This case study gives practical experience in extracting data from Oracle and loading it into HDFS (and vice versa), and in extracting data from Twitter and storing it in HDFS.
Data Analysis using Pig
This case study gives practical experience in complete data analysis using Pig, including the creation and usage of user defined functions (UDFs).
Data Analysis using Hive
This case study gives practical experience in complete data analysis using Hive, including the creation and usage of user defined functions (UDFs).
HBase NoSQL database creation
This case study gives practical experience in data table/cluster creation using HBase.
Final Project: Integration of Pig-Hive-HBase-Oozie-ZooKeeper
The final project gives practical experience in how different modules (Pig, Hive, HBase, Oozie, ZooKeeper) can be used together to solve Big Data problems.

Access to 48 hours of instructor-led live classes (16 sessions of 3 hours each), spread over 8 weekends

Video recordings of the class sessions for self-study

Weekly assignments, reference code and study material in PDF format

Module-wise case studies/projects

Specially curated study material and sample questions for Big Data Certification (Developer/Analyst)

Career guidance and career support after completion of selected assignments and case studies

What if I miss a class?

Don’t worry. You will always get a recording of the class in your inbox. All our live classes are recorded for self-study and future reference, and the recordings can also be accessed through our Learning Management System (LMS). If you miss a class, watch the recording and then reach out to the faculty during their doubt-clearing time, or ask your question at the beginning of the subsequent class.

You can also repeat any class you want within one year after your course completion.

For how long are the recordings available to me?

For 6 months after your course completion. If needed, you can also repeat any number of classes within one year after course completion.

The recordings are effectively available to you for a lifetime, but for judicious use of IT resources, access to them is deactivated 6 months after course completion. Access can be extended upon request.

Can I download the recordings?

No. However, you can access our recordings through your account on the LMS and stream them online at any time.

Recordings are an integral part of AnalytixLabs' intellectual property (suo jure). Downloading or distributing these recordings in any way is strictly prohibited and illegal, as they are protected under copyright law. If a student is found doing so, it will lead to an immediate and permanent suspension of services: access to all learning resources will be blocked, the course fee will be forfeited, and the institute will have every right to take strict legal action against the individual.

What if I share my LMS login details with a friend?

Sharing LMS login credentials is unauthorized. As a security measure, if the LMS is accessed from multiple locations, the system will flag it and your access to the LMS can be terminated.

Will I get a certificate in the end?

Yes. All our courses are certified. As part of the course, students get weekly assignments and module-wise case studies. The certificate is awarded on completion of at least 70% of the selected assignments and case studies.

Do you help in placements?

We follow a comprehensive and self-sustaining system to help our students with placements. This is a win-win situation for our candidates and corporate clients. As a prerequisite for learning validation, candidates are required to submit the case studies and project work provided as part of the course (flexible deadline). Support from our side is continuous and encompasses help with profile building and CV referrals through our ex-students, HR consultants and companies that reach out to us directly.

We will guide you on the right profiles for you based on your education and experience, help with interview preparation, and conduct mock interviews if required. For us, the placement process doesn’t end at a definite time after your course completion; it is a long relationship that we would like to build.

Do you guarantee placements?

No institute can guarantee placements, unless it is doing so as a marketing gimmick! In a professional environment, it is simply not feasible.

For us, placement support is on a best-effort basis but not time-bound; in some cases students reach out to us for career support even after 3 years.

Do you have a classroom option?

Yes, we have a classroom option for Delhi-NCR candidates. However, most of our students end up taking the instructor-led live online classes, including those who join the classroom at the beginning. Based on student feedback, the learning experience is the same in the classroom and in the fully interactive, instructor-led live online mode.

How do I attend the online classes? Are they interactive or self-paced?

We provide both options. For instructor-led live online classes, we use the gold standard platform used by top universities across the globe. These video sessions are fully interactive, and students can chat or even ask their questions verbally over VoIP in real time to get their doubts cleared.

What do I need to attend the online classes?

To attend the online classes, all you need is a laptop/PC with a basic internet connection. Students have often shared good feedback about attending these live classes over a data card or even a mobile 3G connection, though we recommend a basic broadband connection.

For the best user experience, a headset with a mic is recommended to enhance voice quality, though the laptop’s built-in mic works fine and you can also ask your questions over chat.

How can I reach out to someone if I have doubts post class?

Through the LMS, students can always connect with the trainer or even schedule one-to-one time over the phone or online. During the course we also schedule periodic doubt-clearing classes, and students can raise doubts from one class in the subsequent class.

The LMS also has a discussion forum where many of your doubts might get easily answered.

In case you still have a problem, repeat the class and schedule one-to-one time with the trainer.

I am having difficulty coping with my classes. What can I do?

For all courses, we provide recordings of each class for self-reference and revision in case you miss any concept in class. If you still have doubts after revising with the recordings, you can also take one-to-one time with the faculty outside class hours. Furthermore, students who want to split their course into modules get one year to repeat any of the classes with other batches.

What is your refund policy?
  • Instructor-led live online or classroom: within 7 days of the registration date and no later than 3 days before batch start
  • Video-based: 2 days

Can I pay in instalments?

Not for this course. Instalment options are available only for our courses that are at least 3 months long.

What are the system requirements for the software?

We recommend a 64-bit operating system with a minimum of 8GB RAM so that the virtual lab can be installed easily.

AnalytixLabs stands out from the crowd of vast analytics institutes mainly because of its practical and conceptual approach. The faculty is highly skilled with phenomenal industry experience which is displayed in every learning aspect. They would try their best to resolve every doubt you have. The course content and assignments are very well crafted which would help a lot securing a good job. AnalytixLabs is the right place if someone is looking to learn Analytics from scratch.

- Prachi Khurana (Senior Associate Analyst at WNS Global Services)
Have Questions?
Contact us and we shall get back with answers.

Change the course of your career

2000+ students have already registered for our courses. Learn analytics from the experts.