Big Data MasterClass

The worldwide revenue for Big Data and Data Analytics will grow to more than $203 Billion in 2020.

Banking specifically will see the fastest spending growth, and industries such as telecommunications, insurance, transportation, and utilities will also start to increase their own spending during this same period, helping to fuel growth.

Together with an increase interest and investment in AI, new tools for collecting and analysing data and new enterprise roles and responsibilities will emerge presenting IT professionals and individuals planning to pursue a career in Big Data and Data Analytics with tremendous career opprotunities.

This can never be a better time to acquire the necessary skills and gain proficiency in Big Data and Data Analyics


Big Data MasterClass is a course on how to use Big Data to identify correlation and causation statistically valid models to help them make more accurate decisions.

logocgreenfb BDM01 – Big Data MasterClass
28 August 2017 to 30 August 2017
More Information | Register Now 

3 Days

    • Introduction to Big Data
      • Big Data Definition
      • Significance – Why Big Data?
      • Conventional Data vs Big Data
      • How Individuals Contribute Towards Big Data
      • Role of Big Data in Day-to-Day Life
      • Why Relational Database Management System (RDBMS) is Not Suitable for Big Data?
      • Disadvantage of Relational Database Management System (RDBMS)
    • Case Studies on the use of Big Data by Leading Companies
    • Introduction on Hadoop
      • Hadoop Architecture
      • Importance of Hadoop in Big Data
      • File Storage in Hadoop
      • Hadoop Components
      • Hadoop Ecosystem
      • Block Allocation in HDFS
      • HDFS Architecture
      • HDFS Read Operation
      • HDFS Write Operation
      • When should HDFS be used
      • Advantages of Hadoop
      • Disadvantages of Hadoop 1.0
      • Introduction to Hadoop 2.0
      • Yarn Architecture
      • Yarn Components
      • YARN Ecosystem
      • Difference between Hadoop 1.0 and 2.0
    • Hands-On Exercise on Cloudera 5.10
    • MapReduce
      • MapReduce Concepts
      • MapReduce Components
      • MapReduce Architecture
      • MapReduce Internals
      • Maper, Reducer, Driver
      • Running a MapReduce Job
    • Hands-On Exercise 
    • Introduction to Apache Pig
      • Apache Pig Architecture
      • Apache Pig Component
      • Apache Pig Latin Basics
      • Data Loading and Storing in Apache Pig
      • Filtering in Apache Pig
      • Data Transformation in Apache Pig
      • Grouping and Sorting in Apache Pig
      • Advanced Features
      • Joins in Apache Pig & User Defined Functions
    • Hands-On Exercise
    • Introduction to Apache Hive
      • Apache Hive Architecture
      • Apache Hive Components
      • Data Storage in Apache Hive
      • Data Type in Apache Hive
      • Apache Hive Query Language and Features
      • Partitions in Apache Hive
      • Joins in Apache Hive
      • Advanced Features – Handling JSON format in Apache Hive
Tools / software used

Hadoop, MapReduce, Apache Pig, Apache Hive


This course is designed to assist those people interested in Big Data and Data Analytics. The Big Data MasterClass is intended to equip you with a set of skills that you can draw on to implement the technology in your organisation.

Class is limited to 20 attendees as hands-on sessions and real-time demonstration is expected.


On completion of this Big Data MasterClass, attendees will have a robust set of Big Data skills that can be applied to any work setting.

Contact Us for more information