Upcoming Batch
25 to 29 Feb 2021
Online ‘Live’
9:30am to 5:30pm
Singapore Timezone

5 Sessions / 40-Hours


Apply Now

1 to 5 Mar 2021
Online ‘Live’
9:30am to 5:30pm
Singapore Timezone

5 Sessions / 40 Hours


Apply Now

26 to 30 Apr 2021
Online ‘Live’
9:30am to 5:30pm
Singapore Timezone

5 Sessions / 40 Hours


Apply Now

ExcellentCASUGOL4.9★★★★★koen de witkoen de wit ★★★★★ I wanted to expand my horizon and open myself to a new way of thinking about digital transformation. It certainly more than overdeliveredDavid MouldDavid Mould ★★★★★ Peesa ChoomsawasdiPeesa Choomsawasdi ★★★★★ Enjoyed the class, insightful and pragmatic.Selva Subramanian SivanaiahSelva Subramanian Sivanaiah ★★★★★ Ahmad Tariq Ahmad ZiyadAhmad Tariq Ahmad Ziyad ★★★★★ Thanks Dwayne for teaching us the CASUGOL Digital Transformation Framework. After the class, it makes me can't wait to start to many potential area of improvements in my company internally and externally on our Digital Transformation journeyKuchaiKuchai ★★★★★ Very exciting course to be joined by others in AI era.Joel SebastianJoel Sebastian ★★★★★ Marshall Tommie LajawaiMarshall Tommie Lajawai ★★★★★ Great training experience!Auriel AnthonyAuriel Anthony ★★★★★ Thank you CASUGOL for coming all the way to Sabah, East Malaysia at Sabah Credit Corporation. Had amazing digital transformation training with CASUGOL!cordon scccordon scc ★★★★★ great learning experience and learn more about digital transformation. Also great speakerJeamron AratJeamron Arat ★★★★★ Attended CASUGOL's CDTP training and learned a lot about the Digital Transformation framework and core principles which can be applied on the job. Plenty of real-world examples and active discussion during the trainings.Kira ChanKira Chan ★★★★★ toni gasangtoni gasang ★★★★★ Innocelzever AbdonInnocelzever Abdon ★★★★★ I've enjoyed the whole 2-day training as Dwayne kept it light and fun while being an educator at the same time, it made my experience memorable and informativeJaphet MoralejaJaphet Moraleja ★★★★★ Ian Curt SarmientoIan Curt Sarmiento ★★★★★ Great way of sharing expertise and knowledge in small steps but big meaning change.Patricia Caldito-LimPatricia Caldito-Lim ★★★★★ Casugol provides a better understanding on how to leverage well on Artificial Intelligence. I've a learned a lot on a 1-day session. Thank you!Kavilashini AlagenthranKavilashini Alagenthran ★★★★★ Tiwi HarefaTiwi Harefa ★★★★★ Raihan Muhammad IqbalRaihan Muhammad Iqbal ★★★★★ vira agathavira agatha ★★★★★ It was really fun to follow the casugol training, the lessons were easy to understandHanna YunitaHanna Yunita ★★★★★ Nurianda WulandariNurianda Wulandari ★★★★★ Terimakasih untuk pembelajarannya selama ini, semoga ilmu yang di berikan bermanfaat bagi semua orgLiza SinagaLiza Sinaga ★★★★★ ryanlau terate22ryanlau terate22 ★★★★★ Anita Romauli SimamoraAnita Romauli Simamora ★★★★★ During this training, I gained a lot of valuable experience, both in terms of knowledge, skills, and work attitudes. The material presented is very relevant to the needs of the world of work, and is presented in an interactive and easy-to-understand manner.Wahyu SyaputraWahyu Syaputra ★★★★★ Saya sangat senang mengikuti materi pembelajaran seminar iniEcah HerlinaEcah Herlina ★★★★★ The class is very goodvioretta angelavioretta angela ★★★★★ stephaniestephanie ★★★★★ so fun experience!thanks to mr. DwayneElvisElvis ★★★★★ Victor OctarinoVictor Octarino ★★★★★ Valent GValent G ★★★★★ Glae DisyaGlae Disya ★★★★★ Nice place to learnNsalim batamNsalim batam ★★★★★ Myoui Aldo ChoMyoui Aldo Cho ★★★★★ Sundi CenturiaSundi Centuria ★★★★★ Andira NasaruddinAndira Nasaruddin ★★★★★ AurellAurell ★★★★★ Putri MayangPutri Mayang ★★★★★ The class was very enjoyable and the material was meaningfulReza Seri Rahayu SaputriReza Seri Rahayu Saputri ★★★★★ Febby aulya putriFebby aulya putri ★★★★★ Good JobAmaliyaa NurAmaliyaa Nur ★★★★★ good experienceTania RidhaTania Ridha ★★★★★ Erin KristinaErin Kristina ★★★★★ Joining Casugol was one of the best decisions in my personal and professional development journey.SalmiahSalmiah ★★★★★ raffa witaaraffa witaa ★★★★★ Helena LukmetiablaHelena Lukmetiabla ★★★★★ js_loader
Course Information
  • Duration: 5-Day / 40 Hours

  • Certification: Participants will receive a Certificate of Competency upon successfully completing the course and passing the examination

  • Who Should Attend: Data Scientist, Analyst, Data Engineers, CIO, CTO, Software Programmers, and Anyone interested in acquiring advanced knowledge and skills in Big Data, Hadoop and Python

Course Objective

To provide participants with the advanced knowledge on Big Data Analytics.

Learn how Big Data Analytics is being applied in real life through real-time demonstration on scenario based hands-on exercises.

Pre-Requisite

It is preferred that participants successfully complete and pass Advanced Big Data Professional


Examination

Participants are required to attempt an examination upon completion of course. This exam tests a candidate’s knowledge and skills related to Big Data,Hadoop and Python based on the syllabus covered

Module 1 Overview on Big Data Hadoop Ecosystem

Topics Covered
  • Python Refresher
  • Standard Toolkit for Hadoop and Analytics
  • Understanding Relational, NoSQL, and Graph Databases
  • Construction of Data Pipelines
  • Data Modeling in Hadoop
  • HDFS Schema Design
  • HBase Schema Design
  • Working on Metadata
Module 2 Advanced Hadoop Techniques

Topics Covered
  • What is Data Ingestion?
  • Different Ways to Perform Data Ingestion
  • Data Extraction
  • Data Processing in Hadoop
  • Overview on MapReduce
  • Working on Spark components
  • Pig and How it is Being Used
  • Overview on Hive
  • Impala Speed-Oriented Design
Module 3 Introduction to Orchestration

Topics Covered
  • Orchestration Frameworks in Hadoop
  • Oozie Terminology and Workflow
  • Windowing Analysis using Spark
  • Parameterizing Workflows
  • Scheduling Patterns
  • Execution of Workflows
Module 4 Real-Time Processing with Hadoop

Topics Covered
  • Stream Processing
  • Integration of Apache Storm with HDFS and HBase
  • Trident Overview
  • Spark Streaming Overview
  • Flume Interceptors
  • Low-Latency Enrichment, Validation, Alerting, and Ingestion
  • NRT Counting, Rolling Averages, and Iterative Processing
  • Complex Data Pipelines
Module 5 Working with Big Data Framework using Python and Spark

Topics Covered
  • Hadoop and Spark Refresher
  • Spark SQL and Python Pandas DataFrame
  • Improving Analysis Performance with Parquet and Partitions
  • Working with Unstructured Data
  • Working on Spark DataFrames
  • Writing Output from Spark DataFrames
  • Data Manipulation with Spark DataFrames
  • Plotting Graph in Sparks
Module 6 Exploratory Data Analysis

Topics Covered
  • Handling of Missing Values using Spark DataFrame
  • Correlation Analysis with Python PySpark DataFrame
  • Improving Analysis Performance with Parquet and Partitions
  • Understanding Exploratory Data Analysis
  • Identify Target Variable and Related KPIs
  • Feature Importance of Target Variable
  • Different Phases of an Analytics Project Life Cycle
  • Gaussian Distribution of Numeric Features
Module 7 Advanced Big Data Analysis

Topics Covered
  • Reproducible Approach to Gathering Data
  • Understanding the Standards and Code Practices
  • Segmentation of Workflow
  • Missing Value Preprocessing with High Reproducibility
  • Use of Functions / Loops to Optimize Coding
  • Utilization of Libraries / Packages / Algorithms
  • Normalization of Data
Module 8 Putting Everything To Together

Topics Covered
  • Reading Data from a CSV File with Python PySpark Object
  • Reading JSON Data with Python PySpark Object
  • Using Python PySpark Objects for SQL Operations
  • Generating Statistical Measurements
  • Visualisation Using Plotly
Module 9 Big Data and Machine Learning using Spark

Topics Covered
  • Resilient Distributed Datasets with Spark
  • Introduction to Spark MLlib
  • Decision Tree with Spark MLlib
  • K-Means Clustering with Spark
  • Term Frequency – Inverse Document Frequency (TF-IDF)
  • DataFrame API with Spark MLlib
  • Understanding A/B Testing


Advanced Big Data Analytics Expert (ABDE) involves rigorous usage of real-time case studies, hands-on exercises and group discussion

Explore More Courses
What Past Participants Say
The knowledge acquired from CASUGOL program has allowed me to gain a deep understanding on the technology and will come handy when there is a requirement to kickstart future projects.

Syed Othman Abd Rahman

The hands-on exercises which is part of CASUGOL course syllabus are clearly explained, demonstrated and implemented in class with the guidance of experienced CASUGOL trainers.

Andrew Lim

Why CASUGOL

Customization of Programs for specific industry, organisation, government agencies, statutory boards.


Flexible programmes designed to cater to the individual needs of participants, whether for professional upskilling, or for general interest.

Benefit from contribution from leading Industry Experts, Academics, and Researchers from across the world.


Opportunities for employers to develop their workforce and for individuals to enhance their career.

Dynamic learning environment that providing participants with professional networking opportunity.


Online support for participants after the training.

Need more information?
Let us help if you are planning to advance your career and further your education. Request for more information.

Request for more