Big Data Programming - Essential

To stay competitive a business needs to know as much as it can about people, the environment it's operating in, and who and where the competitors are. The amount of data companies collect keeps growing. There is an urgent need of a strategy to make sense of it all. Star Big Data Programming is a certification course that will help learners master the skills they need to establish a successful career as a data engineer. The program will help the learners master the skills on HDFS, MapReduce, HBase, Hive, Pig, Yarn, Oozie, Flume and Sqoop using real-time use cases from retail, social media, aviation, tourism, and finance industries. It equips the learners with in-depth knowledge of writing code using the MapReduce framework and managing large data sets with HBase.


Beginner Level

Big Data Programming-Essential Course Objectives

In this course, you will learn about:

  • Big data and its business applications
  • Apache Hadoop and its big data eco-system
  • Deploying Hadoop in a clustered environment
  • Interacting with No-SQL databases
  • Managing key Hadoop components (HDFS, YARN and Hive)
  • Spark - the next-generation computational framework
  • Installing and working with Hadoop
  • Hadoop related technologies – Avro, Flume, Sqoop, Pig, Oozie, etc
  • Advanced topics like Hadoop security, Cloudera, IBM InfoSphere and more

Course Outcome

After competing this course, you will be able to:

  • Understand the finer nuances of the Big Data technology
  • Deal with Big Data related tools, platforms, and their architecture to store, program, process, and manage the data
  • Deploy Hadoop and its related technologies
  • Use the Hadoop ecosystem to manage your data
  • Deploy machine learning concepts with Mahout

Table Of Contents Outline          

  1. Introducing Data and Big Data
  2. Big Data and Hadoop
  3. HDFS - Storing Data in Hadoop
  4. Introduction to MapReduce Exploring the Working of a MapReduce Process
  5. Avro and Parquet
  6. Flume - Service for Streaming Event Data and Sqoop (MySQL to Hadoop)
  7. Apache Pig and Hive – Data Warehouse
  8. Exploring Spark and Scala Exploring HBase - Big Data Store
  9. HBase and Zookeeper - Coordination Service for Distributed Applications Exploring Storm
  10. Interacting with NoSQL Databases

Exam Details

Exam Codes SCBPE S07-121A (Academy customers use the same codes)
Launch Date Dec 01 2019
Number of Questions 60
Type of Questions MULTIPLE CHOICE
Length of Test 90 Minutes
Passing Score 70%
Recommended Experience Beginner Level
Languages English


Official Poster

Official Book


Participation Certificate


Examination Voucher


Global Certificate


Contact Us