The course delivers the key concepts of Big Data. Participants will get familiar with the main technologies involved and with the architectures behind them. Among other, the course will go over the Hadoop Eco-system, Spark and NoSQL databases. The course will also discuss the challenges faced by Big Data developers and what are the recommended tools to use in a given situation.

  • Developers
  • Architects
  • Analysts
  • DBAs
  • Basic knowledge of database concepts and development environments


24 Hours

Data Management


Certificate: No

Price: contact us for more details

Leave your details

Course Outline

Module 1: Introduction to Big Data

  • Key concepts
  • Use cases
  • Major technologies involved


Module 2: Introduction to the Hadoop Ecosystem

  • Problems with Traditional Large-scale Systems
  • The Hadoop Eco-System


Module 3: Hadoop Architecture

  • Distributed Processing on a Cluster
  • Storage: HDFS Architecture
  • Storage: Using HDFS
  • Resource Management: YARN Architecture


Module 4: Importing Data into Hadoop

  • Sqoop, Flume and Kafka


Module 5: Process data over the cluster

  • Introduction to Hive
  • Introduction to Spark
  • Other tools


Module 6: NoSQL

  • Basic concepts
  • NoSQL families
  • Major players in the market