This course covers the key concepts of Big Data. Participants will become familiar with the main technologies involved and the architectures behind them. Among other topics, the course covers the Hadoop ecosystem, Spark, and NoSQL databases. It also discusses the challenges Big Data developers face and which tools are recommended in a given situation.

Audience

  • Developers
  • Architects
  • Analysts
  • DBAs

Prerequisites

  • Basic knowledge of database concepts and development environments

Duration: 24 Hours

Data Management

Certificate: No

Price: Contact us for more details

Course Outline

Module 1: Introduction to Big Data

  • Key concepts
  • Use cases
  • Major technologies involved

Module 2: Introduction to the Hadoop Ecosystem

  • Problems with Traditional Large-Scale Systems
  • The Hadoop Ecosystem
 

Module 3: Hadoop Architecture

  • Distributed Processing on a Cluster
  • Storage: HDFS Architecture
  • Storage: Using HDFS
  • Resource Management: YARN Architecture

Module 4: Importing Data into Hadoop

  • Sqoop, Flume, and Kafka

Module 5: Processing Data on the Cluster

  • Introduction to Hive
  • Introduction to Spark
  • Other tools

Module 6: NoSQL

  • Basic concepts
  • NoSQL families
  • Major players in the market