Big Data & Data Science training

Duration: 1 day

Introduction

  • Origin of Big Data
  • Areas of application
  • Success stories

A Big Data Project

  • Beware: it's not just a technical problem
  • Let's create our first Big Data application
  • Let's understand the key steps
    • Acquisition
    • Cleaning
    • Consolidation
    • Aggregation
    • Storage
    • Analysis
    • Extraction
    • Publication
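
The key steps listed above can be sketched end to end on a toy in-memory dataset. This is only an illustrative sketch using the Python standard library: the sample records, the field names (`city`, `sales`) and every function name are assumptions made for the example, and a real project would rely on distributed tooling rather than plain lists.

```python
import json
from collections import defaultdict

# Hypothetical raw records standing in for two separate data sources.
RAW_SOURCE_A = [
    {"city": " Paris ", "sales": "10"},
    {"city": "Lyon", "sales": "5"},
    {"city": "Paris", "sales": None},  # incomplete record
]
RAW_SOURCE_B = [
    {"city": "lyon", "sales": "7"},
    {"city": "Paris", "sales": "3"},
]


def acquire():
    """Acquisition: collect the raw records from each source."""
    return [RAW_SOURCE_A, RAW_SOURCE_B]


def clean(records):
    """Cleaning: drop incomplete rows, normalize text and types."""
    cleaned = []
    for record in records:
        if record.get("sales") is None:
            continue
        cleaned.append(
            {"city": record["city"].strip().title(), "sales": int(record["sales"])}
        )
    return cleaned


def consolidate(sources):
    """Consolidation: merge the cleaned sources into a single dataset."""
    merged = []
    for source in sources:
        merged.extend(clean(source))
    return merged


def aggregate(records):
    """Aggregation: total sales per city."""
    totals = defaultdict(int)
    for record in records:
        totals[record["city"]] += record["sales"]
    return dict(totals)


def store(totals):
    """Storage: serialize to JSON (a string here; a file or database in practice)."""
    return json.dumps(totals, sort_keys=True)


def analyze(stored):
    """Analysis: find the best-selling city in the stored data."""
    totals = json.loads(stored)
    return max(totals, key=totals.get)


def extract(stored, city):
    """Extraction: pull out only the subset needed for the report."""
    return {city: json.loads(stored)[city]}


def publish(extracted):
    """Publication: format the result for its readers."""
    (city, sales), = extracted.items()
    return f"Top city: {city} ({sales} sales)"


def run_pipeline():
    stored = store(aggregate(consolidate(acquire())))
    return publish(extract(stored, analyze(stored)))


if __name__ == "__main__":
    print(run_pipeline())
```

Each function maps to one step of the list, so a step can be replaced (for example, storage in a NoSQL database) without touching the others.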

Storing the data

  • The different storage tools
  • The NoSQL revolution and its different families

Notebooks

Data management

  • Defining data
  • Batch data processing (Hadoop and Spark)
  • Streaming with Kafka and Flink
  • The data lake
    • The challenges and the different zones
    • Storage formats
    • Lambda and Kappa architectures
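
The difference between the Lambda and Kappa architectures can be illustrated with a small page-view counter. This is a simplified sketch, not a production design: the event shape (`{"page": ...}`), the sample data and the function names are all hypothetical, and plain Python lists stand in for what would normally be a batch store and an event log.

```python
from collections import Counter


def batch_view(master_dataset):
    """Lambda batch layer: recompute counts over the full historical dataset."""
    return Counter(event["page"] for event in master_dataset)


def speed_view(recent_events):
    """Lambda speed layer: count only events not yet covered by the batch view."""
    return Counter(event["page"] for event in recent_events)


def lambda_query(master_dataset, recent_events):
    """Lambda serving layer: merge the batch and real-time views."""
    return batch_view(master_dataset) + speed_view(recent_events)


def kappa_query(event_log):
    """Kappa: a single streaming code path processes (or replays) the whole log."""
    counts = Counter()
    for event in event_log:
        counts[event["page"]] += 1
    return counts


# Hypothetical sample events.
HISTORY = [{"page": "home"}, {"page": "home"}, {"page": "cart"}]
RECENT = [{"page": "cart"}, {"page": "home"}]

if __name__ == "__main__":
    print(lambda_query(HISTORY, RECENT))
    print(kappa_query(HISTORY + RECENT))
```

Both approaches give the same counts here; the trade-off is that Lambda maintains two code paths (batch and speed), while Kappa keeps one and reprocesses the log when the logic changes.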

Big Data in the company

  • Common dysfunctions
  • The failure of exploratory approaches
  • Governance and security
  • Successfully adopting the data lake