• Votre sélection est vide.

    Enregistrez les diplômes, parcours ou enseignements de votre choix.

Data engineering (INFO904_SIAY)

  • ECTS

    3 crédits

  • Composante

    IAE Savoie Mont Blanc

Description

The course introduces students to modern technologies used for data processing in the context of Big Data. In particular, the course aims to position the need for parallel processing architectures in the modern data environment. Through examples, students will learn how to implement simple processing pipelines and study the benefit of their usage.

Lire plus

Heures d'enseignement

  • Data engineering - CMCours Magistral12h
  • Data engineering - TDTravaux Dirigés12h

Pré-requis obligatoires

Programming skills in Python. Notions of data processing libraries, e.g., NumPy, Pandas, etc. Basic notions of parallel computing.

Lire plus

Plan du cours

  • Data engineering 101: data stores and parallel computing
  • Big data tools: MapReduce, Hadoop, Spark
  • Extract, Transform, and Load pipelines
Lire plus

Compétences visées

  • Understanding of big data processing tools
  • Ability to implement simple parallel processing pipelines
  • Ability to display the results of data processing pipelines
Lire plus