OVERVIEW

Key Program Details

<aside> 📅 8-12 months to complete

</aside>

<aside> ⏰ Minimum 15 hours per week

</aside>

<aside> 📖 Core program: 3 modules

</aside>

<aside> 👉 Optional part: 1 specialization module

</aside>

MAIN MODULES

1. Introduction to Data Engineering

Sprint 1

Intermediate Python & Git - Python data model, Python sequences, Git basics

This Sprint is focused on Python, introduction to data engineering, and git. The learners will cover the basics of Python's data model and learn about Python's sequences and how to use them effectively. They will also learn the basics of data engineering: data engineering lifecycle, data engineering ecosystem, and opportunities in data engineering. The learners will also practice git and Python skills with interactive exercises.

Sprint 2

Introduction to Relational Databases & SQL Basics - Python mutability and object references, SQL queries

This Sprint is focused on an intro to RDBMS and SQL, Python, and git. The learners will do a deeper dive into dictionaries and sets, learn how to work with bytes and unicode effectively, learn about data classes and their usage patterns, and how objects are referenced in Python. They will also learn what MySQL is, the primary keys, and foreign keys, how queries are executed, and how to use SQL aliases. The learners will also practice git and Python skills with interactive exercises.

Sprint 3

Intermediate SQL - SQL joins, subqueries, sets, and strings

This Sprint is focused on Python, SQL, and git. The learners will learn SQL basics: how to generate and manipulate strings, how to work with sets, and how to do joins and subqueries. They will also learn decorators, closures, and other advanced functions and they will dive deeper into types, types hints, and software design patterns. The learners will also practice git and Python skills with interactive exercises.


2. Fundamentals of Data Engineering