Airflow Data Quality (Day 1 Lab)

In this lab, Zach will provide an overview of airflow backfilling practices and discuss the pipeline code using pyspark. He will explain the importance of including the username in the DAG name and demonstrate how to run glue jobs and backfill data. He will also show a table that was backfilled and discuss the comparison of tables using union all. [Recorded on May 22nd, 2024]

42 mins

Purchase Required

You need to purchase this content in order to view it

Airflow Data Quality (Day 2 Lab)

Week 3: Airflow and Airflow Data Quality

Airflow Data Quality (Day 2 Lecture)