Possibly the fastest DataFrame-agnostic quality check library in town.
Updated Jan 13, 2025 - Python
This project is an end-to-end ETL solution for processing and analyzing US flight data with Apache Spark and Iceberg. It features pipeline unit tests, data quality checks, and an interactive dashboard. Explore it below! ↓
An implementation of Deequ incremental metrics in Python.