20
Data Engineering for ML
Feature stores, data versioning, and quality pipelines
3 lessons · 125 min total · advanced
1
Feature Stores
What feature stores solve (training-serving skew), Feast (offline/online stores, feature services, materialization), Tecton, feature engineering best practices, and point-in-time correctness
2 exercises · Quiz · 45 min
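A small illustration of the point-in-time correctness topic listed for this lesson: when building a training row, only feature values that already existed at the label's timestamp may be used, otherwise future information leaks into training. This is a minimal standard-library sketch, not Feast's or Tecton's API; the function name `point_in_time_lookup` and the sample data are hypothetical.

```python
from bisect import bisect_right


def point_in_time_lookup(feature_rows, event_time):
    """Return the latest feature value with timestamp <= event_time, or None.

    feature_rows: list of (timestamp, value) tuples sorted by timestamp.
    """
    timestamps = [ts for ts, _ in feature_rows]
    # Index of the first feature row strictly after event_time.
    idx = bisect_right(timestamps, event_time)
    if idx == 0:
        return None  # no feature value existed yet at event_time
    return feature_rows[idx - 1][1]


# Hypothetical feature history for one user: (day, purchases_last_30d)
history = [(1, 0), (5, 2), (9, 7)]

# Hypothetical days on which training labels were observed
label_days = [3, 5, 8, 12]

# Each training row sees only the feature value known on its label day.
training_rows = [(day, point_in_time_lookup(history, day)) for day in label_days]
print(training_rows)
```

The label on day 8 is joined with the value written on day 5, not the later value from day 9. A naive "latest value" join would use day 9's value and leak future data.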
2
Data Versioning
DVC (tracking data, pipelines, experiments), LakeFS (git-like branching for data), data lineage, reproducibility, and dataset registries
2 exercises · Quiz · 40 min
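The idea behind the tools in this lesson can be sketched in a few lines: a dataset version is identified by a hash of its content, so identical data always maps to the same version and any change produces a new one. The sketch below uses SHA-256 from the standard library as a stand-in; it does not reproduce DVC's or LakeFS's actual metadata format, and `dataset_fingerprint` and `registry` are hypothetical names.

```python
import hashlib


def dataset_fingerprint(data: bytes) -> str:
    """Return a hex digest that identifies this exact dataset content."""
    return hashlib.sha256(data).hexdigest()


# Two hypothetical versions of a small CSV dataset
v1 = b"id,label\n1,cat\n2,dog\n"
v2 = b"id,label\n1,cat\n2,dog\n3,bird\n"

# Hypothetical in-memory dataset registry: fingerprint -> description
registry = {
    dataset_fingerprint(v1): "v1: initial labels",
    dataset_fingerprint(v2): "v2: added bird row",
}

# Identical bytes give the same fingerprint; any edit gives a new one.
same_content = dataset_fingerprint(v1) == dataset_fingerprint(bytes(v1))
changed = dataset_fingerprint(v1) != dataset_fingerprint(v2)
print(same_content, changed, len(registry))
```

Recording the fingerprint alongside the code commit and model artifact is one simple way to make a training run reproducible: the exact data can be looked up later by its hash.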
3
Data Quality
Great Expectations (expectations, suites, checkpoints, data docs), schema validation, anomaly detection, drift monitoring, and data contracts
2 exercises · Quiz · 40 min
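To give a flavor of the expectation-and-suite pattern named in this lesson, here is a minimal sketch in plain Python: each check is a declarative assertion about a column, and a suite passes only if every check passes. This is not the Great Expectations API; the helpers `check_not_null` and `check_between` and the sample rows are hypothetical.

```python
def check_not_null(rows, column):
    """Fail for every row where the column is missing or None."""
    failed = [i for i, row in enumerate(rows) if row.get(column) is None]
    return {"check": "not_null", "column": column,
            "success": not failed, "failed_rows": failed}


def check_between(rows, column, low, high):
    """Fail for every non-null value outside [low, high]; nulls are skipped."""
    failed = [
        i for i, row in enumerate(rows)
        if row.get(column) is not None and not (low <= row[column] <= high)
    ]
    return {"check": "between", "column": column,
            "success": not failed, "failed_rows": failed}


# Hypothetical batch of records to validate
rows = [
    {"age": 34, "country": "DE"},
    {"age": None, "country": "US"},
    {"age": 212, "country": "FR"},
]

# A "suite" here is just a list of check results for one batch.
results = [
    check_not_null(rows, "age"),
    check_between(rows, "age", 0, 120),
    check_not_null(rows, "country"),
]
suite_passed = all(r["success"] for r in results)
print(suite_passed, [r["failed_rows"] for r in results])
```

Returning structured results instead of raising on the first failure lets a pipeline report every violated check for a batch before deciding whether to block it.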