๐ Data Catalog
Browse Kedro data layers and track data lineage
๐ฅ
01_raw
Raw data from APIs (YES Energy, CAISO, etc.)
Datasets: -
Last Updated: -
Total Size: -
๐งน
02_intermediate
Cleaned and validated data
Datasets: -
Last Updated: -
Total Size: -
๐
03_primary
Primary datasets (assumptions, series definitions)
Datasets: -
Last Updated: -
Total Size: -
๐ง
04_feature
Engineered features for ML models
Datasets: -
Last Updated: -
Total Size: -
๐
05_model_input
Data prepared for model training/inference
Datasets: -
Last Updated: -
Total Size: -
๐ค
06_models
Trained models and model artifacts
Datasets: -
Last Updated: -
Total Size: -
๐
07_model_output
Forecast outputs (8,760 hourly values per forecast)
Datasets: -
Last Updated: -
Total Size: -
๐
08_reporting
Validation results, scorecards, comparisons
Datasets: -
Last Updated: -
Total Size: -
๐
09_tracking
Lineage, metadata, run tracking
Datasets: -
Last Updated: -
Total Size: -
Recent Pipeline Runs
Each pipeline run creates datasets in the data catalog
No pipeline runs yet. Trigger a forecast to see data catalog activity.
Understanding Data Flow
Kedro organizes data into layers, each representing a stage in the pipeline:
01_raw โ 02_intermediate โ 03_primary โ 04_feature โ 05_model_input
โ 06_models โ 07_model_output โ 08_reporting โ 09_tracking
โ 06_models โ 07_model_output โ 08_reporting โ 09_tracking
Each layer contains datasets that can be traced back through the pipeline, providing complete data lineage and reproducibility.