📚 Data Catalog

Browse Kedro data layers and track data lineage

📥

01_raw

Raw data from APIs (YES Energy, CAISO, etc.)

Datasets: -
Last Updated: -
Total Size: -
🧹

02_intermediate

Cleaned and validated data

Datasets: -
Last Updated: -
Total Size: -
📋

03_primary

Primary datasets (assumptions, series definitions)

Datasets: -
Last Updated: -
Total Size: -
🔧

04_feature

Engineered features for ML models

Datasets: -
Last Updated: -
Total Size: -
📊

05_model_input

Data prepared for model training/inference

Datasets: -
Last Updated: -
Total Size: -
🤖

06_models

Trained models and model artifacts

Datasets: -
Last Updated: -
Total Size: -
📈

07_model_output

Forecast outputs (8,760 hourly values per forecast, i.e. 365 days × 24 hours)

Datasets: -
Last Updated: -
Total Size: -
📑

08_reporting

Validation results, scorecards, comparisons

Datasets: -
Last Updated: -
Total Size: -
🔍

09_tracking

Lineage, metadata, run tracking

Datasets: -
Last Updated: -
Total Size: -
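
The three fields on each card (Datasets, Last Updated, Total Size) could be derived by scanning one directory per layer. The sketch below is illustrative only, not how this page actually computes them. It assumes a local `data/` folder with one subfolder per layer, treats each non-hidden file as one dataset, and uses a hypothetical helper name `layer_stats`.

```python
from datetime import datetime, timezone
from pathlib import Path

# The nine layer names shown on the cards above.
LAYERS = [
    "01_raw", "02_intermediate", "03_primary", "04_feature", "05_model_input",
    "06_models", "07_model_output", "08_reporting", "09_tracking",
]


def layer_stats(data_root: Path, layer: str) -> dict:
    """Summarise one layer directory: dataset count, last update, total size."""
    layer_dir = data_root / layer
    files = []
    if layer_dir.is_dir():
        # Assumption: every regular, non-hidden file is one dataset
        # (this skips placeholders such as .gitkeep).
        files = [
            p for p in layer_dir.rglob("*")
            if p.is_file() and not p.name.startswith(".")
        ]
    if not files:
        # Mirror the "-" placeholders shown for an empty layer.
        return {"datasets": "-", "last_updated": "-", "total_size": "-"}
    stats = [p.stat() for p in files]
    newest = max(s.st_mtime for s in stats)
    return {
        "datasets": len(files),
        "last_updated": datetime.fromtimestamp(newest, tz=timezone.utc).isoformat(),
        "total_size": sum(s.st_size for s in stats),  # bytes
    }


if __name__ == "__main__":
    for layer in LAYERS:
        print(layer, layer_stats(Path("data"), layer))
```

A layer with no files reports "-" for every field, which matches the empty state shown above before any pipeline run.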

Recent Pipeline Runs

Each pipeline run creates datasets in the data catalog

No pipeline runs yet. Trigger a forecast to see data catalog activity.

Understanding Data Flow

Kedro organizes data into layers, each representing a stage in the pipeline:

01_raw → 02_intermediate → 03_primary → 04_feature → 05_model_input
→ 06_models → 07_model_output → 08_reporting → 09_tracking

Each layer contains datasets that can be traced back through the pipeline, providing complete data lineage and reproducibility.
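
To make "traced back through the pipeline" concrete, here is a minimal standalone sketch of a lineage lookup. It does not use Kedro's API. The dataset names and the `PARENTS` mapping are hypothetical, invented for illustration, and `trace_lineage` is an assumed helper name. In a real project the parent relationships would come from the pipeline's node inputs and outputs.

```python
from collections import deque

# Hypothetical lineage records: each dataset maps to the datasets it was built from.
PARENTS = {
    "07_model_output/forecast": ["06_models/model", "05_model_input/inference_table"],
    "06_models/model": ["05_model_input/training_table"],
    "05_model_input/inference_table": ["04_feature/features"],
    "05_model_input/training_table": ["04_feature/features"],
    "04_feature/features": ["02_intermediate/clean_prices", "03_primary/assumptions"],
    "02_intermediate/clean_prices": ["01_raw/prices"],
}


def trace_lineage(dataset: str, parents: dict) -> list:
    """Return every upstream dataset of `dataset`, nearest ancestors first."""
    seen = []
    queue = deque(parents.get(dataset, []))
    while queue:
        current = queue.popleft()
        if current in seen:
            continue  # already visited via another path
        seen.append(current)
        queue.extend(parents.get(current, []))
    return seen


if __name__ == "__main__":
    for name in trace_lineage("07_model_output/forecast", PARENTS):
        print(name)
```

The breadth-first walk lists a forecast's direct inputs first, then their inputs, and continues back to the raw layer. Shared ancestors such as the feature table appear only once.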