Versioning Data
Created: 2023-02-13 11:50
#note
Managing data with a data versioning system provides the ability to change data continuously while ensuring consistent training results and collaboration.
Data versioning is a similar concept as code versioning -> it helps track of changes.
Versions can be at a feature, record, or dataset level.
Benefits:
- traceability;
- reproducibility;
- change log capture alternative;
- rollback to last know good state;
- multiple user can reference different versions.
Tags
#mlops #course #data