Məhsul bələdçisi · 6 dəq oxu
ML feature CSVs: eyeballing training exports before notebooks run
Spot constant columns, label leakage, and impossible ranges in flat files before sklearn or PyTorch.
Dərc 21 mart 2025 · Table
Human scan complements automated profiling: sort numeric features, search for sentinel strings like unknown, and verify label cardinality before training.
Red flags
- Future-dated columns co-present with targets (leakage).
- IDs that sort perfectly with labels (merge bugs).