Product guide · 6 min read
ML feature CSVs: eyeballing training exports before notebooks run
Spot constant columns, label leakage, and impossible ranges in flat files before they reach sklearn or PyTorch.
Published March 21, 2025 · Table
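A human scan complements automated profiling. Before training, sort each numeric feature to expose impossible minimums and maximums, search for sentinel strings such as "unknown", and verify that the label column has the number of distinct classes you expect.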
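The same three checks can be roughed out in a few lines as a companion to the manual scan. This is a minimal sketch using only the Python standard library, not a replacement for a profiling tool. The function name `scan_csv`, the `SENTINELS` list, and the sample column names are assumptions made for illustration.

```python
import csv
import io
import math
from collections import Counter

# Assumed sentinel spellings; extend this set for your own export.
SENTINELS = {"unknown", "n/a", "na", "null", "none", ""}


def scan_csv(text, label_col):
    """Summarize each column of a CSV held in a string, plus label counts."""
    reader = csv.DictReader(io.StringIO(text))
    rows = list(reader)
    report = {}
    for col in reader.fieldnames or []:
        # Short rows yield None for missing cells; treat those as empty.
        values = [(row.get(col) or "").strip() for row in rows]
        numeric = []
        for value in values:
            try:
                number = float(value)
            except ValueError:
                continue
            if math.isfinite(number):  # ignore "nan" and "inf" spellings
                numeric.append(number)
        distinct = len(set(values))
        report[col] = {
            "distinct": distinct,
            "constant": distinct == 1,
            "sentinels": sum(v.lower() in SENTINELS for v in values),
            "min": min(numeric) if numeric else None,
            "max": max(numeric) if numeric else None,
        }
    label_counts = Counter((row.get(label_col) or "").strip() for row in rows)
    return report, label_counts


# Hypothetical four-column export used only to show the call.
sample = (
    "age,source,country,label\n"
    "34,web,US,0\n"
    "-1,web,unknown,1\n"
    "51,web,DE,0\n"
)
report, label_counts = scan_csv(sample, "label")
for name, stats in report.items():
    print(name, stats)
print("label counts:", dict(label_counts))
```

In the sample, `source` is reported as constant, `age` has a minimum of -1 (an impossible age worth investigating), and `country` contains one sentinel value. The min and max stand in for what you would see at the top and bottom of a sorted column.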
Red flags
- Date columns with values later than the prediction time, present alongside the target (leakage).
- IDs whose sort order lines up perfectly with the labels (merge bugs).
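Both red flags can be checked mechanically once the rows are loaded. The sketch below assumes integer IDs, ISO-formatted dates (`YYYY-MM-DD`), and a per-row cutoff column that marks when the prediction would be made. The function names and the columns `snapshot_date` and `closed_date` are hypothetical.

```python
from datetime import date


def labels_sorted_by_id(rows, id_col, label_col):
    """True if ordering rows by ID also leaves the labels fully ordered."""
    ordered = sorted(rows, key=lambda r: int(r[id_col]))
    labels = [r[label_col] for r in ordered]
    if len(set(labels)) < 2:
        return False  # a single class cannot show this pattern
    return labels == sorted(labels) or labels == sorted(labels, reverse=True)


def future_dated(rows, date_col, cutoff_col):
    """Return rows whose date_col is later than that row's cutoff date."""
    flagged = []
    for r in rows:
        if not r[date_col] or not r[cutoff_col]:
            continue  # skip blanks rather than guess
        if date.fromisoformat(r[date_col]) > date.fromisoformat(r[cutoff_col]):
            flagged.append(r)
    return flagged


# Hypothetical rows, as csv.DictReader would produce them.
rows = [
    {"id": "3", "label": "1", "snapshot_date": "2025-01-31", "closed_date": "2025-02-10"},
    {"id": "1", "label": "0", "snapshot_date": "2025-01-31", "closed_date": "2025-01-15"},
    {"id": "2", "label": "0", "snapshot_date": "2025-01-31", "closed_date": "2025-01-20"},
    {"id": "4", "label": "1", "snapshot_date": "2025-01-31", "closed_date": "2025-03-01"},
]
print(labels_sorted_by_id(rows, "id", "label"))
print([r["id"] for r in future_dated(rows, "closed_date", "snapshot_date")])
```

A perfect ordering can occur by chance in a very small file, so treat a `True` result as a prompt to inspect how the export was assembled, not as proof of a bug.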