FilterKit¶
FilterKit is the dataset filtering tool, launched via filterkit.
Purpose¶
Filter and curate datasets for quality and diversity before training.
Launch¶
Workflow¶
- Load a dataset directory.
- Apply filtering criteria (quality, diversity, metadata).
- Preview and validate the filtered subset.
- Export the filtered dataset for training or analysis.