Query Hugging Face datasets
hugging-face-dataset-viewerskillsetup L2★0
Sheshiyer/skill-clusters ↗What it does
Browse and preview datasets on Hugging Face
Best for
Inspect and validate Hugging Face datasets before training without downloading entire dataset.
Inputs
- · dataset name or path
- · split (train/val/test)
Outputs
- · dataset preview (first N rows)
- · schema with dtypes
- · statistics (size, unique values)
Requires
- · Hugging Face Datasets library
- · browser (optional for GUI)
Preconditions
Dataset publicly available on Hub or accessible via credentials
Failure modes
- · Dataset too large to load into memory
- · File format incompatible
- · Missing required columns
Trust signals
- · Streaming mode for large datasets
- · Schema validation
- · Statistics computation on remote data