Fine-tune vision models on Hugging Face
hugging-face-vision-trainerskillsetup L3★0
Sheshiyer/skill-clusters ↗What it does
Fine-tune vision models (object detection, classification, segmentation) using Hugging Face
Best for
When fine-tuning vision models (detection/classification/segmentation) for domain-specific image tasks with automatic Hub integration.
Inputs
- · image dataset with labels
- · vision model name from Hub
- · training hyperparameters
Outputs
- · fine-tuned vision model
- · pushed to Hub
- · evaluation metrics
Requires
- · Hugging Face Transformers
- · Hugging Face Datasets
- · timm
Preconditions
labeled image dataset ≥100 samples per class, model checkpoint available on Hub
Failure modes
- · class imbalance causing poor minority performance
- · augmentation not applied causing overfitting
- · image resolution mismatch with model expectations
Trust signals
- · evaluation metrics computed (mAP, accuracy, F1)