This skill should be used when users want to train or fine-tune language models using TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure. Covers SFT, DPO, GRPO and reward modeling training methods, plus GGUF conversion for local deployment. Includes guidance on the TRL Jobs package, UV scripts with PEP 723 format, dataset preparation and validation, hardware selection, cost estimation, Trackio monitoring, Hub authentication, and model persistence.
Add this skill
npx mdskills install huggingface/hugging-face-model-trainerComprehensive TRL training guide with clear MCP integration, multi-method support, and practical examples
No comments yet. Sign in to start the discussion.
Threaded comments with markdown support coming soon.