Hugging Face Model Trainer

Name: Hugging Face Model Trainer
Author: hugging-face-model-trainer


About this skill

Use this skill when users want to train or fine-tune language models with TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure. It covers the SFT, DPO, GRPO, and reward modeling training methods, plus GGUF conversion for local deployment. It also includes guidance on the TRL Jobs package, UV scripts in PEP 723 format, dataset preparation and validation, hardware selection, cost estimation, Trackio monitoring, Hub authentication, and model persistence. Should be invoked for
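As a rough illustration of two of the topics listed above (UV scripts in PEP 723 format and dataset validation), the sketch below shows a single-file script that opens with a PEP 723 inline metadata block and runs a simple pre-flight check on preference data. The function name `find_invalid_rows` and the constant `REQUIRED_DPO_COLUMNS` are hypothetical, and the column names `prompt`, `chosen`, and `rejected` are an assumption about how a DPO-style dataset is laid out; none of this is taken from the skill itself.

```python
# /// script
# requires-python = ">=3.10"
# dependencies = []
# ///
# The comment block above is PEP 723 inline metadata. The dependency list is
# left empty because this sketch only uses the standard library; an actual
# training script would declare its packages there.
"""Hypothetical pre-flight check for a DPO-style preference dataset."""

# Assumed column names for preference data; adjust them to your dataset.
REQUIRED_DPO_COLUMNS = ("prompt", "chosen", "rejected")


def find_invalid_rows(rows, required=REQUIRED_DPO_COLUMNS):
    """Return the indices of rows that lack a required column or hold an empty value."""
    bad = []
    for i, row in enumerate(rows):
        # row.get(col) is falsy when the key is missing, None, or an empty string.
        if any(not row.get(col) for col in required):
            bad.append(i)
    return bad


if __name__ == "__main__":
    sample = [
        {"prompt": "2+2?", "chosen": "4", "rejected": "5"},
        {"prompt": "Capital of France?", "chosen": "", "rejected": "Rome"},
    ]
    print("Invalid row indices:", find_invalid_rows(sample))
```

Running a check like this locally before submitting a job is one way to catch malformed rows without spending paid compute on a run that would fail.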



License: Other
Version: 1.0.0

