DPO Fine-Tune of Llama 3.3 70B for customer support classification

Full fine-tuning recipe: DPO on Llama 3.3 70B via FSDP, targeting AWS p4d.24xlarge, with data mix and eval plan.

2961092

DPO Fine-Tune of Llama 3.1 70B for function-calling with strict JSON

Full fine-tuning recipe: DPO on Llama 3.1 70B via LLaMA-Factory, targeting AWS p4d.24xlarge, with data mix and eval plan.

41904

Free

DPO Fine-Tune of Mistral Small 3 for legal clause extraction

Full fine-tuning recipe: DPO on Mistral Small 3 via Axolotl, targeting Lambda Labs 8xH100, with data mix and eval plan.

2511384

DPO Fine-Tune of Qwen 2.5 7B for customer support classification

Full fine-tuning recipe: DPO on Qwen 2.5 7B via LitGPT, targeting Lambda Labs 8xH100, with data mix and eval plan.

3881310

DPO Fine-Tune of Qwen 2.5 32B for function-calling with strict JSON

Full fine-tuning recipe: DPO on Qwen 2.5 32B via torchtune, targeting Lambda Labs 8xH100, with data mix and eval plan.

303357

DPO Fine-Tune of Gemma 2 9B for legal clause extraction

Full fine-tuning recipe: DPO on Gemma 2 9B via Unsloth, targeting single A100 80GB, with data mix and eval plan.

2511068

DPO Fine-Tune of Gemma 2 27B for customer support classification

Full fine-tuning recipe: DPO on Gemma 2 27B via LitGPT, targeting single A100 80GB, with data mix and eval plan.

3191240

Free

DPO Fine-Tune of Phi-3.5-mini for function-calling with strict JSON

Full fine-tuning recipe: DPO on Phi-3.5-mini via torchtune, targeting single H100 80GB, with data mix and eval plan.

1511030

DPO Fine-Tune of DeepSeek-V3 base for legal clause extraction

Full fine-tuning recipe: DPO on DeepSeek-V3 base via Unsloth, targeting single H100 80GB, with data mix and eval plan.

491207

DPO Fine-Tune of Mixtral 8x7B for customer support classification

Full fine-tuning recipe: DPO on Mixtral 8x7B via OpenRLHF, targeting single H100 80GB, with data mix and eval plan.

3341059

Free

DPO Fine-Tune of Yi 1.5 34B for function-calling with strict JSON

Full fine-tuning recipe: DPO on Yi 1.5 34B via DeepSpeed, targeting 2x A100 80GB, with data mix and eval plan.

286949

DPO Fine-Tune of Llama 3.3 70B for legal clause extraction

Full fine-tuning recipe: DPO on Llama 3.3 70B via Hugging Face TRL, targeting 2x A100 80GB, with data mix and eval plan.