Category Not Found

Free

DPO Fine-Tune of Llama 3.1 70B for customer support classification

Full fine-tuning recipe: DPO on Llama 3.1 70B via OpenRLHF, targeting 4x A100 40GB, with data mix and eval plan.

🟠Claude

2301260

DPO Fine-Tune of Mistral Small 3 for function-calling with strict JSON

Full fine-tuning recipe: DPO on Mistral Small 3 via DeepSpeed, targeting 4x A100 40GB, with data mix and eval plan.

163615

DPO Fine-Tune of Qwen 2.5 7B for legal clause extraction

Full fine-tuning recipe: DPO on Qwen 2.5 7B via Hugging Face TRL, targeting 8x H100, with data mix and eval plan.

791155

Free

DPO Fine-Tune of Qwen 2.5 32B for customer support classification

Full fine-tuning recipe: DPO on Qwen 2.5 32B via Megatron-LM, targeting 8x H100, with data mix and eval plan.

124219

DPO Fine-Tune of Gemma 2 9B for function-calling with strict JSON

Full fine-tuning recipe: DPO on Gemma 2 9B via FSDP, targeting 8x H100, with data mix and eval plan.

941503

DPO Fine-Tune of Gemma 2 27B for legal clause extraction

Full fine-tuning recipe: DPO on Gemma 2 27B via LLaMA-Factory, targeting single RTX 4090 (24GB), with data mix and eval plan.

3321122

DPO Fine-Tune of Phi-3.5-mini for customer support classification

Full fine-tuning recipe: DPO on Phi-3.5-mini via Axolotl, targeting single RTX 4090 (24GB), with data mix and eval plan.

2441264

DPO Fine-Tune of DeepSeek-V3 base for function-calling with strict JSON

Full fine-tuning recipe: DPO on DeepSeek-V3 base via FSDP, targeting single RTX 3090 (24GB), with data mix and eval plan.

245306

Free

DPO Fine-Tune of Mixtral 8x7B for legal clause extraction

Full fine-tuning recipe: DPO on Mixtral 8x7B via LLaMA-Factory, targeting single RTX 3090 (24GB), with data mix and eval plan.

2841164

DPO Fine-Tune of Yi 1.5 34B for customer support classification

Full fine-tuning recipe: DPO on Yi 1.5 34B via Axolotl, targeting single RTX 3090 (24GB), with data mix and eval plan.

181417

DPO Fine-Tune of Llama 3.3 70B for function-calling with strict JSON

Full fine-tuning recipe: DPO on Llama 3.3 70B via LitGPT, targeting 2x RTX 4090, with data mix and eval plan.

💬ChatGPT

1581362