Category Not Found

DPO Fine-Tune of Llama 3.1 70B for technical documentation QA

Full fine-tuning recipe: DPO on Llama 3.1 70B via DeepSpeed, targeting 2x A100 80GB, with data mix and eval plan.

🟠Claude

52504

Free

DPO Fine-Tune of Mistral Nemo 12B for financial report summarization

Full fine-tuning recipe: DPO on Mistral Nemo 12B via Hugging Face TRL, targeting 2x A100 80GB, with data mix and eval plan.

🟠Claude

116339

DPO Fine-Tune of Qwen 2.5 7B for SQL-from-text generation

Full fine-tuning recipe: DPO on Qwen 2.5 7B via Megatron-LM, targeting 4x A100 40GB, with data mix and eval plan.

381973

DPO Fine-Tune of Qwen 2.5 32B for technical documentation QA

Full fine-tuning recipe: DPO on Qwen 2.5 32B via FSDP, targeting 4x A100 40GB, with data mix and eval plan.

350516

DPO Fine-Tune of Gemma 2 9B for financial report summarization

Full fine-tuning recipe: DPO on Gemma 2 9B via LLaMA-Factory, targeting 8x H100, with data mix and eval plan.

61419

DPO Fine-Tune of Gemma 2 27B for SQL-from-text generation

Full fine-tuning recipe: DPO on Gemma 2 27B via Axolotl, targeting 8x H100, with data mix and eval plan.

55922

Free

DPO Fine-Tune of Phi-4 for technical documentation QA

Full fine-tuning recipe: DPO on Phi-4 via FSDP, targeting 8x H100, with data mix and eval plan.

214692

DPO Fine-Tune of DeepSeek-V3 base for financial report summarization

Full fine-tuning recipe: DPO on DeepSeek-V3 base via LLaMA-Factory, targeting single RTX 4090 (24GB), with data mix and eval plan.

90611

DPO Fine-Tune of Mixtral 8x22B for SQL-from-text generation

Full fine-tuning recipe: DPO on Mixtral 8x22B via Axolotl, targeting single RTX 4090 (24GB), with data mix and eval plan.

43873

DPO Fine-Tune of Yi 1.5 34B for technical documentation QA

Full fine-tuning recipe: DPO on Yi 1.5 34B via LitGPT, targeting single RTX 3090 (24GB), with data mix and eval plan.

441216

DPO Fine-Tune of Llama 3.1 8B for financial report summarization

Full fine-tuning recipe: DPO on Llama 3.1 8B via torchtune, targeting single RTX 3090 (24GB), with data mix and eval plan.

2601075