Complete fine-tuning recipe: LoRA on DeepSeek-V3 base via LLaMA-Factory, targeting 8x H100 (the ~671B-parameter MoE base needs a 4-bit quantized load to fit in roughly 640 GB of combined VRAM), with data mix and eval plan; see the LLaMA-Factory, data-mix, and eval sketches after this list.
Complete fine-tuning recipe: LoRA on Mixtral 8x7B via Axolotl, targeting a single RTX 4090 (24 GB; only approachable as QLoRA with a 4-bit base, short sequences, and likely some offloading), with data mix and eval plan; see the Axolotl sketch after this list.
Complete fine-tuning recipe: LoRA on Yi 1.5 34B via LitGPT, targeting a single RTX 4090 (24 GB; QLoRA with a 4-bit base is required), with data mix and eval plan.
Complete fine-tuning recipe: LoRA on Llama 3.3 70B via LLaMA-Factory, targeting a single RTX 3090 (24 GB; even a 4-bit 70B base is roughly 35 GB, so expect CPU offloading and slow steps; see the VRAM-budget sketch after this list), with data mix and eval plan.
Complete fine-tuning recipe: LoRA on Llama 3.1 70B via Axolotl, targeting a single RTX 3090 (24 GB; same caveat as above, since 4-bit weights alone exceed the card), with data mix and eval plan.
Complete fine-tuning recipe: LoRA on Mistral Small 3 via LitGPT, targeting a single RTX 3090 (24 GB; needs a 4-bit (QLoRA) base), with data mix and eval plan.
Complete fine-tuning recipe: LoRA on Mistral NeMo 12B via torchtune, targeting 2x RTX 4090, with data mix and eval plan; see the torchtune sketch after this list.
Complete fine-tuning recipe: LoRA on Mistral NeMo 12B via Megatron-LM, targeting a single A100 (80 GB), with data mix and eval plan; note that LoRA/PEFT in the Megatron stack is usually exposed through NeMo or Megatron-Core rather than the plain Megatron-LM pretraining scripts.
Complete fine-tuning recipe: LoRA on Llama 3.1 8B via Unsloth, targeting 8x H100 (open-source Unsloth is primarily single-GPU, so this usually means independent or data-parallel runs), with data mix and eval plan; see the Unsloth sketch after this list.
Complete fine-tuning recipe: LoRA on Llama 3.1 8B via PyTorch FSDP, targeting AWS g5.12xlarge (4x A10G, 24 GB each), with data mix and eval plan; see the FSDP sketch after this list.
Complete fine-tuning recipe: LoRA on Llama 3.1 8B via Unsloth, targeting 2x A100 80 GB (same single-GPU caveat as above), with data mix and eval plan.
Complete fine-tuning recipe: LoRA on Mixtral 8x22B via LLaMA-Factory, targeting a single RTX 3090 (24 GB; not realistic without heavy offloading, since a 4-bit 8x22B base alone is roughly 70 GB), with data mix and eval plan.
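The sketches that follow illustrate several of the setups above; none is a tested recipe, and every dataset name, path, and hyperparameter is an assumption to be replaced. First, the DeepSeek-V3 / LLaMA-Factory entry: LLaMA-Factory is driven by a YAML config, so this sketch builds one as a Python dict and writes it to disk. Key names follow LLaMA-Factory's published LoRA examples but should be checked against the installed version.

```python
# Hypothetical LLaMA-Factory LoRA config for DeepSeek-V3 base on 8x H100.
# All values are assumptions; verify key names against your LLaMA-Factory version.
import yaml  # pip install pyyaml

config = {
    # model
    "model_name_or_path": "deepseek-ai/DeepSeek-V3-Base",
    "quantization_bit": 4,            # 4-bit base weights so the ~671B MoE fits in 8x80 GB
    # method
    "stage": "sft",
    "do_train": True,
    "finetuning_type": "lora",
    "lora_rank": 16,
    "lora_alpha": 32,
    "lora_target": "all",             # or an explicit list of projection modules
    # data ("my_sft_mix" is a placeholder registered in data/dataset_info.json)
    "dataset": "my_sft_mix",
    "template": "deepseek3",          # verify the template name shipped for DeepSeek-V3
    "cutoff_len": 4096,
    # training
    "output_dir": "outputs/deepseek-v3-lora",
    "per_device_train_batch_size": 1,
    "gradient_accumulation_steps": 8,
    "learning_rate": 1.0e-4,
    "num_train_epochs": 2.0,
    "lr_scheduler_type": "cosine",
    "bf16": True,
    # eval
    "val_size": 0.02,
    "per_device_eval_batch_size": 1,
    "eval_strategy": "steps",
    "eval_steps": 200,
}

with open("deepseek_v3_lora.yaml", "w") as f:
    yaml.safe_dump(config, f, sort_keys=False)

# Launch (assumption; check the docs for the multi-GPU invocation):
#   llamafactory-cli train deepseek_v3_lora.yaml
```

The same pattern applies to the Llama 3.3 70B and Mixtral 8x22B LLaMA-Factory entries, with the VRAM caveats already noted in the list.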
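For the Axolotl entries (Mixtral 8x7B and Llama 3.1 70B on 24 GB cards), a hypothetical QLoRA-style config. Key names mirror Axolotl's example configs but should be checked against the installed version, and the dataset and hyperparameters are placeholders.

```python
# Hypothetical Axolotl QLoRA config for Mixtral 8x7B on a single 24 GB RTX 4090.
from pathlib import Path

AXOLOTL_CONFIG = """\
base_model: mistralai/Mixtral-8x7B-v0.1

load_in_4bit: true          # a 4-bit base is the only way a ~47B MoE approaches 24 GB
adapter: qlora
lora_r: 16
lora_alpha: 32
lora_dropout: 0.05
lora_target_linear: true

datasets:
  - path: yahma/alpaca-cleaned   # placeholder dataset
    type: alpaca
val_set_size: 0.02

sequence_len: 1024          # keep short; activations are the other big VRAM consumer
sample_packing: true
flash_attention: true
micro_batch_size: 1
gradient_accumulation_steps: 16
num_epochs: 2
learning_rate: 0.0002
lr_scheduler: cosine
optimizer: paged_adamw_8bit
bf16: auto
gradient_checkpointing: true
output_dir: ./outputs/mixtral-qlora
"""

Path("mixtral_qlora.yml").write_text(AXOLOTL_CONFIG)
# Then (assumption; check the docs): accelerate launch -m axolotl.cli.train mixtral_qlora.yml
```

Recent Axolotl releases also expose an `axolotl train` entry point, so the launch command should be checked against the installed version as well.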
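For the Unsloth entries, a minimal Python sketch of 4-bit loading plus LoRA adapters and a TRL SFTTrainer run. The model id, dataset, and hyperparameters are placeholders, and argument placement in SFTTrainer/SFTConfig shifts between trl versions; since open-source Unsloth is primarily single-GPU, the 8x H100 and 2x A100 targets generally mean parallel independent runs of something like this.

```python
# Hypothetical Unsloth LoRA run for Llama 3.1 8B; all values are assumptions.
from unsloth import FastLanguageModel
from trl import SFTTrainer, SFTConfig
from datasets import load_dataset

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Meta-Llama-3.1-8B",  # Unsloth's mirror of the base model
    max_seq_length=4096,
    load_in_4bit=True,                       # set False on 80 GB GPUs if preferred
)
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=32,
    lora_dropout=0.0,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)

# Placeholder dataset with a ready-made "text" column; substitute the real data mix.
dataset = load_dataset("mlabonne/guanaco-llama2-1k", split="train")

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,                     # newer trl calls this processing_class
    train_dataset=dataset,
    args=SFTConfig(
        output_dir="outputs/llama31-8b-unsloth-lora",
        dataset_text_field="text",
        per_device_train_batch_size=2,
        gradient_accumulation_steps=8,
        learning_rate=2e-4,
        num_train_epochs=1,
        logging_steps=10,
        bf16=True,
    ),
)
trainer.train()
```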
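For the Mistral NeMo 12B / torchtune entry on 2x RTX 4090, torchtune is usually driven from its CLI, so this sketch just shells out to it. Recipe and config names must be verified with `tune ls`, and torchtune may not ship a NeMo-12B config at all, in which case the closest Mistral config would need to be copied (`tune cp`) and edited.

```python
# Hypothetical launch of torchtune's distributed LoRA recipe on 2 GPUs.
# Verify recipe/config names with `tune ls`; the config below is a placeholder.
import subprocess

subprocess.run(
    [
        "tune", "run",
        "--nproc_per_node", "2",            # one process per RTX 4090
        "lora_finetune_distributed",        # torchtune's multi-GPU LoRA recipe
        "--config", "mistral/7B_lora",      # placeholder: swap in a NeMo-12B config
        "output_dir=outputs/nemo12b-lora",  # config overrides use key=value form
    ],
    check=True,
)
```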
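For the Llama 3.1 8B / FSDP entry on a g5.12xlarge (4x A10G, 24 GB each), a bare-bones sketch of wrapping a PEFT LoRA model in PyTorch FSDP. The data pipeline and training loop are elided and all LoRA settings are assumptions; `use_orig_params=True` matters here because only the adapter weights require gradients.

```python
# Hypothetical FSDP + LoRA skeleton; launch with e.g.:
#   torchrun --nproc_per_node 4 fsdp_lora.py
import functools
import os

import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP, MixedPrecision
from torch.distributed.fsdp.wrap import transformer_auto_wrap_policy
from transformers import AutoModelForCausalLM
from transformers.models.llama.modeling_llama import LlamaDecoderLayer
from peft import LoraConfig, get_peft_model


def main():
    dist.init_process_group("nccl")
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)

    model = AutoModelForCausalLM.from_pretrained(
        "meta-llama/Llama-3.1-8B", torch_dtype=torch.bfloat16
    )
    model = get_peft_model(
        model,
        LoraConfig(
            r=16,
            lora_alpha=32,
            lora_dropout=0.05,
            target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
            task_type="CAUSAL_LM",
        ),
    )

    model = FSDP(
        model,
        auto_wrap_policy=functools.partial(
            transformer_auto_wrap_policy,
            transformer_layer_cls={LlamaDecoderLayer},
        ),
        mixed_precision=MixedPrecision(
            param_dtype=torch.bfloat16, reduce_dtype=torch.bfloat16
        ),
        use_orig_params=True,   # required when only the LoRA params require grad
        device_id=local_rank,
    )

    # ... build the dataloader and optimizer over trainable params, then the usual loop ...

    dist.destroy_process_group()


if __name__ == "__main__":
    main()
```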
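The single-24 GB entries above hinge on a simple memory budget: 4-bit base weights plus bf16 LoRA adapters plus fp32 Adam state for the adapters. The 1% adapter fraction below is an assumption, and activations, KV cache, and framework overhead come on top, so treat these numbers as lower bounds.

```python
# Rough lower-bound VRAM estimate for QLoRA-style training (activations not included).
def qlora_vram_gb(n_params_billion: float, lora_frac: float = 0.01) -> float:
    n = n_params_billion * 1e9
    base = n * 0.5 / 1e9               # ~0.5 bytes/param for 4-bit base weights
    adapters = n * lora_frac * 2 / 1e9  # bf16 adapter weights
    adam = n * lora_frac * 8 / 1e9      # fp32 Adam first/second moments for adapters
    return base + adapters + adam

for name, size in [("Mistral Small 3 (24B)", 24), ("Yi 1.5 34B", 34),
                   ("Llama 3.x 70B", 70), ("Mixtral 8x22B (~141B)", 141)]:
    print(f"{name}: ~{qlora_vram_gb(size):.0f} GB before activations")
```

On these numbers the 24B and 34B models fit a 24 GB card (tightly, for the 34B), while the 70B and 8x22B entries cannot avoid offloading or more GPUs.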
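Every entry above calls for a data mix; one way to build it with Hugging Face `datasets` is to normalize each source to a shared prompt/response schema and interleave with fixed sampling probabilities. The datasets, column names, and weights below are placeholders, not recommendations.

```python
# A sketch of the "data mix" step: weighted interleaving of instruction datasets.
from datasets import load_dataset, interleave_datasets

general = load_dataset("yahma/alpaca-cleaned", split="train")
math = load_dataset("meta-math/MetaMathQA", split="train")

# Normalize both sources to the same two columns so they can be interleaved.
general = general.map(
    lambda ex: {"prompt": (ex["instruction"] + "\n\n" + ex["input"]).strip(),
                "response": ex["output"]},
    remove_columns=general.column_names,
)
math = math.map(
    lambda ex: {"prompt": ex["query"], "response": ex["response"]},
    remove_columns=math.column_names,
)

mix = interleave_datasets(
    [general, math],
    probabilities=[0.7, 0.3],           # sampling weights for the mix
    seed=42,
    stopping_strategy="all_exhausted",  # keep drawing until every source is used up
).shuffle(seed=42)

print(mix)
```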
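Finally, a sketch of the eval-plan piece: score the base model and the tuned adapter on the same task suite with lm-evaluation-harness and compare. The task list, adapter path, and batch size are assumptions; the `peft=` argument points the harness's HF backend at a saved LoRA adapter.

```python
# Hypothetical before/after evaluation with lm-evaluation-harness (pip install lm-eval).
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    model_args=(
        "pretrained=meta-llama/Llama-3.1-8B,"
        "peft=outputs/llama31-8b-lora,"   # path to the trained adapter (assumption)
        "dtype=bfloat16"
    ),
    tasks=["mmlu", "gsm8k", "ifeval"],    # example task suite
    batch_size=8,
)
for task, metrics in results["results"].items():
    print(task, metrics)
```

Running the same call once with and once without the `peft=` argument gives the before/after comparison for the eval plan.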