Reuse the LoRA config, add use_dora
Rank 16, alpha 32, all-linear, use_dora=True. PEFT 0.10+ supports DoRA natively. No data or LR change needed.
Decomposes the weight matrix into magnitude (a scalar per column) and direction (a unit vector). LoRA modulates only the direction, the magnitude is trained separately. Reports +1 to +4.4% over LoRA on commonsense benchmarks (LLaMA-7B/13B, LLaMA3-8B). Our default replacement for plain LoRA in 2026.
A weight update has two parts: how much (magnitude) and which way (direction). LoRA's low-rank decomposition mixes them. DoRA decomposes the pretrained weight first, then trains magnitude and direction separately. The intuition matches what full SFT does naturally and what LoRA approximates with friction.
Use the same data, the same hyperparameters, the same eval. Set use_dora=True and the trainer handles the decomposition.
Rank 16, alpha 32, all-linear, use_dora=True. PEFT 0.10+ supports DoRA natively. No data or LR change needed.
Same pipeline as LoRA. Adapter file is the same size. vLLM and LoRAX serve it the same way.
Anywhere you would use LoRA. The gain is a flag flip.
When the framework does not support it (older PEFT, custom training stacks).
We turn on use_dora unless the runtime does not support it. The accuracy gain is consistent and the cost is zero.
Sourcing, PII redaction (Presidio), synthetic data (Distilabel, Nemotron), DEITA quality scoring, MinHash + SemDedup, labeling vendors, feedback loops.
Read moreOpenAI RFT, Anthropic on Bedrock, Vertex, Azure Foundry, Databricks Mosaic, Together, Predibase, NeMo Customizer, Modal, Lambda. Side-by-side with our take.
Read moreUnder 1k examples to over 1M, single A10G to 128 B200. Indicative cost, recommended method, hardware tier.
Read moreContinued pretraining, SFT, preference optimization (DPO, SimPO, ORPO), reasoning distillation (R1 lineage), model merging (TIES, DARE). The full build pipeline.
Read moreEU AI Act (Article 25 substantial-modification trap), GDPR, HIPAA, FedRAMP, Colorado AI Act, India DPDP, China GenAI Measures. Region-by-region for tuned LLMs.
Read more