DPO Status

DPO (Direct Preference Optimization) is not yet part of the KT fine-tuning support scope.

The fine-tuning section currently covers only MoE LoRA SFT through LLaMA-Factory. Before DPO can join that path, it must be validated in the target environment, with the following information recorded:

| Item | Required information |
| --- | --- |
| Training entry | Current LLaMA-Factory command; old patching paths are historical references. |
| KT backend | Explicit `kt_config` backend that maps to current KT SFT code. |
| Model | Exact checkpoint and prepared expert weights if needed. |
| Runtime | Minimum training validation on the target hardware and software environment. |
| Output | Adapter files and at least one minimal post-training sanity check. |

Until then, older DPO pages remain historical references only.
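
The Output item above asks for adapter files plus a minimal post-training sanity check. The sketch below shows one way the file-level part of such a check might look. It assumes a PEFT-style LoRA adapter directory (`adapter_config.json` plus `adapter_model.safetensors` or `adapter_model.bin`); whether a KT DPO run would write exactly these files has not been validated. The function name `check_adapter_dir` is hypothetical and not part of KT or LLaMA-Factory.

```python
import json
from pathlib import Path

# Assumption: PEFT-style adapter layout. A KT DPO run may write different
# files; adjust these names once the real output has been inspected.
CONFIG_NAME = "adapter_config.json"
WEIGHT_NAMES = ("adapter_model.safetensors", "adapter_model.bin")


def check_adapter_dir(adapter_dir):
    """Return a list of problems; an empty list means the basic checks passed."""
    root = Path(adapter_dir)
    problems = []

    # The adapter config must exist, parse as JSON, and name a PEFT type.
    config_path = root / CONFIG_NAME
    if not config_path.is_file():
        problems.append(f"missing {CONFIG_NAME}")
    else:
        try:
            config = json.loads(config_path.read_text(encoding="utf-8"))
        except json.JSONDecodeError as exc:
            problems.append(f"{CONFIG_NAME} is not valid JSON: {exc}")
        else:
            if not isinstance(config, dict) or "peft_type" not in config:
                problems.append(f"{CONFIG_NAME} has no 'peft_type' field")

    # At least one non-empty weight file must be present.
    weights = [root / name for name in WEIGHT_NAMES if (root / name).is_file()]
    if not weights:
        problems.append("no adapter weight file found")
    elif all(path.stat().st_size == 0 for path in weights):
        problems.append("adapter weight file is empty")

    return problems
```

Calling `check_adapter_dir("path/to/output_dir")` on a training output directory returns an empty list when the files look plausible. This only confirms that the files exist and are well formed; it does not load the adapter or run inference, so a real sanity check would still need at least one generation pass with the trained adapter.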