KTransformers

Fine-Tuning Model Tutorials

Use this page to decide which model tutorials belong in the current SFT section and which still need validation.

Current Tutorial Candidates

| Model family | LLaMA-Factory example | Current KT SFT scope |
| --- | --- | --- |
| DeepSeek V2 Lite | deepseek_v2_lora_sft_kt.yaml | AMXBF16, AMXINT8, AMXINT4; needs a smoke test before production wording. |
| DeepSeek V3-0324 | deepseek_v3_lora_sft_kt.yaml | AMX SFT only; FP8 source checkpoints must be converted or prepared for the target backend. |
| Qwen3-235B-A22B | qwen3moe_lora_sft_kt.yaml | AMX SFT; needs a runtime smoke test on the documented hardware tuple. |
| Qwen3.5-397B-A17B | qwen3_5moe_lora_sft_kt.yaml | Use AMXINT8 as the first documented path until BF16/INT4 runs are separately recorded. |

Not Current Yet

| Topic | Status |
| --- | --- |
| Kimi K2 / Kimi K2.5 SFT | Not current public support. Do not add to the SFT quick start until the current LLaMA-Factory path exists and passes a smoke test. |
| DPO | Unconfirmed for the current KT integration. Keep out of current support claims until validated. |
| Old kt_optimize_rule tutorials | Historical. Rewrite around current LLaMA-Factory configs before publishing as current docs. |

Required Tutorial Shape

Each model tutorial should state:

model checkpoint + source precision + target KT backend + conversion step + hardware tuple + launch command + validation result
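
The seven-part shape above can be treated as a checklist. The following is a minimal sketch, not part of KTransformers or LLaMA-Factory: the `TutorialSpec` class, its field names, and the `missing_fields` helper are hypothetical names chosen to mirror the seven items, and the sample values are placeholders.

```python
from dataclasses import dataclass, fields


@dataclass
class TutorialSpec:
    """Hypothetical record of the seven items a model tutorial should state."""
    model_checkpoint: str = ""
    source_precision: str = ""
    target_kt_backend: str = ""
    conversion_step: str = ""
    hardware_tuple: str = ""
    launch_command: str = ""
    validation_result: str = ""


def missing_fields(spec: TutorialSpec) -> list[str]:
    """Return the names of items that are still empty, in declaration order."""
    return [f.name for f in fields(spec) if not getattr(spec, f.name).strip()]


# Placeholder draft: only two of the seven items are filled in.
draft = TutorialSpec(
    model_checkpoint="DeepSeek V2 Lite",
    target_kt_backend="AMXINT8",
)
print(missing_fields(draft))
```

A draft is ready for review only when `missing_fields` returns an empty list. If a model needs no conversion, the tutorial should say so explicitly rather than leave the item blank.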

For DeepSeek V3-family SFT, always explain all three AMX precision options: AMXBF16, AMXINT8, and AMXINT4.
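
A reviewer could check this rule mechanically. The sketch below assumes the tutorial is available as plain text and that each option is mentioned by its exact name; `missing_amx_options` is a hypothetical helper, not an existing tool, and a name match only shows that an option is mentioned, not that it is explained well.

```python
# The three AMX precision options named in the rule above.
REQUIRED_AMX_OPTIONS = ("AMXBF16", "AMXINT8", "AMXINT4")


def missing_amx_options(tutorial_text: str) -> list[str]:
    """Return the required option names that never appear in the text."""
    return [opt for opt in REQUIRED_AMX_OPTIONS if opt not in tutorial_text]


# Placeholder tutorial text that forgets one option.
sample = "This tutorial covers AMXBF16 and AMXINT8 for DeepSeek V3-0324."
print(missing_amx_options(sample))
```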