KTransformers

DeepSeek Models

DeepSeek is a primary KTransformers model family for both inference and supervised fine-tuning (SFT). Keep inference and fine-tuning claims separate, because the two use different methods and package paths.

Inference

| Model | Entry | Method | Status |
|---|---|---|---|
| DeepSeek V4-Flash | kt run deepseek-v4-flash | MXFP4 | Needs smoke; narrow path. |
| DeepSeek V3.2 | kt run deepseek-v3.2 or manual tutorial | Registry uses FP8; older tutorial uses AMXINT4 | Needs reconciliation and smoke. |
| DeepSeek V3-0324 / R1-0528 | kt run deepseek-v3 / kt run deepseek-r1 | Registry default AMXINT4 | Current / Needs docs. |
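
The inference rows can also be kept as data, so that status checks during migration do not depend on re-reading the table. The sketch below is illustrative only: `InferenceEntry`, `INFERENCE`, and `needs_smoke_test` are hypothetical names, not part of KTransformers, and pairing V3-0324 with `kt run deepseek-v3` and R1-0528 with `kt run deepseek-r1` assumes the two slash-separated lists are in the same order.

```python
from typing import NamedTuple


class InferenceEntry(NamedTuple):
    """One row of the inference table (hypothetical helper type)."""
    command: str  # CLI entry point as listed in the table
    method: str   # quantization / kernel method as listed
    status: str   # documentation status as listed


# Rows copied from the inference table; the V3 / R1 pairing is an assumption.
INFERENCE = {
    "DeepSeek V4-Flash": InferenceEntry(
        "kt run deepseek-v4-flash", "MXFP4", "Needs smoke; narrow path"),
    "DeepSeek V3.2": InferenceEntry(
        "kt run deepseek-v3.2",
        "FP8 (registry); AMXINT4 (older tutorial)",
        "Needs reconciliation and smoke"),
    "DeepSeek V3-0324": InferenceEntry(
        "kt run deepseek-v3", "AMXINT4 (registry default)", "Current / Needs docs"),
    "DeepSeek R1-0528": InferenceEntry(
        "kt run deepseek-r1", "AMXINT4 (registry default)", "Current / Needs docs"),
}


def needs_smoke_test(model: str) -> bool:
    """Return True if the row's status still calls for a smoke test."""
    return "smoke" in INFERENCE[model].status.lower()
```

For example, `needs_smoke_test("DeepSeek V3.2")` reports whether that row is still waiting on a smoke run, and `INFERENCE["DeepSeek V3.2"].command` gives the entry command to run.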

Fine-Tuning

For fine-tuning, use DeepSeek SFT. Public DeepSeek V3 checkpoints may ship in FP8, but current KT SFT runs with AMXBF16, AMXINT8, or AMXINT4; it does not perform native FP8 SFT.
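
The method constraint above can be written as a small guard that rejects an unsupported method before a fine-tuning job starts. This is a minimal sketch, not the KT SFT API: `SUPPORTED_SFT_METHODS` and `check_sft_method` are hypothetical names, and the only fact taken from this page is the set of three method names.

```python
# Methods this page lists for current KT SFT; FP8 is deliberately absent.
SUPPORTED_SFT_METHODS = frozenset({"AMXBF16", "AMXINT8", "AMXINT4"})


def check_sft_method(method: str) -> str:
    """Normalize a method name and reject anything outside the listed set.

    Hypothetical helper for illustration; not a KTransformers function.
    """
    normalized = method.strip().upper()
    if normalized not in SUPPORTED_SFT_METHODS:
        allowed = ", ".join(sorted(SUPPORTED_SFT_METHODS))
        raise ValueError(
            f"SFT method {method!r} is not listed for KT SFT; expected one of: {allowed}"
        )
    return normalized
```

Calling `check_sft_method("amxint8")` returns the normalized name, while `check_sft_method("FP8")` raises `ValueError`, which matches the statement that an FP8 checkpoint does not imply native FP8 SFT.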

Documentation Source

Current GitHub sources to migrate carefully:

  • doc/en/DeepSeek-V4-Flash.md
  • doc/en/kt-kernel/deepseek-v3.2-sglang-tutorial.md
  • doc/en/SFT/KTransformers-Fine-Tuning_Quick-Start.md
  • doc/en/SFT/KTransformers-Fine-Tuning_User-Guide.md