Popular Model Usage

Use this page to route from a model family to its current entry point. Exact validation status lives in the Support Matrix; individual model pages describe the current support scope and remaining validation needs.

| Model family | Current entry | Notes |
| --- | --- | --- |
| DeepSeek V4-Flash | `kt run deepseek-v4-flash` or manual `MXFP4` launch | Narrow path; verify package and attention constraints before production use. |
| DeepSeek V3.2 | `kt run deepseek-v3.2` for registry FP8; tutorial path still mentions AMXINT4 | Needs reconciliation between registry default and tutorial method. |
| DeepSeek V3 / R1 | `kt run deepseek-v3`, `kt run deepseek-r1` | Current registry exists; older DeepSeek pages using legacy servers are historical references. |
| Kimi K2 Thinking | `kt run kimi-k2-thinking` or RAWINT4 manual launch | Use backend-specific behavior within the scope described on the model page. |
| MiniMax M2 / M2.1 | `kt run m2`, `kt run m2.1` | Registry includes parser defaults and tensor-parallel constraints. |
| MiniMax M2.5 | Manual SGLang-KT tutorial | Needs minimum validation before use in the target environment. |
| Qwen3 / Qwen3.5 / Qwen3-Coder-Next | Manual BF16, FP8, or GPTQ_INT4 examples | Choose method by exact checkpoint and CPU backend. |
| GLM-5 / GLM-5.1 | Manual BF16, FP8, or FP8_PERCHANNEL examples | Treat transformer version constraints as part of the supported configuration. |

Before Copying Commands

A command is directly reusable only when all of these fields match between the documented setup and your target environment:

model family + checkpoint + KT method + CPU ISA/backend + GPU count + package versions

If any field changes, treat the command as a starting point and run a minimum validation on your target environment.
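
The comparison above can be sketched as a small pre-flight check. This is a minimal illustration, not part of the `kt` tooling: `LaunchContext`, `changed_fields`, and every field value other than the model family and method name are hypothetical placeholders, and nothing here is taken from the Support Matrix.

```python
from dataclasses import dataclass, fields, replace


@dataclass(frozen=True)
class LaunchContext:
    """Hypothetical record of the six fields that must match."""
    model_family: str
    checkpoint: str
    kt_method: str
    cpu_backend: str          # CPU ISA/backend
    gpu_count: int
    package_versions: tuple   # (name, version) pairs


def changed_fields(documented: LaunchContext, target: LaunchContext) -> list[str]:
    """Return the names of the fields that differ between the two contexts."""
    return [
        f.name
        for f in fields(LaunchContext)
        if getattr(documented, f.name) != getattr(target, f.name)
    ]


# Placeholder values for illustration only.
documented = LaunchContext(
    model_family="DeepSeek V3.2",
    checkpoint="example-checkpoint",
    kt_method="FP8",
    cpu_backend="example-isa",
    gpu_count=8,
    package_versions=(("example-package", "1.0"),),
)

# The target environment differs only in GPU count.
target = replace(documented, gpu_count=4)

mismatches = changed_fields(documented, target)
if mismatches:
    print("Starting point only; run a minimum validation for:", ", ".join(mismatches))
else:
    print("All fields match; the command is directly reusable.")
```

Recording the context next to each copied command makes it easy to see which field changed when a launch behaves differently on a new machine.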