Installation

KTransformers has two current public package paths. Choose the one that matches your task.

Task	Packages	Use when
Inference	`kt-kernel sglang-kt`	You want to run a model server with `kt run` or SGLang-KT.
Fine-tuning	`ktransformers[sft]`	You want to run MoE LoRA SFT through LLaMA-Factory.

Inference Install

pip install kt-kernel sglang-kt

kt-kernel provides the KT MoE expert backend. sglang-kt provides the SGLang serving path used by current KTransformers inference docs.

Verify the command-line tools:

kt version
kt doctor

Then continue with First inference server.

Fine-Tuning Install

For LLaMA-Factory based SFT:

cd /path/to/LLaMA-Factory
pip install -e .
pip install -r requirements/ktransformers.txt

requirements/ktransformers.txt should contain the public KT SFT entry:

ktransformers[sft]

This installs ktransformers and its SFT dependencies, including kt-kernel, transformers-kt, and accelerate-kt. It does not install sglang-kt, because sglang-kt is inference-only.

Then continue with First LoRA SFT run.

Before Running a Model

Check Support Matrix before treating a model tutorial as current support. Older pages may still exist for traceability, but old local_chat.py, ktransformers/server/main.py, balance_serve, and kt_optimize_rule paths are legacy unless explicitly rewritten.

Useful follow-up pages: