KTransformers

GLM Models

GLM support is currently inference-focused. Document each configuration as an exact environment tuple rather than as a blanket support claim.

Inference

Model   | Entry                      | Method                                                       | Status
GLM-5   | Manual SGLang-KT tutorial  | FP8 or BF16                                                  | Needs smoke test.
GLM-5.1 | Manual SGLang-KT tutorial  | FP8, BF16, or FP8_PERCHANNEL, depending on the tutorial page | Needs smoke test in an isolated environment.

GLM-5.1 may require a specific Transformers stack. Do not collapse GLM-5 and GLM-5.1 into a generic support claim.
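
One way to keep the two models separate is to record each tested combination as its own entry and look it up by exact model name. The following is a minimal sketch, not part of KTransformers: the `EnvTuple` class, its field names, and the `lookup` and `is_supported` helpers are hypothetical, and the Transformers version is left unset because this page does not state one. The weight formats are taken from the table above.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class EnvTuple:
    """Hypothetical record of one exact environment (illustrative only)."""
    model: str                           # exact model name, e.g. "GLM-5.1"
    weight_format: str                   # e.g. "FP8"
    transformers_version: Optional[str]  # None until a version is pinned and verified
    smoke_tested: bool = False           # flip to True only after a passing smoke test


# Formats come from the table above; versions are intentionally left unset.
ENTRIES = [
    EnvTuple("GLM-5", "FP8", None),
    EnvTuple("GLM-5", "BF16", None),
    EnvTuple("GLM-5.1", "FP8", None),
    EnvTuple("GLM-5.1", "BF16", None),
    EnvTuple("GLM-5.1", "FP8_PERCHANNEL", None),
]


def lookup(model: str) -> List[EnvTuple]:
    """Return entries for an exact model name; no prefix or family matching."""
    return [e for e in ENTRIES if e.model == model]


def is_supported(model: str, weight_format: str) -> bool:
    """Treat a combination as supported only if its own smoke test has passed."""
    return any(
        e.weight_format == weight_format and e.smoke_tested
        for e in lookup(model)
    )
```

Because `lookup` matches exact names, a query for a generic "GLM" returns nothing, and a passing smoke test for GLM-5 never implies support for GLM-5.1.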

Documentation Source

  • doc/en/kt-kernel/GLM-5-Tutorial.md
  • doc/en/kt-kernel/GLM-5.1-Tutorial.md