GLM Models
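GLM support is currently inference-focused. Document each configuration as an exact environment tuple (model, entry point, weight format, and the software stack it was run with) rather than as a general support claim.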
Inference
| Model | Entry | Method | Status |
|---|---|---|---|
| GLM-5 | Manual SGLang-KT tutorial | FP8 or BF16 | Needs smoke test. |
| GLM-5.1 | Manual SGLang-KT tutorial | FP8, BF16, or FP8_PERCHANNEL, depending on the tutorial page | Needs smoke test in an isolated environment. |
GLM-5.1 may require a specific Transformers stack. Do not collapse GLM-5 and GLM-5.1 into a generic support claim.
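One way to keep the two models from being merged into a single claim is to record each configuration as its own tuple. The sketch below is illustrative only: the `EnvTuple` schema, its field names, and the `is_documented` helper are assumptions made for this example, not part of the project. The Transformers version is left as `None` because the source does not state which version GLM-5.1 needs.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class EnvTuple:
    """One documented inference configuration (hypothetical schema)."""
    model: str                   # e.g. "GLM-5.1"
    entry: str                   # how the model is launched
    method: str                  # weight format named in the table
    transformers: Optional[str]  # pinned version; None means not yet recorded
    status: str


ENTRY = "Manual SGLang-KT tutorial"

# Weight formats per model, taken from the table above.
METHODS = {
    "GLM-5": ["FP8", "BF16"],
    "GLM-5.1": ["FP8", "BF16", "FP8_PERCHANNEL"],
}

# Paraphrased status per model; neither model has passed a smoke test yet.
STATUS = {
    "GLM-5": "needs smoke test",
    "GLM-5.1": "needs isolated-environment smoke test",
}

# One tuple per (model, method) pair, so no row covers both models.
ENV_TUPLES = [
    EnvTuple(model, ENTRY, method, None, STATUS[model])
    for model, methods in METHODS.items()
    for method in methods
]


def is_documented(model: str, method: str) -> bool:
    """True only if this exact (model, method) pair has its own tuple."""
    return any(t.model == model and t.method == method for t in ENV_TUPLES)
```

With this layout, `is_documented("GLM-5", "FP8_PERCHANNEL")` is false even though GLM-5.1 lists that format, which is the distinction the note above asks for.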
Documentation Source
- doc/en/kt-kernel/GLM-5-Tutorial.md
- doc/en/kt-kernel/GLM-5.1-Tutorial.md