Runtime Smoke Checklist
Use this checklist before upgrading a model, precision, hardware, or fine-tuning path from Needs smoke to Current.
Inference Smoke
Record:
- commit or package versions for
kt-kernel,sglang-kt, and relevant dependencies - hardware: CPU SKU, GPU SKU/count, RAM, NUMA count
- model checkpoint and
--kt-weight-path - exact launch command
- request command or OpenAI client snippet
- first-token success and a short generated response
- logs for selected KT method/backend
- known warnings that are benign or blocking
Minimum request:
curl -s http://127.0.0.1:30000/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{"model":"my-model","messages":[{"role":"user","content":"Say hello in one sentence."}],"max_tokens":32}'
Fine-Tuning Smoke
Record:
- LLaMA-Factory commit
requirements/ktransformers.txt- training YAML
- Accelerate config
kt_config- first training steps and loss logging
- checkpoint/adaptor output behavior
When Smoke Is Not Enough
Smoke confirms the path starts and produces output. It does not prove long-context stability, performance, quality, tool calling, or multi-node behavior. Those require separate benchmark or evaluation records.