Make fp8 models quantized by llm-compressor inferable in turbomind #4509
Open
43758726 wants to merge 3 commits into InternLM:main from