Add Quanto4,2, HQQ4,2 KV cache quantization support to Transformers loader#6768
Open
dinerburger wants to merge 3 commits into
Open
Add Quanto4,2, HQQ4,2 KV cache quantization support to Transformers loader#6768dinerburger wants to merge 3 commits into
dinerburger wants to merge 3 commits into