Update GPT-QModel references and deprecate AutoGPTQ #3190

BenjaminBossan merged 8 commits into huggingface:main from
Conversation
@BenjaminBossan Conflict resolved. We have tested this PR, the relevant tests are passing, and it does not rely on the Optimum PR being merged first. This one is ready for merge. Please trigger the CI tests to make sure there are no regressions.
BenjaminBossan left a comment
Thanks for the PR to fully remove AutoGPTQ and move to GPT-QModel.
The PR generally looks good and both single and multi GPU tests passed in my testing. There are a few smaller issues left, please check.
```bash
# gptqmodel install
pip install gptqmodel --no-build-isolation
# GPTQ-Model install
```

Suggested change:

```diff
-# GPTQ-Model install
+# GPT-QModel install
```
@BenjaminBossan Ready for a CI re-run.
```diff
 # a NVIDIA L4). So we fix the compute capability to 8.9. In the future we might extend this
 # to a list of compute capabilities (separated by ;).
-RUN CUDA_ARCH_LIST=8.9 conda run -n peft pip install --no-build-isolation "gptqmodel>=6.0.3"
+RUN CUDA_ARCH_LIST=8.9 conda run -n peft pip install "gptqmodel>=7.0.0"
```
gptqmodel 7.0.0 no longer requires `--no-build-isolation`, as all kernels have moved from the setup stage to first-use JIT compilation.
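For context, the version gate this comment describes can be sketched as a plain tuple comparison. This is an illustrative sketch only; the helper names are hypothetical and not part of PEFT or gptqmodel:

```python
def parse_version(v: str) -> tuple:
    # "7.0.0" -> (7, 0, 0), so versions compare correctly as tuples.
    return tuple(int(part) for part in v.split("."))

def needs_no_build_isolation(installed: str) -> bool:
    # Hypothetical helper: releases before 7.0.0 compiled kernels at
    # install time and needed --no-build-isolation; 7.0.0+ JIT-compiles
    # kernels on first use, so the flag can be dropped.
    return parse_version(installed) < parse_version("7.0.0")

print(needs_no_build_isolation("6.0.3"))  # True
print(needs_no_build_isolation("7.0.0"))  # False
```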
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
BenjaminBossan left a comment
Thanks for the update, everything looks good. Failing CI is unrelated and I ran the GPU tests locally to ensure that they pass too.
And congrats on the 7.0 release.
@BenjaminBossan GPT-QModel, as the pkg is no longer gptq-only; it now fully supports awq and many other quant methods. Companion (but not required) PR in Optimum: huggingface/optimum#2426