Replies: 2 comments
-
Group size is configurable in 'QuantizeConfig'. For example, |
Beta Was this translation helpful? Give feedback.
0 replies
-
@hzfantasy Can you tell me what exactly the hardware requirement that will need group_size 512? Quality of post-quant recovery may be very bad with > 128 gpsize. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
To meet the requirements of the hardware, I need to quantize with group_size >= 512. Is it possible to do this in GPTQModel?
Beta Was this translation helpful? Give feedback.
All reactions