
Conversation

Thank you for your great work. We have released the 4-bit GPTQ-quantized LLaDA model on Hugging Face:

Based on the published evaluation code, we have evaluated the quantized base model. The results are as follows:

| Dataset       | GPTQ-4bit | FP16  |
|---------------|-----------|-------|
| MMLU          | 65.20     | 65.90 |
| CMMLU         | 69.23     | 69.90 |
| ARC-Challenge | 45.48     | 47.90 |
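The small accuracy gap above comes from mapping each FP16 weight onto a 16-level grid. As a rough illustration (not the released model's actual quantization code), the sketch below shows plain round-to-nearest 4-bit quantization with per-group scales; GPTQ itself additionally corrects rounding error column-by-column using second-order (Hessian) information, which is why its accuracy loss is smaller than naive rounding would suggest.

```python
import numpy as np

def quantize_4bit(w, group_size=128):
    """Round-to-nearest asymmetric 4-bit quantization with per-group scales.

    This is a simplified illustration of the 4-bit grid GPTQ quantizes
    onto; real GPTQ also applies Hessian-based error compensation.
    """
    w = w.reshape(-1, group_size)
    w_min = w.min(axis=1, keepdims=True)
    w_max = w.max(axis=1, keepdims=True)
    scale = (w_max - w_min) / 15.0  # 16 levels: integers 0..15
    q = np.clip(np.round((w - w_min) / scale), 0, 15)
    return q.astype(np.uint8), scale, w_min

def dequantize(q, scale, w_min):
    # Map 4-bit integers back to approximate FP values.
    return q * scale + w_min

rng = np.random.default_rng(0)
w = rng.normal(size=(4096,)).astype(np.float32)
q, scale, zero = quantize_4bit(w)
w_hat = dequantize(q, scale, zero).reshape(-1)
err = np.abs(w - w_hat).max()
print(f"max abs reconstruction error: {err:.4f}")
```

Per group, the rounding error is bounded by half a quantization step (`scale / 2`), which is why group-wise scaling keeps the degradation small on benchmarks like MMLU.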

@NieShenRuc (Collaborator)

I am so sorry for the late response. Thank you for your excellent work! May I ask what the accuracy of the quantized LLaDA model is on GSM8K?
