
Conversation


@Kamichanw commented Jul 5, 2025

What does this PR do?

Fix the bugs listed below:

  1. Remove the unnecessary accelerator.prepare call:
    self.model = self.accelerator.prepare(self.model)
  2. Add docstrings to improve readability.
  3. Change the dataset generation in generate_until and loglikelihood to direct processing.

Detailed explanation

1. Remove accelerator.prepare

According to a note in the official accelerate v0.34.2 documentation, we "don't need to prepare a model if you only use it for inference without any kind of mixed precision."

Moreover, calling prepare on a model leads to higher GPU memory consumption. For instance, LLaDA-8B-Base takes about 15 GB of VRAM when loaded in bf16, but after calling prepare, more than 40 GB is allocated, which causes an OOM error on an RTX 3090.
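As a rough illustration (not the exact PR diff; the checkpoint name and dtype here are just examples), an inference-only setup can simply move the model to the accelerator's device instead of wrapping it with prepare:

```python
import torch
from accelerate import Accelerator
from transformers import AutoModel, AutoTokenizer

accelerator = Accelerator()

# Load once in bf16 (~15 GB of VRAM for LLaDA-8B-Base) ...
model = AutoModel.from_pretrained(
    "GSAI-ML/LLaDA-8B-Base",
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
)
tokenizer = AutoTokenizer.from_pretrained("GSAI-ML/LLaDA-8B-Base", trust_remote_code=True)

# ... and just place it on the right device for inference,
# instead of: model = accelerator.prepare(model)
model = model.to(accelerator.device)
model.eval()
```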

2. Improve readability

I copied the official docstrings from lm-eval (generate_until, loglikelihood, and _encode_pair) to help readers who are not familiar with the lm-eval interfaces.

I also renamed several variables to match the docstrings.
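For reference, this is roughly the shape of the interfaces those docstrings describe (paraphrased rather than copied verbatim from lm-eval, and the class name here is just a placeholder; see the upstream docstrings for the exact wording):

```python
from typing import List, Tuple

class EvalLM:
    def loglikelihood(self, requests) -> List[Tuple[float, bool]]:
        """Each request carries (context, continuation); return, for each pair,
        the log-probability of the continuation given the context and whether
        the continuation would also be the greedy completion."""
        ...

    def generate_until(self, requests) -> List[str]:
        """Each request carries (context, gen_kwargs); generate text from the
        context until one of the stop conditions in gen_kwargs is reached."""
        ...

    def _encode_pair(self, context: str, continuation: str):
        """Tokenize context and continuation so that the continuation tokens
        can be scored separately from the context tokens."""
        ...
```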

3. Remove datasets mapping

When I first ran the eval code, I got a warning:

[screenshot: datasets warning that the mapping function could not be hashed]

When .map is called on a dataset, the datasets library caches the result under a fingerprint, which is computed by hashing, among other things, the mapping function (here, _tokenize). In this case the function cannot be hashed, so the cache is never reused and a new processed dataset is generated on every call. This eventually leads to excessive memory usage, causing the system to kill the process.

I changed the dataset generation to process the requests directly instead of going through .map.
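A minimal sketch of the idea (the helper names and request layout here are illustrative, not the exact code in this PR):

```python
# Before: build a Dataset and .map it with a function that datasets cannot
# hash, so no cache fingerprint can be reused and the processed data is
# regenerated (and accumulated) on every call:
#
#   ds = Dataset.from_list(records)
#   ds = ds.map(self._tokenize)
#
# After: process the requests directly with a plain Python loop.
def encode_requests(requests, encode_pair):
    encoded = []
    for request in requests:
        context, continuation = request.args          # lm-eval Instance args
        context_enc, continuation_enc = encode_pair(context, continuation)
        encoded.append((context, continuation, context_enc, continuation_enc))
    return encoded
```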
