Skip to content

Conversation

Seas0
Copy link

@Seas0 Seas0 commented Jun 5, 2025

Running the model against inputs without any [mask] token is absurd and useless,
thus we should try to intercept this situation and skip the model when sampling.

I think use the resolved num_transfer_tokens token unmasking/sampling schedule is enough for this case.

Running the model against inputs without any `[mask]` token is absurd and useless, try to intercept this situation and skip the model when sampling.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant