Skip to content

Conversation

xushaoxuan123
Copy link

Add LLaDA-Instruct Evaluation Toolkit

Added evaluation toolkit for LLaDA-Instruct and LLaMA-Instruct models based on OpenCompass.

What's included:

  • Evaluation scripts for both models
  • Easy installation and usage

What's changed:

  • a new gpqa dataset and config

Usage:

bash eval_llada_instruct.sh
bash eval_llama_instruct.sh

Based on open-compass/opencompass.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant