Pinned Loading
-
LLM-eval-benchmark-lab
LLM-eval-benchmark-lab PublicA modular, configurable benchmarking harness for evaluating LLM behavior across tasks, constraints, and model classes.
Python 1
-
-
-
Natural-Scaling-Predictor
Natural-Scaling-Predictor PublicNovel methodology: First comprehensive LLM performance prediction framework Empirical insights: Analysis of 127 LLMs across 15 model families Theoretical advances: Extended scaling laws with emerge…
Python 1
-
ai-physicist-central-llm
ai-physicist-central-llm PublicA specialized language model architecture for physics reasoning, combining a central LLM "brain" with external computational "hands" for enhanced problem-solving capabilities.
Python 1
If the problem persists, check the GitHub status page or contact support.