EvalHub 🚀

Explore various evaluation metrics here, then head over to the lm-evaluation-harness repository by EleutherAI to evaluate your models.

Newly Added Benchmarks

All Benchmarks

© 2024 EvalHub