Dark

ModelLens

Free AI models, ranked daily loadingโ€ฆ

๐Ÿค–
โ€“
Models Tested
โšก
โ€“
Fastest tok/s
๐ŸŽฏ
โ€“
Top Score /100
๐Ÿ“
3
Size Categories

Rankings

Scores are normalized within each size tier โ€” a Small model's #1 rank is against other Small models only

Show:
HumanEval ยท code GSM8K ยท reasoning MMLU ยท knowledge Translation
Large 50B+ parameters
Medium 10โ€“50B parameters
Small โ‰ค10B parameters
Large 50B+ parameters
Medium 10โ€“50B parameters
Small โ‰ค10B parameters

What We Test

Industry-standard benchmarks โ€” no invented metrics

๐Ÿ’ป
HumanEval
Code quality
Functions are executed against real test cases. Syntax and edge-case logic must pass.
๐Ÿงฎ
GSM8K
Reasoning
Math word problems and multi-step logic. Verified correct answers only, no partial credit.
๐Ÿ“š
MMLU
Knowledge & instructions
Instruction following, format compliance, and multi-domain knowledge checks.
๐ŸŒ
Translation
Multilingual
English โ†” Russian, English โ†” Spanish. Scored via script detection and vocabulary matching.
โšก
Throughput
Speed
Tokens/second averaged over short, medium, and long prompts. Ranked per size tier.
๐Ÿ”„
Daily Updates
Automated
GitHub Actions runs every 24 hours. Results reflect the current state of each model.

Latest AI News

View all โ†’
0 selected

Model Comparison

Radar charts show relative strengths ยท highlighted = best in row