Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Quadyun
's Collections
Math-LongCoT
LongCoT Dataset
Reward Model
All Math Benchmark Datasets
MATH-TIR
All Math Benchmark Datasets
updated
Dec 19, 2024
Upvote
-
AI-MO/aimo-validation-aime
Viewer
•
Updated
May 7, 2025
•
90
•
5.71k
•
65
HuggingFaceH4/MATH-500
Benchmark
•
Updated
22 days ago
•
500
•
96.8k
•
272
TIGER-Lab/MMLU-STEM
Viewer
•
Updated
Jun 20, 2024
•
3.15k
•
286
•
17
Upvote
-
Share collection
View history
Collection guide
Browse collections