Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation

university
https://evalmodels.github.io/rbench
Activity Feed

AI & ML interests

None defined yet.

Yi Zhang's profile picture Xuanyu Chu's profile picture LalaXu's profile picture

uyzhang 
authored a paper 2 months ago

Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs

Paper • 2510.13795 • Published Oct 15 • 57
uyzhang 
updated a dataset 7 months ago

R-Bench/R-Bench

Viewer • Updated May 27 • 3.52k • 1.13k • 22
uyzhang 
authored 2 papers 7 months ago

R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation

Paper • 2505.02018 • Published May 4 • 3

RBench-V: A Primary Assessment for Visual Reasoning Models with Multi-modal Outputs

Paper • 2505.16770 • Published May 22 • 12
CXY07 
authored a paper 7 months ago

RBench-V: A Primary Assessment for Visual Reasoning Models with Multi-modal Outputs

Paper • 2505.16770 • Published May 22 • 12
CXY07 
updated a dataset 8 months ago

R-Bench/R-Bench-V

Viewer • Updated May 16 • 1.61k • 156 • 8
CXY07 
published a dataset 8 months ago

R-Bench/R-Bench-V

Viewer • Updated May 16 • 1.61k • 156 • 8
BiggerXu 
updated a dataset 9 months ago

R-Bench/R-Bench

Viewer • Updated May 27 • 3.52k • 1.13k • 22
uyzhang 
published a dataset 9 months ago

R-Bench/R-Bench

Viewer • Updated May 27 • 3.52k • 1.13k • 22
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs