R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation

university

https://evalmodels.github.io/rbench

uyzhang

authored a paper 2 months ago

Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs

Paper • 2510.13795 • Published Oct 15 • 57

uyzhang

updated a dataset 7 months ago

R-Bench/R-Bench

Viewer • Updated May 27 • 3.52k • 1.13k • 22

uyzhang

authored 2 papers 7 months ago

R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation

Paper • 2505.02018 • Published May 4 • 3

RBench-V: A Primary Assessment for Visual Reasoning Models with Multi-modal Outputs

Paper • 2505.16770 • Published May 22 • 12

CXY07

authored a paper 7 months ago

RBench-V: A Primary Assessment for Visual Reasoning Models with Multi-modal Outputs

Paper • 2505.16770 • Published May 22 • 12

CXY07

updated a dataset 8 months ago

R-Bench/R-Bench-V

Viewer • Updated May 16 • 1.61k • 156 • 8

CXY07

published a dataset 8 months ago

R-Bench/R-Bench-V

Viewer • Updated May 16 • 1.61k • 156 • 8

BiggerXu

updated a dataset 9 months ago

R-Bench/R-Bench

Viewer • Updated May 27 • 3.52k • 1.13k • 22

uyzhang

published a dataset 9 months ago

R-Bench/R-Bench

Viewer • Updated May 27 • 3.52k • 1.13k • 22