Add arXiv link
app.py CHANGED
@@ -301,7 +301,7 @@ def grade(file_obj, progress=gr.Progress()):
 model_result_example = "https://raw.githubusercontent.com/yuweihao/MM-Vet/main/v2/results/gpt-4o-2024-05-13_detail-high.json"

 markdown = f"""
-# [MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities](https://
+# [MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities](https://arxiv.org/abs/2408.00765)

 We offer MM-Vet v2 LLM-based (GPT-4) evaluator to grade open-ended outputs from your models.
