Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -66,7 +66,7 @@ This project introduces **General-Level** and **General-Bench**.
|
|
66 |
---
|
67 |
|
68 |
# πππ General-Level<a name="level" />
|
69 |
-
|
70 |
**A 5-scale level evaluation system with a new norm for assessing the multimodal generalists (multimodal LLMs/agents).
|
71 |
The core is the use of <b style="color:red">synergy</b> as the evaluative criterion, categorizing capabilities based on whether MLLMs preserve synergy across comprehension and generation, as well as across multimodal interactions.**
|
72 |
|
@@ -89,7 +89,7 @@ The core is the use of <b style="color:red">synergy</b> as the evaluative criter
|
|
89 |
---
|
90 |
|
91 |
# πππ General-Bench<a name="bench" />
|
92 |
-
|
93 |
**A companion massive multimodal benchmark dataset, encompasses a broader spectrum of skills, modalities, formats, and capabilities, including over 700 tasks and 325K instances.**
|
94 |
|
95 |
|
|
|
66 |
---
|
67 |
|
68 |
# πππ General-Level<a name="level" />
|
69 |
+
|
70 |
**A 5-scale level evaluation system with a new norm for assessing the multimodal generalists (multimodal LLMs/agents).
|
71 |
The core is the use of <b style="color:red">synergy</b> as the evaluative criterion, categorizing capabilities based on whether MLLMs preserve synergy across comprehension and generation, as well as across multimodal interactions.**
|
72 |
|
|
|
89 |
---
|
90 |
|
91 |
# πππ General-Bench<a name="bench" />
|
92 |
+
|
93 |
**A companion massive multimodal benchmark dataset, encompasses a broader spectrum of skills, modalities, formats, and capabilities, including over 700 tasks and 325K instances.**
|
94 |
|
95 |
|