Update README.md (#11)
Browse files- Update README.md (f4e420390ff36f81b8663bc6e5fbf206b24dead4)
Co-authored-by: Gang Li <[email protected]>
README.md
CHANGED
@@ -45,11 +45,11 @@ Additional Notes:
|
|
45 |
|
46 |
#### Notes
|
47 |
|
48 |
-
|
49 |
-
2️⃣ It is the model submitted to [EvalAI](https://eval.ai/web/challenges/challenge-page/2391/overview) for evaluation, trained multiple times to achieve balanced results. (**The results on the [leaderboard](https://sakshi113.github.io/mmau_homepage/#leaderboard) contain some typographical errors, and we are currently in communication with the organizers to correct them.**)
|
50 |
-
\* The data are sourced from the [MMAU official website](https://sakshi113.github.io/mmau_homepage/).
|
51 |
\[1\] Xie, Zhifei, et al. "Audio-Reasoner: Improving Reasoning Capability in Large Audio Language Models." arXiv preprint arXiv:2503.02318 (2025).
|
52 |
\[2\] Ma, Ziyang, et al. "Audio-CoT: Exploring Chain-of-Thought Reasoning in Large Audio Language Model." arXiv preprint arXiv:2501.07246 (2025).
|
|
|
|
|
53 |
|
54 |
## Inference
|
55 |
|
|
|
45 |
|
46 |
#### Notes
|
47 |
|
48 |
+
\* The data are sourced from the [MMAU leaderboard](https://sakshi113.github.io/mmau_homepage/#leaderboard).
|
|
|
|
|
49 |
\[1\] Xie, Zhifei, et al. "Audio-Reasoner: Improving Reasoning Capability in Large Audio Language Models." arXiv preprint arXiv:2503.02318 (2025).
|
50 |
\[2\] Ma, Ziyang, et al. "Audio-CoT: Exploring Chain-of-Thought Reasoning in Large Audio Language Model." arXiv preprint arXiv:2501.07246 (2025).
|
51 |
+
1️⃣ It is the original model, identical to the one on Hugging Face and described in our technical report.
|
52 |
+
2️⃣ It is the model submitted to the [MMAU leaderboard](https://sakshi113.github.io/mmau_homepage/#leaderboard), trained multiple times to achieve balanced results.
|
53 |
|
54 |
## Inference
|
55 |
|