Audio-Text-to-Text
Transformers
Safetensors
qwen2_audio
text2text-generation
frankenliu GrantL10 commited on
Commit
e1068f8
·
verified ·
1 Parent(s): 320ac24

Update README.md (#11)

Browse files

- Update README.md (f4e420390ff36f81b8663bc6e5fbf206b24dead4)


Co-authored-by: Gang Li <[email protected]>

Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -45,11 +45,11 @@ Additional Notes:
45
 
46
  #### Notes
47
 
48
- 1️⃣ It is the original model, identical to the one on Hugging Face and described in our technical report.
49
- 2️⃣ It is the model submitted to [EvalAI](https://eval.ai/web/challenges/challenge-page/2391/overview) for evaluation, trained multiple times to achieve balanced results. (**The results on the [leaderboard](https://sakshi113.github.io/mmau_homepage/#leaderboard) contain some typographical errors, and we are currently in communication with the organizers to correct them.**)
50
- \* The data are sourced from the [MMAU official website](https://sakshi113.github.io/mmau_homepage/).
51
  \[1\] Xie, Zhifei, et al. "Audio-Reasoner: Improving Reasoning Capability in Large Audio Language Models." arXiv preprint arXiv:2503.02318 (2025).
52
  \[2\] Ma, Ziyang, et al. "Audio-CoT: Exploring Chain-of-Thought Reasoning in Large Audio Language Model." arXiv preprint arXiv:2501.07246 (2025).
 
 
53
 
54
  ## Inference
55
 
 
45
 
46
  #### Notes
47
 
48
+ \* The data are sourced from the [MMAU leaderboard](https://sakshi113.github.io/mmau_homepage/#leaderboard).
 
 
49
  \[1\] Xie, Zhifei, et al. "Audio-Reasoner: Improving Reasoning Capability in Large Audio Language Models." arXiv preprint arXiv:2503.02318 (2025).
50
  \[2\] Ma, Ziyang, et al. "Audio-CoT: Exploring Chain-of-Thought Reasoning in Large Audio Language Model." arXiv preprint arXiv:2501.07246 (2025).
51
+ 1️⃣ It is the original model, identical to the one on Hugging Face and described in our technical report.
52
+ 2️⃣ It is the model submitted to the [MMAU leaderboard](https://sakshi113.github.io/mmau_homepage/#leaderboard), trained multiple times to achieve balanced results.
53
 
54
  ## Inference
55