Update README.md
Browse files
README.md
CHANGED
@@ -1,6 +1,6 @@
|
|
1 |
---
|
2 |
title: README
|
3 |
-
emoji:
|
4 |
colorFrom: red
|
5 |
colorTo: gray
|
6 |
sdk: static
|
@@ -11,25 +11,30 @@ pinned: false
|
|
11 |
|
12 |
Welcome to the SeaLLMs project - a family of large language models tailored for Southeast Asian languages including English, Chinese, Indonesian, Vietnamese, Thai, Tagalog, Malay, Burmese, Khmer, Lao, Tamil, and Javanese.
|
13 |
|
14 |
-
Unlike models primarily designed for high-resource languages like English, our mission is to democratize access to advanced language technologies for regional and potentially under-represented languages
|
15 |
-
|
16 |
|
|
|
|
|
|
|
|
|
|
|
17 |
|
18 |
## SeaLLMs Models
|
19 |
|
20 |
-
-
|
21 |
-
- <strong style="color: red">[SeaLLMs/SeaLLMs-v3-7B-Chat](https://huggingface.co/SeaLLMs/SeaLLM3-7B-Chat)</strong>: Latest 7B chat version of SeaLLMs-v3, achieving SOTA performance of diverse tasks while specifically enhanced to be more trustworthy, exhibiting reduced hallucination and providing safe response.
|
22 |
-
- <strong style="color: red">[SeaLLMs/SeaLLMs-v3-1.5B-Chat](https://huggingface.co/SeaLLMs/SeaLLMs-v3-1.5B-Chat)</strong>: Latest 1.5B chat version of SeaLLMs-v3, specifically fine-tuned to follow human instructions effectively. It is designed to be resource-efficient, making it suitable for use even on your laptop.
|
23 |
-
- <strong style="color: red">[SeaLLMs/SeaLLMs-v3-1.5B](https://huggingface.co/SeaLLMs/SeaLLMs-v3-1.5B)</strong> and <strong style="color: red">[SeaLLMs/SeaLLMs-v3-7B](https://huggingface.co/SeaLLMs/SeaLLMs-v3-7B)</strong>: two base version models for you further conducting customized fine-tuning with your own data.
|
24 |
- [SeaLLMs/SeaLLM-7B-v2.5](https://huggingface.co/SeaLLMs/SeaLLM-7B-v2.5): New SeaLLM-7B model with 7B-SOTA on many world knowledge and reasoning tasks in SEA languages.
|
25 |
- [SeaLLMs/SeaLLM-7B-v2](https://huggingface.co/SeaLLMs/SeaLLM-7B-v2): The most significant upgrade since SeaLLM-13B with half the size, outperforming performance across diverse multilingual tasks, from world knowledge, math reasoning, instruction following, etc.
|
26 |
- [SeaLLMs/SeaLLM-13B-Chat](https://huggingface.co/SeaLLMs/SeaLLM-13B-Chat): A chatbot optimized for Vietnamese 🇻🇳, Indonesian 🇮🇩, Thai 🇹🇭, Malay 🇲🇾, Khmer🇰🇭, Lao🇱🇦, Tagalog🇵🇭 and Burmese🇲🇲.
|
27 |
|
28 |
## Multilingual Evaluations for SEA
|
29 |
-
- [
|
|
|
|
|
30 |
|
31 |
## Quick Links
|
32 |
-
- [
|
|
|
33 |
- [SeaLLMs Github Repo](https://github.com/DAMO-NLP-SG/SeaLLMs)
|
34 |
- [SeaLLMs Paper](https://arxiv.org/abs/2312.00738) (ACL 2024 Demo)
|
35 |
-
-
|
|
|
1 |
---
|
2 |
title: README
|
3 |
+
emoji: 🌍
|
4 |
colorFrom: red
|
5 |
colorTo: gray
|
6 |
sdk: static
|
|
|
11 |
|
12 |
Welcome to the SeaLLMs project - a family of large language models tailored for Southeast Asian languages including English, Chinese, Indonesian, Vietnamese, Thai, Tagalog, Malay, Burmese, Khmer, Lao, Tamil, and Javanese.
|
13 |
|
14 |
+
Unlike models primarily designed for high-resource languages like English, our mission is to democratize access to advanced language technologies for regional and potentially under-represented languages,
|
15 |
+
while prioritizing safety and trustworthiness within the regional context.
|
16 |
|
17 |
+
## ☄️ What's New?
|
18 |
+
After the release of SeaLLMs-v3, we've focused on extending along two directions: language coverage and multimodal support. We are happy to share:
|
19 |
+
|
20 |
+
- [Babel](https://babel-llm.github.io/babel-llm/): a multilingual LLM that covers the top 25 languages by number of speakers, supports over 90% of the global population
|
21 |
+
- [SeaLLMs-Audio](https://damo-nlp-sg.github.io/SeaLLMs-Audio/): the multimodal (audio) extension of SeaLLMs and the first large audio-language model designed to support multiple Southeast Asian languages
|
22 |
|
23 |
## SeaLLMs Models
|
24 |
|
25 |
+
- SeaLLMs-v3: The latest version of the SeaLLMs family, achieving SOTA performance of diverse tasks while specifically enhanced to be more trustworthy, available in multiple variants: [7B-Chat](https://huggingface.co/SeaLLMs/SeaLLM3-7B-Chat), [1.5B-Chat](https://huggingface.co/SeaLLMs/SeaLLMs-v3-1.5B-Chat), [1.5B-base](https://huggingface.co/SeaLLMs/SeaLLMs-v3-1.5B) and [7B-base](https://huggingface.co/SeaLLMs/SeaLLMs-v3-7B).
|
|
|
|
|
|
|
26 |
- [SeaLLMs/SeaLLM-7B-v2.5](https://huggingface.co/SeaLLMs/SeaLLM-7B-v2.5): New SeaLLM-7B model with 7B-SOTA on many world knowledge and reasoning tasks in SEA languages.
|
27 |
- [SeaLLMs/SeaLLM-7B-v2](https://huggingface.co/SeaLLMs/SeaLLM-7B-v2): The most significant upgrade since SeaLLM-13B with half the size, outperforming performance across diverse multilingual tasks, from world knowledge, math reasoning, instruction following, etc.
|
28 |
- [SeaLLMs/SeaLLM-13B-Chat](https://huggingface.co/SeaLLMs/SeaLLM-13B-Chat): A chatbot optimized for Vietnamese 🇻🇳, Indonesian 🇮🇩, Thai 🇹🇭, Malay 🇲🇾, Khmer🇰🇭, Lao🇱🇦, Tagalog🇵🇭 and Burmese🇲🇲.
|
29 |
|
30 |
## Multilingual Evaluations for SEA
|
31 |
+
- [LLM Leaderboard for Southeast Asian Languages](https://huggingface.co/spaces/SeaLLMs/LLM_Leaderboard_for_SEA): evaluates LLMs on Southeast Asian languages through two comprehensive benchmarks - SeaExam and SeaBench
|
32 |
+
- SeaExam assesses world knowledge and reasoning capabilities through exam-style questions (for both base and chat version models) [data (public)](https://huggingface.co/datasets/SeaLLMs/SeaExam) [eval code](https://github.com/DAMO-NLP-SG/SeaExam)
|
33 |
+
- SeaBench evaluates instruction-following abilities and multi-turn conversational skills (thus only for chat version models). [data (public)](https://huggingface.co/datasets/SeaLLMs/SeaBench) [eval code](https://github.com/DAMO-NLP-SG/SeaBench)
|
34 |
|
35 |
## Quick Links
|
36 |
+
- [Project Page](https://damo-nlp-sg.github.io/DAMO-SeaLLMs/): project page that contains link to everything you need
|
37 |
+
- [SeaLLM-Chatbot](https://huggingface.co/spaces/SeaLLMs/SeaLLM-Chat): online demo for the latest chatbot version of SeaLLMs (currently SeaLLMs-v3-7B-chat)
|
38 |
- [SeaLLMs Github Repo](https://github.com/DAMO-NLP-SG/SeaLLMs)
|
39 |
- [SeaLLMs Paper](https://arxiv.org/abs/2312.00738) (ACL 2024 Demo)
|
40 |
+
- [SeaLLMs 3 Paper](https://arxiv.org/abs/2407.19672) (NAACL 2025 Demo)
|