Sailor2

community

AI & ML interests

Open language models for South-East Asia

Recent Activity

afaji authored a paper 3 days ago

Softpick: No Attention Sink, No Massive Activations with Rectified Softmax

jjzha authored a paper 28 days ago

Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation

koalazf99 authored a paper about 1 month ago

MegaMath: Pushing the Limits of Open Math Corpora

sailor2's activity

afaji

authored a paper 3 days ago

Softpick: No Attention Sink, No Massive Activations with Rectified Softmax

Paper • 2504.20966 • Published 9 days ago • 25

Muennighoff

authored a paper 8 days ago

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published 9 days ago • 50

jjzha

authored a paper 28 days ago

Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation

Paper • 2504.07072 • Published 29 days ago • 8

Cameron-Chen

authored a paper about 1 month ago

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published Mar 26 • 48

samuelcahyawijaya

authored a paper about 1 month ago

Command A: An Enterprise-Ready Large Language Model

Paper • 2504.00698 • Published Apr 1 • 26

dreamerdeo

in sailor2/sea-wildbench about 1 month ago

[bot] Conversion to Parquet

#2 opened about 1 month ago by parquet-converter

dreamerdeo

updated a dataset about 1 month ago

sailor2/sea-wildbench

Viewer • Updated Mar 26 • 1.02k • 43

dreamerdeo

updated a Space about 2 months ago

README

HoangHa

authored a paper about 2 months ago

Pensez: Less Data, Better Reasoning -- Rethinking French LLM

Paper • 2503.13661 • Published Mar 17 • 5

gabrielchua

authored a paper about 2 months ago

MinorBench: A hand-built benchmark for content-based risks for children

Paper • 2503.10242 • Published Mar 13 • 4

afaji

authored a paper about 2 months ago

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published Mar 10 • 98

samuelcahyawijaya

authored a paper about 2 months ago

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published Mar 10 • 98

muhammadravi251001

authored a paper about 2 months ago

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published Mar 10 • 98

joanitolopo

authored a paper about 2 months ago

Constructing and Expanding Low-Resource and Underrepresented Parallel Datasets for Indonesian Local Languages

Paper • 2404.01009 • Published Apr 1, 2024

jjzha

authored a paper about 2 months ago

How Do Hackathons Foster Creativity? Towards AI Collaborative Evaluation of Creativity at Scale

Paper • 2503.04290 • Published Mar 6 • 1

jjzha

authored a paper 2 months ago

HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings

Paper • 2502.15411 • Published Feb 21 • 2

Cameron-Chen

authored a paper 2 months ago

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18 • 17

ryanhoangt

authored a paper 2 months ago

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18 • 17

dreamerdeo

updated a collection 2 months ago

Sailor2 Models

10 items • Updated Feb 24 • 5

yongzx

authored a paper 3 months ago

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18 • 17