Junyang Lin

JustinLin610

AI & ML interests

Pretraining, NLP, CV, etc.

Recent Activity

published a model 10 days ago
Qwen/Qwen3-235B-A22B-FP8
published a model 10 days ago
Qwen/Qwen3-235B-A22B
View all activity

Organizations

OFA-Sys's profile picture ICML 2022's profile picture Qwen's profile picture Unofficial Mistral Community's profile picture MLX Community's profile picture OpenHands's profile picture Social Post Explorers's profile picture AI4Bio@ZJLab's profile picture

JustinLin610's activity

New activity in Qwen/Qwen3-235B-A22B 8 days ago

fix: use tp 8 for SGLang

#1 opened 10 days ago by
zhyncs
New activity in Qwen/QwQ-32B 2 months ago
New activity in Qwen/Qwen2.5-Math-7B-Instruct 7 months ago
New activity in Qwen/Qwen2-VL-7B-Instruct 8 months ago
New activity in Qwen/Qwen2-72B-Instruct 10 months ago

32B

5
1
#13 opened 11 months ago by
zzzzzxx
New activity in Qwen/CodeQwen1.5-7B-Chat 12 months ago

fine-tuning

4
#16 opened about 1 year ago by
SaghirAya

Maybe a silly question...

2
#18 opened about 1 year ago by
urtuuuu

This model is Awesome

5
#20 opened 12 months ago by
areumtecnologia
New activity in Qwen/Qwen1.5-110B-Chat-AWQ about 1 year ago

Update tokenizer_config.json

#3 opened about 1 year ago by
JustinLin610
New activity in Qwen/Qwen1.5-7B-Chat about 1 year ago
New activity in Qwen/Qwen1.5-0.5B about 1 year ago

tie_word_embeddings=true ?

1
#6 opened about 1 year ago by
salmitta
New activity in Qwen/CodeQwen1.5-7B-Chat about 1 year ago

The llm output is incomplete

1
#11 opened about 1 year ago by
lijianqiang

GGUF models

3
1
#1 opened about 1 year ago by
MaziyarPanahi