Junda Zhu
chuhac
AI & ML interests
None yet
Recent Activity
new activity
10 days ago
tngtech/DeepSeek-R1T-Chimera:Questions on how routed experts are merged
authored a paper
3 months ago
Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking
Organizations
None yet
chuhac's activity
Questions on how routed experts are merged
17
7
#1 opened 11 days ago by chuhac

Adding `safetensors` variant of this model
#1 opened 11 months ago by SFconvertbot

Add support for HF Clip
7
#10 opened over 1 year ago by G-AshwinKumar
add remote code and hf-format "pytorch_model.bin"
1
1
#20 opened about 1 year ago by chuhac

AttributeError: 'BaichuanTokenizer' object has no attribute 'sp_model'
7
#18 opened over 1 year ago by lucasjin