Jintao Huang's picture

Jintao Huang

study-hjt

·

https://github.com/Jintao-Huang

AI & ML interests

None yet

Recent Activity

new activity 5 days ago

Qwen/Qwen3-235B-A22B:GPTQ/AWQ

new activity 6 days ago

Qwen/Qwen3-30B-A3B:AWQ quantized model support timeline?

new activity 9 days ago

Qwen/Qwen3-235B-A22B:🚀[Fine-tuning] Qwen3-MoE Megatron Training Implementation and Best Practices👋

View all activity

Organizations

study-hjt's activity

New activity in Qwen/Qwen3-235B-A22B 5 days ago

GPTQ/AWQ

#3 opened 9 days ago by

New activity in Qwen/Qwen3-30B-A3B 6 days ago

AWQ quantized model support timeline?

#12 opened 8 days ago by

New activity in Qwen/Qwen3-235B-A22B 9 days ago

🚀[Fine-tuning] Qwen3-MoE Megatron Training Implementation and Best Practices👋

#6 opened 9 days ago by

New activity in Qwen/Qwen3-30B-A3B 9 days ago

🚀[Fine-tuning] Qwen3-MoE Megatron Training Implementation and Best Practices👋

#3 opened 9 days ago by

New activity in Qwen/Qwen3-32B 9 days ago

🚀[Fine-tuning] Implementation and Best Practices for Qwen3 CPT/SFT/DPO/GRPO Training👋

#7 opened 9 days ago by

New activity in Qwen/Qwen3-8B 9 days ago

🚀[Fine-tuning] Implementation and Best Practices for Qwen3 CPT/SFT/DPO/GRPO Training👋

#3 opened 9 days ago by

New activity in Qwen/Qwen2.5-Omni-7B 13 days ago

[Fine-tuning] 🚀SFT/DPO/GRPO support!

#20 opened about 1 month ago by

New activity in microsoft/Phi-4-multimodal-instruct 2 months ago

thanks , how to fine tune?

#1 opened 2 months ago by

upvoted a paper 5 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 367

updated a model 9 months ago

study-hjt/Qwen1.5-110B-Chat-GPTQ-Int4

Text Generation • Updated Aug 14, 2024 • 12 • 2

updated a dataset 11 months ago

modelscope/self-cognition

Viewer • Updated Jun 8, 2024 • 108 • 75 • 19

liked a dataset 11 months ago

modelscope/self-cognition

Viewer • Updated Jun 8, 2024 • 108 • 75 • 19

liked a model 12 months ago

study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int4

Text Generation • Updated Apr 23, 2024 • 13 • 6

liked 2 models about 1 year ago

study-hjt/Meta-Llama-3-8B-Instruct-GPTQ-Int8

Text Generation • Updated Apr 23, 2024 • 36 • 2

study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int8

Text Generation • Updated Apr 23, 2024 • 29 • 2

updated 2 models about 1 year ago

study-hjt/Qwen1.5-110B-Chat-AWQ

Text Generation • Updated Apr 27, 2024 • 6

study-hjt/Qwen1.5-110B-Chat-GPTQ-Int8

Text Generation • Updated Apr 27, 2024 • 13

liked a model about 1 year ago

study-hjt/Qwen1.5-110B-Chat-GPTQ-Int4

Text Generation • Updated Aug 14, 2024 • 12 • 2

updated 2 models about 1 year ago

study-hjt/Qwen1.5-32B-Chat-GPTQ-Int8

Text Generation • Updated Apr 26, 2024 • 14 • 1

study-hjt/CodeQwen1.5-7B-Chat-GPTQ-Int8

Text Generation • Updated Apr 26, 2024 • 18 • 1