Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
16
1
10
Jintao Huang
study-hjt
Follow
kk3dmax's profile picture
ealix's profile picture
21world's profile picture
7 followers
·
2 following
https://github.com/Jintao-Huang
AI & ML interests
None yet
Recent Activity
new
activity
5 days ago
Qwen/Qwen3-235B-A22B:
GPTQ/AWQ
new
activity
6 days ago
Qwen/Qwen3-30B-A3B:
AWQ quantized model support timeline?
new
activity
9 days ago
Qwen/Qwen3-235B-A22B:
🚀[Fine-tuning] Qwen3-MoE Megatron Training Implementation and Best Practices👋
View all activity
Organizations
study-hjt
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
Qwen/Qwen3-235B-A22B
5 days ago
GPTQ/AWQ
12
4
#3 opened 9 days ago by
ndurkee
New activity in
Qwen/Qwen3-30B-A3B
6 days ago
AWQ quantized model support timeline?
7
2
#12 opened 8 days ago by
hyunw55
New activity in
Qwen/Qwen3-235B-A22B
9 days ago
🚀[Fine-tuning] Qwen3-MoE Megatron Training Implementation and Best Practices👋
5
1
#6 opened 9 days ago by
study-hjt
New activity in
Qwen/Qwen3-30B-A3B
9 days ago
🚀[Fine-tuning] Qwen3-MoE Megatron Training Implementation and Best Practices👋
4
#3 opened 9 days ago by
study-hjt
New activity in
Qwen/Qwen3-32B
9 days ago
🚀[Fine-tuning] Implementation and Best Practices for Qwen3 CPT/SFT/DPO/GRPO Training👋
3
#7 opened 9 days ago by
study-hjt
New activity in
Qwen/Qwen3-8B
9 days ago
🚀[Fine-tuning] Implementation and Best Practices for Qwen3 CPT/SFT/DPO/GRPO Training👋
3
#3 opened 9 days ago by
study-hjt
New activity in
Qwen/Qwen2.5-Omni-7B
13 days ago
[Fine-tuning] 🚀SFT/DPO/GRPO support!
#20 opened about 1 month ago by
study-hjt
New activity in
microsoft/Phi-4-multimodal-instruct
2 months ago
thanks , how to fine tune?
19
#1 opened 2 months ago by
NickyNicky
upvoted
a
paper
5 months ago
Qwen2.5 Technical Report
Paper
•
2412.15115
•
Published
Dec 19, 2024
•
367
updated
a model
9 months ago
study-hjt/Qwen1.5-110B-Chat-GPTQ-Int4
Text Generation
•
Updated
Aug 14, 2024
•
12
•
2
updated
a dataset
11 months ago
modelscope/self-cognition
Viewer
•
Updated
Jun 8, 2024
•
108
•
75
•
19
liked
a dataset
11 months ago
modelscope/self-cognition
Viewer
•
Updated
Jun 8, 2024
•
108
•
75
•
19
liked
a model
12 months ago
study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int4
Text Generation
•
Updated
Apr 23, 2024
•
13
•
6
liked
2 models
about 1 year ago
study-hjt/Meta-Llama-3-8B-Instruct-GPTQ-Int8
Text Generation
•
Updated
Apr 23, 2024
•
36
•
2
study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int8
Text Generation
•
Updated
Apr 23, 2024
•
29
•
2
updated
2 models
about 1 year ago
study-hjt/Qwen1.5-110B-Chat-AWQ
Text Generation
•
Updated
Apr 27, 2024
•
6
study-hjt/Qwen1.5-110B-Chat-GPTQ-Int8
Text Generation
•
Updated
Apr 27, 2024
•
13
liked
a model
about 1 year ago
study-hjt/Qwen1.5-110B-Chat-GPTQ-Int4
Text Generation
•
Updated
Aug 14, 2024
•
12
•
2
updated
2 models
about 1 year ago
study-hjt/Qwen1.5-32B-Chat-GPTQ-Int8
Text Generation
•
Updated
Apr 26, 2024
•
14
•
1
study-hjt/CodeQwen1.5-7B-Chat-GPTQ-Int8
Text Generation
•
Updated
Apr 26, 2024
•
18
•
1
Load more