Kawamura Masaki
KMasaki
AI & ML interests
None yet
Recent Activity
updated
a model
29 days ago
KMasaki/Qwen2.5-1.5B-Open-R1-GRPO
published
a model
about 1 month ago
KMasaki/8expert_2granularity_0shared_top2_0.52b-GRPO
updated
a model
about 1 month ago
KMasaki/8expert_2granularity_0shared_top2_0.52b-Distill
Organizations
Collections
3
-
KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp8-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000058
Updated -
KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp6-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000058
Updated -
KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp1-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000058
Updated -
KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp3-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000065
Updated • 2
models
21
KMasaki/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
•
9
KMasaki/8expert_2granularity_0shared_top2_0.52b-GRPO
Updated
KMasaki/8expert_2granularity_0shared_top2_0.52b-Distill
Text Generation
•
Updated
•
7
KMasaki/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
•
6
KMasaki/llm-jp-3-3.7b-Open-R1-GRPO
Updated
•
1
KMasaki/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Text Generation
•
Updated
•
3
KMasaki/llm-jp-3-3.7b-Open-R1-Distill
Text Generation
•
Updated
•
7
KMasaki/Llama-3.1-8B-Instruct-safety-exp1-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000390
Updated
KMasaki/Llama-3.1-8B-Instruct-safety-exp2-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000387
Updated
•
9
KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp7-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000123
Updated
datasets
0
None public yet