Kawamura Masaki

KMasaki

KMasaki0210

AI & ML interests

None yet

Recent Activity

updated a model 29 days ago

KMasaki/Qwen2.5-1.5B-Open-R1-GRPO

published a model about 1 month ago

KMasaki/8expert_2granularity_0shared_top2_0.52b-GRPO

updated a model about 1 month ago

KMasaki/8expert_2granularity_0shared_top2_0.52b-Distill

Organizations

Collections 3

models 21

KMasaki/Llama-3.1-8B-Instruct-safety-exp1-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000390

Updated Oct 20, 2024

KMasaki/Llama-3.1-8B-Instruct-safety-exp2-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000387

Updated Oct 20, 2024 • 9

KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp7-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000123

Updated Oct 17, 2024

datasets 0

None public yet

Collections 3

KMasaki/Llama-3.1-8B-Instruct-safety-exp1-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000390

KMasaki/Llama-3.1-8B-Instruct-safety-exp2-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000387

KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp8-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000058

KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp6-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000058

KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp1-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000058

KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp3-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000065

models 21

KMasaki/Qwen2.5-1.5B-Open-R1-GRPO

KMasaki/8expert_2granularity_0shared_top2_0.52b-GRPO

KMasaki/8expert_2granularity_0shared_top2_0.52b-Distill

KMasaki/Qwen2.5-1.5B-Open-R1-Distill

KMasaki/llm-jp-3-3.7b-Open-R1-GRPO

KMasaki/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

KMasaki/llm-jp-3-3.7b-Open-R1-Distill

KMasaki/Llama-3.1-8B-Instruct-safety-exp1-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000390

KMasaki/Llama-3.1-8B-Instruct-safety-exp2-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000387

KMasaki/Llama-3.1-8B-Instruct-gsm8k-exp7-LR_2.5e-5_MINLR_2.5e-6_WD_0.1_GC_1-iter_0000123
