GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-14336_critic Token Classification • Updated about 2 hours ago • 16
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-14336_actor Text Generation • Updated about 2 hours ago
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-16384_critic Token Classification • Updated about 4 hours ago • 5
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-16384_actor Text Generation • Updated about 4 hours ago • 27
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-14336_actor Text Generation • Updated about 2 hours ago
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-good-1 Text Generation • Updated about 8 hours ago • 25
GitBag/DeepSeek-R1-Distill-Qwen-1.5B_hmmt-feb-25_expanded_prompt_0_eval Viewer • Updated about 10 hours ago • 30
GitBag/DeepSeek-R1-Distill-Qwen-1.5B_hmmt-feb-25_expanded_prompt_0_eval Viewer • Updated about 10 hours ago • 30
GitBag/DeepSeek-R1-Distill-Qwen-1.5B_hmmt-feb-24_expanded_prompt_0_eval Viewer • Updated about 12 hours ago • 30
GitBag/DeepSeek-R1-Distill-Qwen-1.5B_hmmt-feb-24_expanded_prompt_0_eval Viewer • Updated about 12 hours ago • 30
GitBag/DeepSeek-R1-Distill-Qwen-1.5B_aime-25_expanded_prompt_0_eval Viewer • Updated about 14 hours ago • 30
GitBag/DeepSeek-R1-Distill-Qwen-1.5B_aime-25_expanded_prompt_0_eval Viewer • Updated about 14 hours ago • 30
GitBag/DeepSeek-R1-Distill-Qwen-1.5B_aime-24_expanded_prompt_1_eval Viewer • Updated about 16 hours ago • 30
GitBag/DeepSeek-R1-Distill-Qwen-1.5B_aime-24_expanded_prompt_1_eval Viewer • Updated about 16 hours ago • 30
GitBag/DeepSeek-R1-Distill-Qwen-1.5B_aime-24_expanded_prompt_0_eval Viewer • Updated about 18 hours ago • 30
GitBag/DeepSeek-R1-Distill-Qwen-1.5B_aime-24_expanded_prompt_0_eval Viewer • Updated about 18 hours ago • 30
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-16384_critic Token Classification • Updated about 4 hours ago • 5
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-16384_actor Text Generation • Updated about 4 hours ago • 27
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-14336_critic Token Classification • Updated about 2 hours ago • 16
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-good-1 Text Generation • Updated about 8 hours ago • 25