7 14 49

web

dim

dmitrymailk

AI & ML interests

dimweb, LM/LLM pronouns

Recent Activity

updated a dataset 9 days ago

dim/hendrycks_math_train_1k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy

published a dataset 9 days ago

dim/hendrycks_math_train_1k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy

updated a dataset 9 days ago

dim/hendrycks_math_test_500_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy

View all activity

Organizations

dim's activity

updated a dataset 9 days ago

dim/hendrycks_math_train_1k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy

Viewer • Updated 9 days ago • 1k • 60

published a dataset 9 days ago

dim/hendrycks_math_train_1k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy

Viewer • Updated 9 days ago • 1k • 60

updated a dataset 9 days ago

dim/hendrycks_math_test_500_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy

Viewer • Updated 9 days ago • 500 • 110

published a dataset 15 days ago

dim/hendrycks_math_test_500_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy

Viewer • Updated 9 days ago • 500 • 110

updated a dataset 16 days ago

dim/hendrycks_math_test_500_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096

Viewer • Updated 16 days ago • 500 • 85

published a dataset 16 days ago

dim/hendrycks_math_test_500_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096

Viewer • Updated 16 days ago • 500 • 85

updated a dataset 19 days ago

dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_8192

Viewer • Updated 19 days ago • 12k • 85

published a dataset 19 days ago

dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_8192

Viewer • Updated 19 days ago • 12k • 85

updated a dataset 19 days ago

dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096

Viewer • Updated 19 days ago • 12k • 137

published a dataset 19 days ago

dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096

Viewer • Updated 19 days ago • 12k • 137

upvoted a paper 20 days ago

TULIP: Towards Unified Language-Image Pretraining

Paper • 2503.15485 • Published Mar 19 • 48

updated a dataset 20 days ago

dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_32768

Viewer • Updated 20 days ago • 12k • 54

published a dataset 20 days ago

dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_32768

Viewer • Updated 20 days ago • 12k • 54

upvoted a paper 23 days ago

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published about 1 month ago • 129

liked a model 24 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • Updated Feb 24 • 1.84M • 1.18k

liked a Space 28 days ago

IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

🎙

Generate speech from text using reference audio

liked a Space about 1 month ago

382

SanaSprint

👁

Ultra fast high quality image generation

liked a Space about 2 months ago

187

Orpheus TTS

🚀

Try Orpheus TTS here

updated a dataset about 2 months ago

dim/open_orca_4475_DeepSeek-R1-Distill-Qwen-1.5B

Viewer • Updated Mar 13 • 4.48k • 28

published a dataset about 2 months ago

dim/open_orca_4475_DeepSeek-R1-Distill-Qwen-1.5B

Viewer • Updated Mar 13 • 4.48k • 28