Dan Goldstein
SmerkyG
15 followers · 5 following
AI & ML interests
None yet
Recent Activity
Updated a model about 18 hours ago: recursal/QRWKV6-7B-Instruct
Updated a model about 18 hours ago: recursal/QRWKV6-7B-Base
Authored a paper about 23 hours ago: RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale
Organizations
Papers
4
arxiv:2505.03005
arxiv:2503.14456
arxiv:2407.12077
arxiv:2404.05892
models
18
SmerkyG/Qwen3Softpick-8B-Base • Updated 4 days ago
SmerkyG/RWKV7-1.5B-World3-128k-250309 • Updated Mar 9 • 1
SmerkyG/rwkv7-0.4B-world • Text Generation • Updated Mar 2 • 7
SmerkyG/RWKV7-2.9B-World3-128k-250225 • Updated Feb 26 • 4
SmerkyG/rwkv7-1.5b-ctxlen-tests • Updated Feb 4
SmerkyG/RWKV7-Goose-0.1B-Pile-HF • Updated Feb 2 • 28
SmerkyG/RWKV7-Goose-0.4B-Pile-HF • Updated Feb 2 • 1
SmerkyG/RWKV7-Goose-1.4B-Pile-HF • Updated Feb 2
SmerkyG/RWKV7-Goose-0.1B-World2.8-HF • Updated Dec 18, 2024 • 10 • 1
SmerkyG/rwkv-6-world-v2.1-3b • Text Generation • Updated Jun 6, 2024 • 14
datasets
1
SmerkyG/DCLM-10B-Qwen2-binidx • Updated Mar 26 • 76