robbie's picture

robbie

robb-0

·

AI & ML interests

🍥

Recent Activity

updated a model 2 days ago

robb-0/miami-beach-hologram

updated a model 2 days ago

robb-0/billys-vintage-cars

updated a model 2 days ago

robb-0/toyland-dreamstyle

View all activity

Organizations

robb-0's activity

upvoted a paper 5 days ago

Beyond Chains of Thought: Benchmarking Latent-Space Reasoning Abilities in Large Language Models

Paper • 2504.10615 • Published 24 days ago • 1

upvoted a collection 5 days ago

Granite 4.0 Language Models

2 items • Updated 6 days ago • 9

upvoted a paper 20 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 140

upvoted a collection about 1 month ago

Llama 4

Llama 4 release • 13 items • Updated 9 days ago • 480

upvoted a paper about 1 month ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 52

upvoted 3 collections about 1 month ago

— UI is a good thing 💅 —

cool spaces with a cool UI, what could be better? • 5 items • Updated 3 days ago • 17

My Bookmarks

144 items • Updated 21 days ago • 3

Spaces for LLM / VLM / NLP

1064 items • Updated about 9 hours ago • 10

upvoted 5 papers about 2 months ago

Model Hubs and Beyond: Analyzing Model Popularity, Performance, and Documentation

Paper • 2503.15222 • Published Mar 19 • 1

The AI Community Building the Future? A Quantitative Analysis of Development Activity on Hugging Face Hub

Paper • 2405.13058 • Published May 20, 2024 • 2

SpaceByte: Towards Deleting Tokenization from Large Language Modeling

Paper • 2404.14408 • Published Apr 22, 2024 • 7

T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Paper • 2406.19223 • Published Jun 27, 2024 • 11

Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information

Paper • 2502.14258 • Published Feb 20 • 26

upvoted a collection about 2 months ago

Foundation Text-Generation Models Below 360M Parameters

Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 36 items • Updated Apr 6 • 31

upvoted a paper about 2 months ago

Finch: Prompt-guided Key-Value Cache Compression

Paper • 2408.00167 • Published Jul 31, 2024 • 18

upvoted a collection about 2 months ago

Hallucination

14 items • Updated Jun 10, 2024 • 8

upvoted 3 papers about 2 months ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 162

OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models

Paper • 2503.08686 • Published Mar 11 • 19

Charting and Navigating Hugging Face's Model Atlas

Paper • 2503.10633 • Published Mar 13 • 81