4 33 4

Samuel Arcadinho

SSamDav

SSamDav

AI & ML interests

None yet

Recent Activity

upvoted a paper about 22 hours ago

RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale

upvoted a paper 9 days ago

Implicit Language Models are RNNs: Balancing Parallelization and Expressivity

upvoted a paper 23 days ago

MIEB: Massive Image Embedding Benchmark

View all activity

Organizations

SSamDav's activity

upvoted a paper about 22 hours ago

RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale

Paper • 2505.03005 • Published 3 days ago • 23

upvoted a paper 9 days ago

Implicit Language Models are RNNs: Balancing Parallelization and Expressivity

Paper • 2502.07827 • Published Feb 10 • 1

upvoted a paper 23 days ago

MIEB: Massive Image Embedding Benchmark

Paper • 2504.10471 • Published 24 days ago • 16

upvoted a paper 27 days ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published 28 days ago • 125

upvoted 2 papers 29 days ago

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

Paper • 2504.05599 • Published about 1 month ago • 81

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published 30 days ago • 159

upvoted a paper about 1 month ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 150

upvoted 5 papers about 2 months ago

upvoted 2 papers 2 months ago

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published Feb 20 • 100

MoBA: Mixture of Block Attention for Long-Context LLMs

Paper • 2502.13189 • Published Feb 18 • 17

upvoted 2 collections 3 months ago

Dria-Agent-a

Collection

powerful agentic models built for pythonic function calling • 4 items • Updated Feb 14 • 4

Tiny-Agent-a

Collection

fast and powerful agentic models designed to run on edge devices. • 6 items • Updated Feb 12 • 7

upvoted 4 papers 3 months ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 140

Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published Jan 31 • 22

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 120

DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning

Paper • 2411.04983 • Published Nov 7, 2024 • 13