Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning • Paper • 2504.11409 • Published Apr 2025
Hymba: A Hybrid-head Architecture for Small Language Models • Paper • 2411.13676 • Published Nov 20, 2024
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models • Paper • 2409.17481 • Published Sep 26, 2024
Minitron • Collection • A family of compressed models obtained via pruning and knowledge distillation • 12 items
LLM Pruning and Distillation in Practice: The Minitron Approach • Paper • 2408.11796 • Published Aug 21, 2024