MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling Paper • 2503.13440 • Published Mar 17 • 2
Training-Free Long-Context Scaling of Large Language Models Paper • 2402.17463 • Published Feb 27, 2024 • 25 • 4
The Mamba in the Llama: Distilling and Accelerating Hybrid Models Paper • 2408.15237 • Published Aug 27, 2024 • 42 • 6
An Empirical Study of Mamba-based Language Models Paper • 2406.07887 • Published Jun 12, 2024 • 1 • 2