Arabic Leaderboards: Introducing Arabic Instruction Following, Updating AraGen, and More about 1 month ago • 16
Softpick: No Attention Sink, No Massive Activations with Rectified Softmax Paper • 2504.20966 • Published 9 days ago • 25
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications Paper • 2303.15446 • Published Mar 27, 2023 • 1
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications Paper • 2206.10589 • Published Jun 21, 2022
VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding Paper • 2406.09418 • Published Jun 13, 2024
Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model Paper • 2503.21782 • Published Mar 27
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding Paper • 2504.13180 • Published 21 days ago • 17
ArTST - Arabic Text Speech Transformer Collection Open source project for Arabic Speech Recognition and Generation • 14 items • Updated 19 days ago • 10