Jaward Sesay

Jaward

AI & ML interests

I like to train large deep neural nets too 🧠🤖💥 | First Paper (AutoAgents: A Framework for Automatic Agent Generation) Accepted @ IJCAI 2024 | Role Model Karpathy

Recent Activity

liked a Space 2 days ago

maitrix-org/Voila-demo

liked a model 2 days ago

maitrix-org/Voila-audio-alpha

authored a paper 2 days ago

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

Jaward's activity

upvoted a paper 3 days ago

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

Paper • 2505.02707 • Published 3 days ago • 70

upvoted a paper 10 days ago

Even Small Reasoners Should Quote Their Sources: Introducing the Pleias-RAG Model Family

Paper • 2504.18225 • Published 13 days ago • 12

upvoted a paper 11 days ago

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published 15 days ago • 105

upvoted a paper 16 days ago

Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published 17 days ago • 42

upvoted an article 20 days ago

Cohere on Hugging Face Inference Providers 🔥

Article • 23 days ago • 124

upvoted a paper 24 days ago

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published 27 days ago • 123

upvoted 2 papers 30 days ago

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published about 1 month ago • 102

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published about 1 month ago • 180

upvoted 3 papers about 2 months ago

Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills

Paper • 2503.12533 • Published Mar 16 • 66

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published Mar 14 • 140

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 162

upvoted 2 papers 2 months ago

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

Paper • 2503.04598 • Published Mar 6 • 20

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published Mar 3 • 87

upvoted 2 papers 3 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 61

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3 • 213

upvoted an article 3 months ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • Jan 28 • 852

upvoted a paper 4 months ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 114

upvoted a collection 4 months ago

Cosmos

Collection • The collection of Cosmos models • 31 items • Updated 3 days ago • 287

upvoted 2 papers 6 months ago

Multimodal Autoregressive Pre-training of Large Vision Encoders

Paper • 2411.14402 • Published Nov 21, 2024 • 47

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 125