- Direct Preference Optimization: Your Language Model is Secretly a Reward Model
  Paper • 2305.18290 • Published • 58
- The Prompt Report: A Systematic Survey of Prompting Techniques
  Paper • 2406.06608 • Published • 64
- Emu3: Next-Token Prediction is All You Need
  Paper • 2409.18869 • Published • 95
- Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
  Paper • 2504.01990 • Published • 274
Jakhongir Saydaliev
Jakh0103
AI & ML interests
None yet
Recent Activity
updated a model 3 days ago: Jakh0103/Qwen2.5-VL-3B-SFT-VSR
published a model 3 days ago: Jakh0103/Qwen2.5-VL-3B-SFT-VSR
updated a model 3 days ago: Jakh0103/Qwen2.5-VL-3B-GRPO-VSR