view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 246
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs Paper • 2504.11536 • Published 22 days ago • 60
view article Article Hugging Face and Cloudflare Partner to Make Real-Time Speech and Video Seamless with FastRTC 29 days ago • 24
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26 • 125
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality Mar 4 • 74
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 131