When Giant Language Brains Just Aren't Enough! Domain Pizzazz with Knowledge Sparkle Dust Paper • 2305.07230 • Published May 12, 2023 • 2
Reasoning Under 1 Billion: Memory-Augmented Reinforcement Learning for Large Language Models Paper • 2504.02273 • Published Apr 3 • 5
Multi-Reference Preference Optimization for Large Language Models Paper • 2405.16388 • Published May 26, 2024 • 1