Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language Paper ⢠2406.20085 ⢠Published Jun 28, 2024 ⢠13 ⢠3
Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Paper ⢠2504.07866 ⢠Published 18 days ago ⢠10 ⢠3
Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Paper ⢠2504.07866 ⢠Published 18 days ago ⢠10 ⢠3
Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback Paper ⢠2405.20216 ⢠Published May 30, 2024 ⢠22 ⢠3
MoBA: Mixture of Block Attention for Long-Context LLMs Paper ⢠2502.13189 ⢠Published Feb 18 ⢠17 ⢠2
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper ⢠2411.14405 ⢠Published Nov 21, 2024 ⢠62 ⢠4
Zero-shot Model-based Reinforcement Learning using Large Language Models Paper ⢠2410.11711 ⢠Published Oct 15, 2024 ⢠9 ⢠4
Context is Key(NMF): Modelling Topical Information Dynamics in Chinese Diaspora Media Paper ⢠2410.12791 ⢠Published Oct 16, 2024 ⢠5 ⢠3
Named Clinical Entity Recognition Benchmark Paper ⢠2410.05046 ⢠Published Oct 7, 2024 ⢠17 ⢠3
Training Language Models on Synthetic Edit Sequences Improves Code Synthesis Paper ⢠2410.02749 ⢠Published Oct 3, 2024 ⢠12 ⢠3
LLaVA-Critic: Learning to Evaluate Multimodal Models Paper ⢠2410.02712 ⢠Published Oct 3, 2024 ⢠36 ⢠3
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning Paper ⢠2409.12568 ⢠Published Sep 19, 2024 ⢠51 ⢠4
Insights from Benchmarking Frontier Language Models on Web App Code Generation Paper ⢠2409.05177 ⢠Published Sep 8, 2024 ⢠7 ⢠3
Open Language Data Initiative: Advancing Low-Resource Machine Translation for Karakalpak Paper ⢠2409.04269 ⢠Published Sep 6, 2024 ⢠11 ⢠3
Open Language Data Initiative: Advancing Low-Resource Machine Translation for Karakalpak Paper ⢠2409.04269 ⢠Published Sep 6, 2024 ⢠11 ⢠3