MoM: Linear Sequence Modeling with Mixture-of-Memories Paper • 2502.13685 • Published Feb 19 • 36
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback Paper • 2501.12895 • Published Jan 22 • 61
ringos/output_Llama-3.1-8B-simpleqa-0_1000-m_generation-n_128-t_1.0-k_50-p_0.95-l_128 Updated Dec 25, 2024 • 22
Running 557 557 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute
ringos/output_Llama-3.1-8B-simpleqa-0_-1-m_generation-n_128-t_1.0-k_50-p_0.95-l_128 Updated Dec 17, 2024 • 76
ringos/output_Mistral-Nemo-Base-2407-simpleqa-0_1000-m_generation-n_32-t_1.0-k_40-p_0.9-l_128 Viewer • Updated Dec 2, 2024 • 216 • 58
ringos/bio-detailed-Llama-3.1-8B-gemma2-rm-gold_True-n_32 Viewer • Updated Nov 13, 2024 • 371 • 32
ringos/bio-detailed-Llama-3.1-8B-gemma2-rm-gold_True-n_32 Viewer • Updated Nov 13, 2024 • 371 • 32
ringos/bio-detailed-Llama-3.1-8B-gemma2-rm-gold_True-n_32 Viewer • Updated Nov 13, 2024 • 371 • 32