thehandsomefrog4825's Collections

LLM 🦜

updated Feb 12

  • Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

    Paper • 2412.13663 • Published Dec 18, 2024 • 149

  • Qwen2.5 Technical Report

    Paper • 2412.15115 • Published Dec 19, 2024 • 367

  • Are Your LLMs Capable of Stable Reasoning?

    Paper • 2412.13147 • Published Dec 17, 2024 • 95

  • Byte Latent Transformer: Patches Scale Better Than Tokens

    Paper • 2412.09871 • Published Dec 13, 2024 • 102

  • Apollo: An Exploration of Video Understanding in Large Multimodal Models

    Paper • 2412.10360 • Published Dec 13, 2024 • 146

  • Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

    Paper • 2412.05271 • Published Dec 6, 2024 • 157

  • Enhancing Human-Like Responses in Large Language Models

    Paper • 2501.05032 • Published Jan 9 • 57

  • Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

    Paper • 2502.06703 • Published Feb 10 • 152