floom's Collections

ShowAndTell-2025-01-30

updated Feb 3

  • Atla Selene Mini: A General Purpose Evaluation Model

    Paper • 2501.17195 • Published Jan 27 • 36

  • DeepSeek-V3 Technical Report

    Paper • 2412.19437 • Published Dec 27, 2024 • 62

  • Optimizing Large Language Model Training Using FP4 Quantization

    Paper • 2501.17116 • Published Jan 28 • 38

  • DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

    Paper • 2402.03300 • Published Feb 5, 2024 • 118

  • DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

    Paper • 2501.12948 • Published Jan 22 • 390

  • Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

    Paper • 2501.11873 • Published Jan 21 • 66

  • RL + Transformer = A General-Purpose Problem Solver

    Paper • 2501.14176 • Published Jan 24 • 28

  • Autonomy-of-Experts Models

    Paper • 2501.13074 • Published Jan 22 • 45