Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Tempo14 's Collections
Interpretability
Encoder
Transformer
Diffusion
scaling
self critic
layer
latent reasoning
images
RWKV
Autoregressvie Image Generation
video
World Model
Tools
Reasoning
Attention
interesting
Summary
Long Context
QA
hallucination
small models
Traffic
Code
Fine-Tuning
cpu inference
Prompt Engineering
Mixture of Experts
motion
chain of thought
robotic
new architecture
outperform gpt-4
RLHF
german model
fast
mobile device
efficient inference
alignment
quantization
practical
agents
Synthetic Dataset
mamba
Instruction Tuning
reinforcement learning
compress
Self Improvement
Inpaint
Training
vision
Linear
3D
Math
Embedding
RAG
Stable Diffusion
In-Context
comparison
Molecular
Merging
Pre-Training
Unlearning
Tokenizer
Memory
Spaces
Multimodal
Edit Pictures
Yolo
Music

images

updated Mar 30
Upvote
-

  • SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer

    Paper • 2501.18427 • Published Jan 30 • 20

  • Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation

    Paper • 2502.20388 • Published Feb 27 • 16

  • SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

    Paper • 2503.09641 • Published Mar 12 • 38

  • Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation

    Paper • 2503.16430 • Published Mar 20 • 35
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs