SakanaAI's Collections

TinySwallow

updated Jan 30

Compact Japanese models trained with "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"

  • SakanaAI/TinySwallow-1.5B

    Text Generation • Updated Jan 30 • 37.2k downloads • 24 likes

  • SakanaAI/TinySwallow-1.5B-Instruct

    Text Generation • Updated Jan 30 • 2.87k downloads • 46 likes

  • SakanaAI/TinySwallow-1.5B-Instruct-q4f32_1-MLC

    Text Generation • Updated Jan 30 • 3

  • SakanaAI/TinySwallow-1.5B-Instruct-GGUF

    Text Generation • Updated Jan 30 • 937 downloads • 23 likes

  • TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

    Paper • arXiv:2501.16937 • Published Jan 28 • 6 upvotes