16 134

daisuke

dai

AI & ML interests

dai incl. AI. so Contender.

Recent Activity

commented on an article 3 days ago

Exploring 300+ Model Context Protocol (MCP) Servers: For Claude, Cursor, and AI Coding

liked a model 7 days ago

JetBrains/Mellum-4b-base

liked a model 10 days ago

moonshotai/Kimi-Audio-7B

Organizations

dai's activity

upvoted 2 collections about 1 month ago

Gemma 3 QAT

Quantization-aware trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory • 15 items • Updated 20 days ago • 187

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated 10 days ago • 463

upvoted a collection about 2 months ago

EXAONE-Deep

EXAONE reasoning model series of 2.4B, 7.8B, and 32B, optimized for reasoning tasks including math and coding • 9 items • Updated Mar 18 • 86

upvoted a collection 2 months ago

steiner-preview

Reasoning models trained on synthetic data using reinforcement learning. • 3 items • Updated Oct 20, 2024 • 32

upvoted a collection 3 months ago

vision

1 item • Updated Feb 19 • 1

upvoted 3 articles 3 months ago

Article

Welcome Fireworks.ai on the Hub 🎆

Feb 14

• 58

Article

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

Feb 12

• 64

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 174

upvoted a collection 4 months ago

DeepSeek-V3

4 items • Updated Mar 25 • 249

upvoted a collection 7 months ago

Gemma 2 JPN Release

A Gemma 2 2B model fine-tuned on Japanese text. It supports Japanese at the same level of performance as English-only queries on Gemma 2. • 3 items • Updated Apr 3 • 28

upvoted an article 9 months ago

Article

XetHub is joining Hugging Face!

Aug 8, 2024

• 92

upvoted a collection 9 months ago

Gemma 2 2B Release

The 2.6B parameter version of Gemma 2. • 6 items • Updated Apr 3 • 79

upvoted a paper over 1 year ago

WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation

Paper • 2312.14187 • Published Dec 20, 2023 • 52