SeongWan Kim

idgmatrix

AI & ML interests

None yet

Organizations

None yet

idgmatrix's activity

upvoted 3 papers 1 day ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published 8 days ago • 88

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published 7 days ago • 34

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

Paper • 2505.02707 • Published 2 days ago • 65

upvoted 2 papers 9 days ago

BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs

Paper • 2504.18415 • Published 12 days ago • 41

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published 14 days ago • 105

upvoted 2 papers 14 days ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published 15 days ago • 102

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published 15 days ago • 60

upvoted 3 papers 15 days ago

QuZO: Quantized Zeroth-Order Fine-Tuning for Large Language Models

Paper • 2502.12346 • Published Feb 17 • 1

Training LLMs with MXFP4

Paper • 2502.20586 • Published Feb 27 • 1

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients

Paper • 2407.08296 • Published Jul 11, 2024 • 34

upvoted 7 papers 16 days ago

When are 1.58 bits enough? A Bottom-up Exploration of BitNet Quantization

Paper • 2411.05882 • Published Nov 8, 2024 • 1

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 615

1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs

Paper • 2410.16144 • Published Oct 21, 2024 • 5

Bitnet.cpp: Efficient Edge Inference for Ternary LLMs

Paper • 2502.11880 • Published Feb 17 • 2

BitNet a4.8: 4-bit Activations for 1-bit LLMs

Paper • 2411.04965 • Published Nov 7, 2024 • 69

BitNet: Scaling 1-bit Transformers for Large Language Models

Paper • 2310.11453 • Published Oct 17, 2023 • 102

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 19 days ago • 119

upvoted a paper 17 days ago

FANformer: Improving Large Language Models Through Effective Periodicity Modeling

Paper • 2502.21309 • Published Feb 28 • 1

upvoted a paper 19 days ago

Cobra: Efficient Line Art COlorization with BRoAder References

Paper • 2504.12240 • Published 21 days ago • 27

upvoted a paper 20 days ago

BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published 21 days ago • 70