Yasunori Ozaki PRO

alfredplpl

https://alfredplpl.github.io/en/index.html

AI & ML interests

Computer Vision, LLM

Recent Activity

liked a model 1 day ago

Freepik/nsfw_image_detector

updated a dataset 3 days ago

aidealab/aidealab-videojp-eval

liked a dataset 3 days ago

Spawning/pd12m-full

alfredplpl's activity

upvoted an article 6 days ago

Article

Mixture of Experts Explained

Dec 11, 2023

• 601

upvoted a paper 24 days ago

MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

Paper • 2504.08388 • Published 27 days ago • 39

upvoted a paper 27 days ago

Scaling Laws for Native Multimodal Models

Paper • 2504.07951 • Published 28 days ago • 27

upvoted a paper 28 days ago

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published about 1 month ago • 73

upvoted a collection about 1 month ago

Llama 4

Collection

Llama 4 release • 13 items • Updated 10 days ago • 481

upvoted a paper about 1 month ago

AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset

Paper • 2503.19462 • Published Mar 25 • 10

upvoted a paper about 2 months ago

VBench: Comprehensive Benchmark Suite for Video Generative Models

Paper • 2311.17982 • Published Nov 29, 2023 • 9

upvoted 2 collections about 2 months ago

Gemma 3

Collection

4 items • Updated Mar 12 • 15

Gemma 3 Release

Collection

24 items • Updated 20 days ago • 357

upvoted 8 papers 3 months ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 144

FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

Paper • 2502.05179 • Published Feb 7 • 24

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published Feb 7 • 104

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 229

VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models

Paper • 2502.02492 • Published Feb 4 • 65

upvoted an article 3 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 852

upvoted 2 papers 4 months ago

Textoon: Generating Vivid 2D Cartoon Characters from Text Descriptions

Paper • 2501.10020 • Published Jan 17 • 23

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published Jan 9 • 55