11 44 170

dinhanhx

dinhanhx

AI & ML interests

Vision Language

Recent Activity

upvoted a collection about 9 hours ago

DocAI

liked a model 3 days ago

nvidia/Llama-3.1-Nemotron-Nano-8B-v1

upvoted an article 9 days ago

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

View all activity

Organizations

dinhanhx's activity

upvoted a collection about 9 hours ago

DocAI

Collection

20 items • Updated 18 days ago • 1

upvoted an article 9 days ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15, 2024

• 179

upvoted an article 10 days ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

Mar 26

• 125

upvoted an article 11 days ago

Article

SigLIP 2: A better multilingual vision language encoder

Feb 21

• 161

upvoted a collection 12 days ago

Unsloth 4-bit Dynamic Quants

Collection

Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit • 28 items • Updated 7 days ago • 80

upvoted a collection 14 days ago

NVILA

Collection

9 items • Updated 21 days ago • 11

upvoted a paper 2 months ago

π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 11

upvoted 4 articles 3 months ago

Article

Vision Language Models Explained

Apr 11, 2024

• 326

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 174

Article

Visual Document Retrieval Goes Multilingual

Jan 10

• 73

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 852

upvoted a paper 4 months ago

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Paper • 2412.07626 • Published Dec 10, 2024 • 22

upvoted 3 collections 6 months ago