Daniel van Strien's picture

Daniel van Strien PRO

davanstrien

·

https://danielvanstrien.xyz/

AI & ML interests

Machine Learning Librarian

Recent Activity

updated a dataset 6 minutes ago

davanstrien/video-subfolder-example

published a dataset 7 minutes ago

davanstrien/video-subfolder-example

liked a model about 1 hour ago

nvidia/parakeet-tdt-0.6b-v2

View all activity

Organizations

davanstrien's activity

upvoted 8 collections about 2 hours ago

Annif models

Annif models for text classification and subject indexing. FintoAI prefixed models are in use at Finto AI: https://ai.finto.fi • 6 items • Updated Feb 13 • 3

Eynollah models

Eynollah models for document image processing and layout analysis tasks. • 14 items • Updated Mar 27 • 3

YOLOv8 Datasets

This collection contains all our datasets for YOLOv8 Object detection trainings. • 1 item • Updated Aug 20, 2024 • 1

YOLOv8 Models

This collection includes models designed for Object detection using YOLOv8. • 1 item • Updated Aug 20, 2024 • 1

Datasets ATR line-level

This collection contains all our datasets for Automatic Text Recognition on line images. • 12 items • Updated Mar 14, 2024 • 4

SpaCy

This collection includes models designed for Named Entity Recognition. • 3 items • Updated Mar 13, 2024 • 1

Doc-UFCN

This Doc-UFCN collection contains models designed to run various DLA tasks like the text line detection or page segmentation. • 4 items • Updated Mar 13, 2024 • 3

PyLaia

The PyLaia collection contains models designed for Automatic Text Recognition (ATR) from line images. • 15 items • Updated Aug 9, 2024 • 4

upvoted an article about 22 hours ago

Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

9 days ago

• 23

upvoted a collection 9 days ago

Qwen3

27 items • Updated about 7 hours ago • 544

upvoted a paper 14 days ago

Organize the Web: Constructing Domains Enhances Pre-Training Data Curation

Paper • 2502.10341 • Published Feb 14 • 2

upvoted a paper 16 days ago

Aioli: A Unified Optimization Framework for Language Model Data Mixing

Paper • 2411.05735 • Published Nov 8, 2024 • 1

upvoted 2 collections 20 days ago

Cell2Sentence Models

Cell2Sentence models trained for single-cell tasks • 5 items • Updated 22 days ago • 7

blt

4 items • Updated 21 days ago • 17

upvoted a paper 21 days ago

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published Mar 10 • 61

upvoted a collection 22 days ago

🏜️MIRAGE-Bench [NAACL'25]

Dataset Collection from the MIRAGE-Bench paper • 13 items • Updated Mar 31 • 2