Mathias Nielsen's picture

Mathias Nielsen

mathiasn1

·

https://grandaiwizard.com/

AI & ML interests

🏢 Senior Machine Learning Engineer @ https://mediacatch.io/

Recent Activity

liked a Space 4 days ago

nvidia/describe-anything-model-demo

liked a model 4 days ago

JetBrains/Mellum-4b-base

liked a model 6 days ago

Qwen/Qwen3-235B-A22B

View all activity

Organizations

mathiasn1's activity

upvoted a paper 7 days ago

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published 8 days ago • 34

upvoted an article 8 days ago

Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

9 days ago

• 23

upvoted a collection 9 days ago

Qwen3

27 items • Updated about 7 hours ago • 544

upvoted an article 12 days ago

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

13 days ago

• 220

upvoted a paper 13 days ago

Scaling Language-Free Visual Representation Learning

Paper • 2504.01017 • Published Apr 1 • 29

upvoted a collection 13 days ago

Web-SSL

17 items • Updated 15 days ago • 14

upvoted a collection 19 days ago

blt

4 items • Updated 21 days ago • 17

upvoted a paper 19 days ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 102

upvoted 2 collections 19 days ago

Perception Encoder

9 items • Updated 21 days ago • 47

Perception LM

7 items • Updated 21 days ago • 42

upvoted a collection 27 days ago

Cogito v1 Preview

5 items • Updated about 1 month ago • 108

upvoted an article about 1 month ago

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

Apr 5

• 142

upvoted 2 collections about 1 month ago

Llama 4

Llama 4 release • 13 items • Updated 9 days ago • 480

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 20 days ago • 187

upvoted a paper about 1 month ago

Dolphin: A Large-Scale Automatic Speech Recognition Model for Eastern Languages

Paper • 2503.20212 • Published Mar 26 • 5

upvoted 2 articles about 1 month ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

Mar 26

• 125

Article

Introducing Gradio's new Dataframe!

Mar 24

• 24

upvoted 2 collections about 2 months ago

MambaVision

MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. • 13 items • Updated 3 days ago • 31

Llama Nemotron

Open, Production-ready Enterprise Models • 5 items • Updated 3 days ago • 50

upvoted an article about 2 months ago

Article

NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets

Mar 18

• 35