Carlos Fonseca PRO

carlfm01

AI & ML interests

None yet

Organizations

None yet

carlfm01's activity

liked a model 1 day ago

BAAI/bge-m3

Sentence Similarity • Updated Jul 3, 2024 • 3.64M • 2.03k

liked a dataset 2 days ago

rajpurkarlab/ReXGradient-160K

Viewer • Updated 3 days ago • 160k • 762 • 45

liked a dataset about 1 month ago

HuggingFaceM4/DocumentVQA

Viewer • Updated Dec 18, 2023 • 50k • 2.62k • 33

upvoted a paper about 2 months ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 104

upvoted a collection about 2 months ago

Mistral Small 3 (All Versions)

A collection of Mistral's new Small 3.1 and 3 models including GGUF, 4-bit and more! • 14 items • Updated 8 days ago • 10

liked a dataset 2 months ago

unsloth/RLAIF-V-Dataset

Viewer • Updated Sep 26, 2024 • 2.49k • 47 • 6

liked a model 3 months ago

HuggingFaceTB/SmolLM2-360M

Text Generation • Updated Feb 6 • 94k • 48

reacted to Jaward's post with 👀🔥 3 months ago

Post

3916

Finally here it is: a faster, custom, scalable GRPO trainer for smaller models with < 500M params, can train on 8gb ram cpu, also supports gpu for sanity sake (includes support for vllm + flash attention). Using smolLM2-135M/360M-instructs as ref & base models. Experience your own “aha” moment 🐳 on 8gb ram.
Code: https://github.com/Jaykef/ai-algorithms/blob/main/smollm2_360M_135M_grpo_gsm8k.ipynb

2 replies

liked 2 datasets 3 months ago

ylacombe/cml-tts

Viewer • Updated Nov 24, 2023 • 1.34M • 10.7k • 21

bespokelabs/Bespoke-Stratos-17k

Viewer • Updated Jan 31 • 16.7k • 19.6k • 308

liked 2 models 4 months ago

deepseek-ai/DeepSeek-R1-Distill-Llama-70B

Text Generation • Updated Feb 24 • 199k • 669

deepseek-ai/DeepSeek-R1

Text Generation • Updated Mar 27 • 1.31M • 12.1k

liked a dataset 4 months ago

microsoft/PEACE

Viewer • Updated Mar 17 • 7.73k • 576 • 19

reacted to danielhanchen's post with ❤️🚀 5 months ago

Post

3773

Yay we got 500K+ monthly HF downloads on our Unsloth HF repo! :) Super appreciate everyone in the OSS community - and thanks for using Unsloth!!

4 replies

liked a dataset 5 months ago

microsoft/MAGIC

Viewer • Updated Dec 17, 2024 • 48.1k • 40 • 13

liked 2 models 5 months ago

Qwen/Qwen2.5-1.5B-Instruct

Text Generation • Updated Sep 25, 2024 • 2.17M • 426

deepseek-ai/DeepSeek-V2.5-1210

Text Generation • Updated Dec 11, 2024 • 523 • 254