its5Q PRO

its5Q

https://t.me/dno5iq

AI & ML interests

None yet

its5Q's activity

liked a Space about 18 hours ago

WikiRacing Language Models

🏃

Find answers by racing against LLM in a quiz game

liked a dataset 3 days ago

nyuuzyou/svgfind

Viewer • Updated 10 days ago • 3.66M • 586 • 26

liked a model 10 days ago

qingy2024/Qwen3-0.6B

Updated 10 days ago • 107 • 16

liked a dataset 11 days ago

evborjnvioerjnvuowsetngboetgjbeigjaweuofjf/I-love-capybara

Viewer • Updated 10 days ago • 543k • 164 • 3

liked a Space 11 days ago

Multilingual LLM Tokenizers

⚡

Experience how multilingual tokenizers work.

reacted to nyuuzyou's post with 👍 11 days ago

Post

3702

🖼️ SVGRepo Icons Dataset - nyuuzyou/svgrepo

Collection of 217,510 Scalable Vector Graphics (SVG) icons featuring:

- Sourced from SVGRepo.com across diverse categories & styles
- Includes metadata: title, tags, source collection, and specific license
- Contains minified SVG markup for direct use or processing
- Organized into splits based on individual icon license (e.g., MIT, CC0, Apache)

reacted to nyuuzyou's post with 👍 13 days ago

Post

3588

🦅 SmolLM2-Eagle Collection - nyuuzyou/smollm2-eagle-680263bf97f0c7e6bbe4936b

Collection of fine-tuned bilingual language models featuring:

- Models in three parameter sizes: 135M, 360M, and 1.7B based on HuggingFaceTB's SmolLM2 models
- Both standard and GGUF formats for flexible deployment in llama.cpp and Ollama
- Fine-tuned on nyuuzyou/EagleSFT dataset (536,231 Russian-English QA pairs derived from 739k+ real user queries)
- Experimental Russian language capabilities while maintaining English performance
- Limited Russian capabilities due to SFT-only approach without Russian pre-training
- Environmental impact: ~19.75 kg CO2eq

This collection provides compact models for research on bilingual language capabilities, resource-constrained environments, and educational applications. Not recommended for production use due to experimental nature and inherent limitations. Available under Apache 2.0 license.

upvoted a collection 16 days ago

SmolLM2-Eagle

Collection

7 items • Updated 13 days ago • 4

liked a model 16 days ago

nyuuzyou/SmolLM2-135M-Eagle

Text Generation • Updated 20 days ago • 67 • 3

liked a dataset 20 days ago

nyuuzyou/EagleSFT

Viewer • Updated 22 days ago • 1.07M • 201 • 8

reacted to nyuuzyou's post with 👍 21 days ago

Post

2951

🦅 EagleSFT Dataset - nyuuzyou/EagleSFT

Collection of 536,231 question-answer pairs featuring:

- Human-posed questions and machine-generated responses for SFT
- Bilingual content in Russian and English with linked IDs
- Derived from 739k+ real user queries, primarily educational topics
- Includes unique IDs and machine-generated category labels

This dataset provides a resource for supervised fine-tuning (SFT) of large language models, cross-lingual research, and understanding model responses to diverse user prompts. Released to the public domain under CC0 1.0 license.