21 16 42

Noob

noobmldude

AI & ML interests

Explainable AI

Recent Activity

liked a model 7 days ago

JetBrains/Mellum-4b-base

upvoted an article 14 days ago

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

new activity 16 days ago

mistralai/Codestral-22B-v0.1:🚩 Report: Not working

View all activity

Organizations

noobmldude's activity

liked a model 7 days ago

JetBrains/Mellum-4b-base

Text Generation • Updated about 18 hours ago • 1.89k • 259

upvoted an article 14 days ago

Article

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

Apr 29, 2024

• 77

New activity in mistralai/Codestral-22B-v0.1 16 days ago

🚩 Report: Not working

#55 opened 4 months ago by

garbagedog2

liked a Space 21 days ago

On-Device LLM Throughput Calculator

🚀

Generate throughput plots for LLMs on devices

upvoted 2 papers about 1 month ago

LocAgent: Graph-Guided LLM Agents for Code Localization

Paper • 2503.09089 • Published Mar 12 • 10

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63

liked a dataset about 1 month ago

bigcode/self-oss-instruct-sc2-exec-filter-50k

Viewer • Updated Nov 4, 2024 • 50.7k • 329 • 99

liked a Space about 1 month ago

190

Check My Progress Deep RL Course

👀

Check your progress in a Deep RL course

New activity in nanotron/ultrascale-playbook about 1 month ago

How to download as pdf?

#74 opened 3 months ago by

vcoyk

upvoted 3 papers about 2 months ago

2BP: 2-Stage Backpropagation

Paper • 2405.18047 • Published May 28, 2024 • 27

UFT: Unifying Fine-Tuning of SFT and RLHF/DPO/UNA through a Generalized Implicit Reward Function

Paper • 2410.21438 • Published Oct 28, 2024 • 2

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

Paper • 2503.10460 • Published Mar 13 • 28

liked a Space 3 months ago

2.56k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

New activity in hexgrad/Kokoro-82M 3 months ago

Free kokoro TTS endpoint - no strings attached.

#37 opened 4 months ago by

mhenrichsen

liked a Space 3 months ago

729

TTS Arena V2

🏆

Vote on the latest TTS models!

upvoted a paper 3 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 229

liked a model 6 months ago

Qwen/Qwen2.5-Coder-7B-Instruct

Text Generation • Updated Jan 12 • 296k • • 474

liked a Space 7 months ago

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

📝

Evaluate multilingual models using FineTasks

upvoted a collection 7 months ago

Code Evaluation

Collection

Collection of Papers on Code Evaluation (from code generation language models) • 45 items • Updated Oct 29, 2024 • 15

upvoted an article 7 months ago

Article

FineVideo: behind the scenes

Sep 23, 2024

• 31