Elie Bakouch's picture

Elie Bakouch

eliebak

·

AI & ML interests

Training LLM's @ 🤗

Recent Activity

commented on a paper 2 days ago

70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float

View all activity

Organizations

eliebak's activity

commented a paper 2 days ago

70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float

Paper • 2504.11651 • Published 13 days ago • 27 •

commented a paper 19 days ago

Rethinking Reflection in Pre-Training

Paper • 2504.04022 • Published 23 days ago • 77 •

New activity in nanotron/ultrascale-playbook about 2 months ago

typo error report

#100 opened about 2 months ago by

fix

#101 opened about 2 months ago by

New activity in open-r1/OpenR1-Qwen-7B about 2 months ago

About Training Detail

#4 opened about 2 months ago by

New activity in HuggingFaceTB/SmolLM2-1.7B-intermediate-checkpoints about 2 months ago

I think the ckpt orders are messed up.

#1 opened about 2 months ago by

New activity in nanotron/ultrascale-playbook about 2 months ago

Few Errors

#86 opened 2 months ago by

New activity in open-r1/OpenR1-Qwen-7B about 2 months ago

different max_position_embeddings and rope_theta in and OpenR1-Qwen-7B-SFT and it's base Qwen2.5-Math-7B-Instruct ?

#3 opened about 2 months ago by

New activity in nanotron/ultrascale-playbook about 2 months ago

Fix typos

#99 opened about 2 months ago by

luismirandacruz

Fix typos (merged with #95)

#96 opened about 2 months ago by

luismirandacruz

fix-typos

#97 opened about 2 months ago by

fix typos

#78 opened 2 months ago by

Typos

#80 opened 2 months ago by

Typos

#81 opened 2 months ago by

Fix typos

#92 opened about 2 months ago by

luismirandacruz

Fix typo

#76 opened 2 months ago by

typosss

#95 opened about 2 months ago by

Make it easier to import into reader applications

#77 opened 2 months ago by

How can the following figure be obtained, and is there a way to tag the name of each tensor during profiling?

#83 opened 2 months ago by

🚩 Report: Ethical issue(s)

#87 opened 2 months ago by