Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
ProCreations 
posted an update about 17 hours ago
Post
976
Post of the Day

I’m fine-tuning Qwen 2.5-0.5B to be extremely good at math, using high-quality datasets and some smart training strategies.
The logs are looking really promising so far!

Expected release:
Tomorrow morning?
I’ll post as soon as it’s ready — stay tuned.

If you want faster updates or just wanna chat about it, come join my Discord:
https://discord.gg/EXsug2Ux29
(Heads up: we might ask a couple quick questions when you join — just making sure we keep the server safe.)

Also, check out one of the datasets we’re using:
ProCreations/SimpleMath

This project is also helping shape the future of IntellIte.
The insights and techniques we’re developing here — better dataset curation, fine-tuning tricks, and evaluation methods — will directly contribute to making IntellIte even sharper, faster, and more reliable, especially for math and reasoning tasks.

Big progress ahead. Can’t wait to share it with you all!

I've done something like that before but I got a model overfitted to memorize multiplication tables. 🥲
Starting with the basics is a good idea. Those LLMs are big and hard to debug. Hope it turns out well!