9-Volt Fan

9voltfan2009

AI & ML interests

None yet

Recent Activity

Organizations

9-Volt Fan's profile picture

9voltfan2009's activity

reacted to AdinaY's post with ๐Ÿ˜Ž about 18 hours ago
view post
Post
3316
ACE-Step ๐ŸŽต a music generation foundation model released by
StepFun & ACEStudio

Model: ACE-Step/ACE-Step-v1-3.5B
Demo: ACE-Step/ACE-Step

โœจ 3.5B, Apache2.0 licensed
โœจ 115ร— faster than LLMs (4-min music in 20s on A100)
โœจ Diffusion + DCAE + linear transformer = speed + coherence
โœจ Supports voice cloning, remixing, lyric editing & more
  • 1 reply
ยท
reacted to prithivMLmods's post with ๐Ÿ‘ about 19 hours ago
view post
Post
1951
Well, hereโ€™s the updated version with the 20,000+ entry sampled dataset for Watermark Filter Content Moderation models incl. [Food25, Weather, Watermark, Marathi/Hindi Sign Language Detection], post-trained from the base models: sigLip2 patch16 224 โ€” now with mixed aspect ratios for better performance and reduced misclassification. ๐Ÿ”ฅ

Models :
โžฎ Watermark-Detection : prithivMLmods/Watermark-Detection-SigLIP2
โŒจ๏ธŽ Watermark Detection & Batch Image Processing Experimentals, Colab Notebook : https://colab.research.google.com/drive/1mlQrSsSjkGimUt0VyRi3SoWMv8OMyvw3?usp=drive_link
โžฎ Weather-Image-Classification : prithivMLmods/Weather-Image-Classification
โžฎ TurkishFoods-25 : prithivMLmods/TurkishFoods-25
โžฎ Marathi-Sign-Language-Detection : prithivMLmods/Marathi-Sign-Language-Detection
โžฎ Hindi-Sign-Language-Detection : prithivMLmods/Hindi-Sign-Language-Detection

Datasets :
Watermark : qwertyforce/scenery_watermarks
Weather : prithivMLmods/WeatherNet-05-18039
Turkish Foods 25 : yunusserhat/TurkishFoods-25
Marathi Sign Language : VinayHajare/Marathi-Sign-Language
Hindi Sign Language : Vedant3907/Hindi-Sign-Language-Dataset

Collection : prithivMLmods/content-filters-siglip2-vit-68197e3357d4de18fb3b4d2b
reacted to nyuuzyou's post with โค๏ธ๐Ÿ”ฅ about 19 hours ago
view post
Post
3201
nyuuzyou/svgfind ๐Ÿ‘€

Well, everything happens for the first time ๐Ÿค—. Thank you all!
reacted to fdaudens's post with ๐Ÿ‘๐Ÿ”ฅ 5 days ago
view post
Post
2914
Forget everything you know about transcription models - NVIDIA's parakeet-tdt-0.6b-v2 changed the game for me!

Just tested it with Steve Jobs' Stanford speech and was speechless (pun intended). The video isnโ€™t sped up.

3 things that floored me:
- Transcription took just 10 seconds for a 15-min file
- Got a CSV with perfect timestamps, punctuation & capitalization
- Stunning accuracy (correctly captured "Reed College" and other specifics)

NVIDIA also released a demo where you can click any transcribed segment to play it instantly.

The improvement is significant: number 1 on the ASR Leaderboard, 6% error rate (best in class) with complete commercial freedom (cc-by-4.0 license).

Time to update those Whisper pipelines! H/t @Steveeeeeeen for the finding!

Model: nvidia/parakeet-tdt-0.6b-v2
Demo: nvidia/parakeet-tdt-0.6b-v2
ASR Leaderboard: hf-audio/open_asr_leaderboard
  • 1 reply
ยท
reacted to DevinGrey's post with ๐Ÿ‘€ 6 days ago
view post
Post
1481
hello All. I am new to all of this and just beginning to learn how to use hugging face and AI in general. How can I access an ai code developer for help in setting up a website?
ยท
reacted to lukmanaj's post with ๐Ÿ˜Ž 6 days ago
view post
Post
2162
Iโ€™m excited to share that Iโ€™ve completed the Hugging Face Agents Course and earned my certificate.

Over the past few months, I explored how to build intelligent, autonomous agents using cutting-edge tools like smolagents, LlamaIndex, and LangGraph. The course covered everything from the fundamentals of agents to advanced topics like fine-tuning for function-calling, observability, evaluation, and even agents in games.

Some key content included:

1. Introduction to AI Agents

2. Agentic RAG use cases

3. Multi-framework implementation: smolagents, LlamaIndex, and LangGraph

4. Building, testing, and certifying a complete agent project

This was a hands-on, practical experience that deepened my understanding of how to design reliable, tool-using LLM agents. Looking forward to leveraging these skills in real-world applications in healthcare, logistics, and beyond.

Many thanks to the Hugging Face team for putting this together.
Letโ€™s build safe and useful agents!

ยท