Hugging Face

Enterprise

company

Verified

https://huggingface.co

huggingface

Activity Feed

AI & ML interests

The AI community building the future.

Recent Activity

julien-c new activity 1 day ago

huggingface/HuggingDiscussions:[FEEDBACK] Local apps

julien-c new activity 1 day ago

huggingface/HuggingDiscussions:[FEEDBACK] Inference Providers

julien-c new activity 1 day ago

huggingface/the-no-branch-repo:Create README.md

View all activity

Articles

Yay! Organizations can now publish blog Articles

Jan 20

• 41

huggingface's activity

julien-c

in huggingface/HuggingDiscussions 1 day ago

[FEEDBACK] Local apps

#31 opened 11 months ago by

kramp

[FEEDBACK] Inference Providers

103

#49 opened 3 months ago by

julien-c

in huggingface/the-no-branch-repo 1 day ago

Create README.md

#1 opened 17 days ago by

Gajduk

julien-c

in huggingface/transformers-metadata 1 day ago

Create Sporty-Video-Download,cutting app

#4 opened 16 days ago by

joerg23

julien-c

in huggingface/brand-assets 1 day ago

Request for social media icon vector

#4 opened 4 months ago by

umarbutler

julien-c

in huggingface/label-files 1 day ago

Rename lvis-id2label.json to trees.json

#12 opened 25 days ago by

HARENDRAKDAD

Upload trees.json

#10 opened 25 days ago by

HARENDRAKDAD

julien-c

posted an update 2 days ago

Post

3203

BOOOOM: Today I'm dropping TINY AGENTS

the 50 lines of code Agent in Javascript 🔥

I spent the last few weeks working on this, so I hope you will like it.

I've been diving into MCP (Model Context Protocol) to understand what the hype was all about.

It is fairly simple, but still quite powerful: MCP is a standard API to expose sets of Tools that can be hooked to LLMs.

But while doing that, came my second realization:

Once you have a MCP Client, an Agent is literally just a while loop on top of it. 🤯

➡️ read it exclusively on the official HF blog: https://huggingface.co/blog/tiny-agents

1 reply

julien-c

updated a dataset 2 days ago

huggingface/documentation-images

Viewer • Updated 2 days ago • 52 • 2.95M • 60

pagezyhf

updated a dataset 2 days ago

huggingface/documentation-images

Viewer • Updated 2 days ago • 52 • 2.95M • 60

merve

posted an update 3 days ago

Post

2403

Don't sleep on new AI at Meta Vision-Language release! 🔥

facebook/perception-encoder-67f977c9a65ca5895a7f6ba1
facebook/perception-lm-67f9783f171948c383ee7498

Meta dropped swiss army knives for vision with A2.0 license 👏
> image/video encoders for vision language modelling and spatial understanding (object detection etc) 👏
> The vision LM outperforms InternVL3 and Qwen2.5VL 👏
> They also release gigantic video and image datasets

The authors attempt to come up with single versatile vision encoder to align on diverse set of tasks.

They trained Perception Encoder (PE) Core: a new state-of-the-art family of vision encoders that can be aligned for both vision-language and spatial tasks. For zero-shot image tasks, it outperforms latest sota SigLIP2 👏

> Among fine-tuned ones, first one is PE-Spatial. It's a model to detect bounding boxes, segmentation, depth estimation and it outperforms all other models 😮

> Second one is PLM, Perception Language Model, where they combine PE-Core with Qwen2.5 LM 7B. it outperforms all other models (including InternVL3 which was trained with Qwen2.5LM too!)

The authors release the following checkpoints in sizes base, large and giant:

> 3 PE-Core checkpoints (224, 336, 448)
> 2 PE-Lang checkpoints (L, G)
> One PE-Spatial (G, 448)
> 3 PLM (1B, 3B, 8B)
> Datasets

Authors release following datasets 📑
> PE Video: Gigantic video datasete of 1M videos with 120k expert annotations ⏯️
> PLM-Video and PLM-Image: Human and auto-annotated image and video datasets on region-based tasks
> PLM-VideoBench: New video benchmark on MCQA

1 reply

medmekk

in huggingface/documentation-images 3 days ago

auto-round

#482 opened 3 days ago by

wenhuach

stevhliu

updated a dataset 3 days ago

huggingface/documentation-images

Viewer • Updated 2 days ago • 52 • 2.95M • 60

burtenshaw

posted an update 4 days ago

Post

1970

The rebooted LLM course starts today with an overhauled chapter 1 on Transformers:

👉 Follow the org to join the course:

huggingface-course

We’re starting from the foundations of modern generative AI by looking at transformers. This chapter is expanded in depth and features so contains new material like:

FREE and CERTIFIED exam on fundamentals of transformers
deeper exploration of transformer architectures and attention mechanisms
end -to-end exploration of inference strategies for prefill and decode steps

The course has leveled up in complexity and depth, so this a great time to join in if you want to build you own AI models.

fdaudens

posted an update 4 days ago

Post

1986

@thomwolf and @m-ric teaming up as the perfect instructor duo for DeepLearning.ai’s new course: Building Code Agents with Hugging Face smolagents!

https://www.deeplearning.ai/short-courses/building-code-agents-with-hugging-face-smolagents/

merve

posted an update 5 days ago

Post

2732

New foundation model on image and video captioning just dropped by NVIDIA AI 🔥

Describe Anything Model (DAM) is a 3B vision language model to generate detailed captions with localized references 😮

The team released the models, the dataset, a new benchmark and a demo 🤩 nvidia/describe-anything-680825bb8f5e41ff0785834c

Most of the vision LMs focus on image as a whole, lacking localized references in captions, and not taking in visual prompts (points, boxes, drawings around objects)

DAM addresses this on two levels: new vision backbone that takes in focal crops and the image itself, and a large scale dataset 👀

They generate a dataset by extending existing segmentation and referring expression generation datasets like REFCOCO, by passing in the images and classes to VLMs and generating captions.

Lastly, they also release a new benchmark again with self-supervision, they use an LLM to evaluate the detailed captions focusing on localization 👏

clem

posted an update 5 days ago

Post

3753

Energy is a massive constraint for AI but do you even know what energy your chatGPT convos are using?

We're trying to change this by releasing ChatUI-energy, the first interface where you see in real-time what energy your AI conversations consume. Great work from @jdelavande powered by spaces & TGI, available for a dozen of open-source models like Llama, Mistral, Qwen, Gemma and more.

jdelavande/chat-ui-energy

Should all chat interfaces have this? Just like ingredients have to be shown on products you buy, we need more transparency in AI for users!

3 replies

clem

posted an update 5 days ago

Post

2817

Just crossed half a million public apps on Hugging Face. A new public app is created every minute these days 🤯🤯🤯

What's your favorite? http://hf.co/spaces

3 replies

linoyts

posted an update 5 days ago

Post

2433

We just shipped HiDream Image LoRA fine-tuning to diffusers🧨

HiDream's sota capabilities (and mit license) bring a lot of potential to explore with fine-tunes 🔥

- more upgrades and features soon!
- code, weights and config example 👇

🧶Yarn art lora: linoyts/HiDream-yarn-art-LoRA
code: https://github.com/huggingface/diffusers/blob/main/examples/dreambooth/README_hidream.md

2 replies

pagezyhf

posted an update 5 days ago

Post

1869

If you haven't had the chance to test the latest open model from Meta, Llama 4 Maverick, go try it on AMD MI 300 on Hugging Face!

amd/llama4-maverick-17b-128e-mi-amd

AI & ML interests

Recent Activity

Articles

Yay! Organizations can now publish blog Articles

Team members 212

huggingface's activity

[FEEDBACK] Local apps

[FEEDBACK] Inference Providers

Create README.md

Create Sporty-Video-Download,cutting app

Request for social media icon vector

Rename lvis-id2label.json to trees.json

Upload trees.json

auto-round