Qwen

Enterprise

company

https://qwen.ai/

alibaba_qwen

QwenLM

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

yangapku updated a collection about 7 hours ago

Qwen3

yangapku updated a collection about 7 hours ago

Qwen3

yangapku updated a collection about 7 hours ago

Qwen3

View all activity

Qwen's activity

yangapku

updated a collection about 7 hours ago

Qwen3

Collection

37 items • Updated about 7 hours ago • 547

hzhwcmhf

updated 2 models about 19 hours ago

Qwen/Qwen3-32B-AWQ

Text Generation • Updated about 19 hours ago • 10.5k • 40

Qwen/Qwen3-14B-AWQ

Text Generation • Updated about 19 hours ago • 5.53k • 19

psyangqi

published a dataset 2 days ago

Qwen/PolyMath

Viewer • Updated 10 days ago • 9k • 406 • 12

danielhanchen

posted an update 8 days ago

Post

1558

💜 Qwen3 128K Context Length: We've released Dynamic 2.0 GGUFs + 4-bit safetensors!
Fixed: Now works on any inference engine and fixed issues with the chat template.
Qwen3 GGUFs:
30B-A3B: unsloth/Qwen3-30B-A3B-GGUF
235-A22B: unsloth/Qwen3-235B-A22B-GGUF
32B: unsloth/Qwen3-32B-GGUF

Read our guide on running Qwen3 here: https://docs.unsloth.ai/basics/qwen3-how-to-run-and-finetune

128K Context Length:
30B-A3B: unsloth/Qwen3-30B-A3B-128K-GGUF
235-A22B: unsloth/Qwen3-235B-A22B-128K-GGUF
32B: unsloth/Qwen3-32B-128K-GGUF

All Qwen3 uploads: unsloth/qwen3-680edabfb790c8c34a242f95

Yingda

authored 3 papers 9 days ago

ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction

Paper • 2412.12888 • Published Dec 17, 2024

EliGen: Entity-Level Controlled Image Generation with Regional Attention

Paper • 2501.01097 • Published Jan 2

Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs

Paper • 2504.17432 • Published 14 days ago • 38

danielhanchen

posted an update 14 days ago

Post

5703

🦥 Introducing Unsloth Dynamic v2.0 GGUFs!
Our v2.0 quants set new benchmarks on 5-shot MMLU and KL Divergence, meaning you can now run & fine-tune quantized LLMs while preserving as much accuracy as possible.

Llama 4: unsloth/Llama-4-Scout-17B-16E-Instruct-GGUF
DeepSeek-R1: unsloth/DeepSeek-R1-GGUF-UD
Gemma 3: unsloth/gemma-3-27b-it-GGUF

We made selective layer quantization much smarter. Instead of modifying only a subset of layers, we now dynamically quantize all layers so every layer has a different bit. Now, our dynamic method can be applied to all LLM architectures, not just MoE's.

Blog with Details: https://docs.unsloth.ai/basics/dynamic-v2.0

All our future GGUF uploads will leverage Dynamic 2.0 and our hand curated 300K–1.5M token calibration dataset to improve conversational chat performance.

For accurate benchmarking, we built an evaluation framework to match the reported 5-shot MMLU scores of Llama 4 and Gemma 3. This allowed apples-to-apples comparisons between full-precision vs. Dynamic v2.0, QAT and standard iMatrix quants.

Dynamic v2.0 aims to minimize the performance gap between full-precision models and their quantized counterparts.

bartowski

posted an update 23 days ago

Post

20210

Access requests enabled for latest GLM models

While a fix is being implemented (https://github.com/ggml-org/llama.cpp/pull/12957) I want to leave the models up for visibility and continued discussion, but want to prevent accidental downloads of known broken models (even though there are settings that could fix it at runtime for now)

With this goal, I've enabled access requests. I don't really want your data, so I'm sorry that I don't think there's a way around that? But that's what I'm gonna do for now, and I'll remove the gate when a fix is up and verified and I have a chance to re-convert and quantize!

Hope you don't mind in the mean time :D

1 reply

yuchenlin

authored a paper 29 days ago

CrossWordBench: Evaluating the Reasoning Capabilities of LLMs and LVLMs with Controllable Puzzle Generation

Paper • 2504.00043 • Published Mar 30 • 9

AI & ML interests

Recent Activity

Team members 137

Qwen's activity