Hugging Face
Models: 526
Active filters: fp8
Sort: Trending
tflsxyy/DeepSeek-V3-0324-MoE-Pruner-E192 • Updated Mar 29
fastllm/DeepSeek-R1-INT4 • Updated 28 days ago • 33
k-l-lambda/DeepSeek-V3-FP4 • Updated Apr 1 • 1
k-l-lambda/DeepSeek-V3-0324-FP4 • Updated Apr 2 • 5
chwan/DeepSeek-V3-5layer • Text Generation • Updated Apr 4 • 26.6k
GreenBitAI/DeepSeek-R1-671B-layer-wise-bpw-4.0 • Updated 7 days ago • 5
future-technologies/Optimized-DeepSeek-V3-0324 • Text Generation • Updated 11 days ago • 47 • 2
gaunernst/DeepSeek-V2-Lite-Chat-FP8 • Updated Apr 7 • 223
Alhdrawi/R-Ray-Ai-model • Text Generation • Updated about 1 month ago • 10
otherhalf-dev/Qwen2.5-14B-Instruct-abliterated-v2-FP8 • Updated 29 days ago • 50
cloud19/Pantheon-RP-1.8-24b-Small-3.1-FP8 • Updated 28 days ago • 12
cloud19/SAINEMO-reMIX-FP8 • Updated 28 days ago • 6
cloud19/QwQ-32B-ArliAI-RpR-v1-FP8-Dynamic • Updated 25 days ago • 7
cloud19/Qwen2.5-32B-ArliAI-RPMax-v1.3-FP8-Dynamic • Updated 23 days ago • 5
yejingfu/Captain-Eris_Violet-V0.420-12B-FP8 • Updated 19 days ago • 40.8k
baseten/DeepSeek-V3-FP4 • Updated 15 days ago • 501
jobs-git/DeepSeek-V3-0324 • Text Generation • Updated 13 days ago • 2
superbigtree/Mistral-Nemo-Instruct-2407-FP8_sglang • Updated 13 days ago • 3
GreenBitAI/DeepSeek-R1-671B-layer-mix-bpw-4.0-mlx • Updated 10 days ago • 104
anq/r1_fake_int4 • Updated 11 days ago • 8
unsloth/Qwen3-0.6B-FP8 • Updated 10 days ago • 37
unsloth/Qwen3-1.7B-FP8 • Updated 10 days ago • 32
unsloth/Qwen3-4B-FP8 • Updated 10 days ago • 43
unsloth/Qwen3-8B-FP8 • Updated 10 days ago • 63
enferAI/Mistral-7B-Instruct-v0.3-FP8 • Updated 10 days ago
michaelfeil/Qwen3-4B-FP8 • Updated 10 days ago
pedalnomica/Qwen3-235B-A22B-FP8 • Text Generation • Updated 9 days ago • 16
pedalnomica/Qwen3-32B-FP8 • Text Generation • Updated 9 days ago • 39
qwen-community/Qwen3-235B-A22B-FP8 • Text Generation • Updated 9 days ago • 13
qwen-community/Qwen3-32B-FP8 • Text Generation • Updated 9 days ago • 9