ghostplant
AI & ML interests
None yet
Recent Activity
updated
a dataset
2 days ago
ghostplant/data-collections
liked
a dataset
16 days ago
future-technologies/Universal-Transformers-Dataset
Organizations
None yet
ghostplant's activity
Does R1 support long context (> 4K)?
#172 opened 2 months ago
by
ghostplant
Can this model run on a Hopper GPU?
6
#8 opened 2 months ago
by
simonlindelta

Can this model run on an A800?
2
#10 opened about 2 months ago
by
wang35
Why not use FP2 or IQ2 as kTransformers does?
#11 opened about 2 months ago
by
ghostplant
Deploying production ready service with Unsloth GGUF quants on your AWS account. (4 x L40S)
2
8
#171 opened 2 months ago
by
samagra-tensorfuse
90+ tokens per second for MI300x8 using batch_size = 1
1
#166 opened 3 months ago
by
ghostplant
Which is better, Q2_K_XL or Q4?
3
#34 opened 3 months ago
by
jializou

So how much VRAM is needed to deploy a 671B model, and is there a baseline hardware configuration?
27
#118 opened 3 months ago
by
cena163

How much vram do you need?
8
#12 opened 3 months ago
by
hyun10
Is there a model removing non-shared MoE experts?
4
#17 opened 3 months ago
by
ghostplant
Please convert these models to GGUF format...
2
5
#12 opened 4 months ago
by
Moodym