Resources

View closed (1)

ds-v2-chat

#17 opened 10 days ago by

Elon7111

Dddv

#16 opened 5 months ago by

Hxnnsns

NAN issue using FP16 to load the model

#15 opened 6 months ago by

joeltseng

ImportError: This modeling file requires the following packages that were not found in your environment: flash_attn. Run `pip install flash_attn`

#14 opened 10 months ago by

kang1

How much memory is needed if you make the 128k context length

#13 opened 11 months ago by

ggbondcxk

Implement MLA inference optimizations to DeepseekV2Attention

#12 opened 11 months ago by

sy-chen

Can you provide a sample code for training with DeepSpeed ZeRO3?

#10 opened 12 months ago by

SupercarryNg

Ollama support

#9 opened 12 months ago by

Dao3

MoE offloading strategy？

#8 opened 12 months ago by

Minami-su

Update README.md

#7 opened 12 months ago by

VanishingPsychopath

kv cache

#6 opened 12 months ago by

FrankWu

function/tool calling support

#5 opened about 1 year ago by

kaijietti

fail to run the example

#4 opened about 1 year ago by

Leymore

GPTQ plz

#3 opened about 1 year ago by

Parkerlambert123

vllm support

#2 opened about 1 year ago by

Sihangli

llama.cpp support

#1 opened about 1 year ago by

cpumaxx