New discussion

pad output?

#63 opened 2 days ago by
Juniworld

Update README.md

#58 opened 11 days ago by
denizzhansahin

Inference speed slow?

1
2
#56 opened 13 days ago by
banank1989

Update README.md

#50 opened 25 days ago by
beita6969

SigLIP or SigLIP2 encoder?

7
#48 opened 26 days ago by
orrzohar

Tokens generated per second

1
3
#39 opened about 1 month ago by
rameshch

Image tokenization

4
#38 opened about 1 month ago by
Marbuel

TPU inference

1
#37 opened about 1 month ago by
Arpit-Bansal

How to use the tokenizer of gemma-3

1
#36 opened about 1 month ago by
rishav09

please release AWQ version

2
2
#31 opened about 1 month ago by
classdemo

evals (PT vs IT)

1
1
#30 opened about 1 month ago by
erichartford

Provide an access token

2
#29 opened about 1 month ago by
Tirth09

Hio

#27 opened about 1 month ago by
Azeliox

CUDA error: CUBLAS_STATUS_NOT_SUPPORTED

#25 opened about 1 month ago by
surak

Function calling with Gemma 3

5
#24 opened about 2 months ago by
jasonisaac

can not run with vllm

6
#23 opened about 2 months ago by
tankpigg

Simple questions too hard?

2
8
#22 opened about 2 months ago by
urtuuuu

awq version

8
3
#20 opened about 2 months ago by
rastegar

Adds `vocab_size` to config.json

2
#18 opened about 2 months ago by
clowman

Languages list

1
1
#16 opened about 2 months ago by
averoo

Fail to Load Gemme3 27B

1
#14 opened about 2 months ago by
crm-ai

License

2
#13 opened about 2 months ago by
mrfakename

Quantization

1
#12 opened about 2 months ago by
Zelyanoth

Meet this error

7
5
#11 opened about 2 months ago by
crm-ai

Anybody thinking what I'm thinking?

2
2
#10 opened about 2 months ago by
darkc0de