Arthur Zucker
ArthurZ
AI & ML interests
None yet
Recent Activity
liked
a model
2 days ago
google/gemma-3-4b-it
liked
a Space
3 days ago
coreml-projects/transformers-to-coreml
liked
a model
4 days ago
optimum-internal-testing/tiny-random-llama
Organizations
ArthurZ's activity
remove <|finetune_right_pad_id|> and change pad_token to <|finetune_right_pad|>
1
#25 opened 21 days ago
by
wukaixingxp

pad error
7
8
#25 opened 22 days ago
by
bobber
Bug in AutoModel
1
3
#26 opened 22 days ago
by
random-checkin

Cannot generate with BS > 1
1
#25 opened 21 days ago
by
chenjiel
change to spda
2
#14 opened 23 days ago
by
wukaixingxp

Fastest way for inference?
3
#28 opened 3 months ago
by
psycy
model-00078-of-000163.safetensors not marked safe?
2
#80 opened 3 months ago
by
aborst

Upload transformers version
10
#3 opened 5 months ago
by
ArthurZ

Upload Meta-Llama-3-8B-Instruct, seqlen = 512, python, w_ compile.png
1
#392 opened 5 months ago
by
kwen2501
Update model weight
8
#13 opened 6 months ago
by
nguyen-brat
Update hidden_act to silu
2
#14 opened 6 months ago
by
ArthurZ

llama.cpp support
11
9
#1 opened 7 months ago
by
ayyylol

tokenizer_config.json is different from gemma-2-2b-it
2
#8 opened 7 months ago
by
dahara1
How can i use the full 24GB model instead of this separated safetensors files?
1
#8 opened 7 months ago
by
Valadaro
hidden_activation vs hidden_act in config.json
2
#10 opened 7 months ago
by
heheda
How to use safetensors?
2
#13 opened 7 months ago
by
prathi1729
lamma cpp ht to gguf not working
4
#2 opened 7 months ago
by
RameshRajamani
8-kv-heads
5
8
#14 opened 9 months ago
by
ArthurZ

Update config.json
#17 opened 9 months ago
by
ArthurZ

Config KV Heads should be 8 now?
1
#16 opened 9 months ago
by
tanmaylaud
