Arthur Zucker's picture

Arthur Zucker

ArthurZ

·

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

google/gemma-3-4b-it

liked a Space 3 days ago

coreml-projects/transformers-to-coreml

liked a model 4 days ago

optimum-internal-testing/tiny-random-llama

View all activity

Organizations

ArthurZ's activity

New activity in meta-llama/Llama-4-Maverick-17B-128E-Instruct 20 days ago

remove <|finetune_right_pad_id|> and change pad_token to <|finetune_right_pad|>

#25 opened 21 days ago by

New activity in meta-llama/Llama-4-Scout-17B-16E-Instruct 21 days ago

pad error

#25 opened 22 days ago by

Bug in AutoModel

#26 opened 22 days ago by

New activity in meta-llama/Llama-4-Scout-17B-16E 21 days ago

Cannot generate with BS > 1

#25 opened 21 days ago by

New activity in meta-llama/Llama-4-Maverick-17B-128E-Instruct 23 days ago

change to spda

#14 opened 23 days ago by

New activity in mistral-community/pixtral-12b 3 months ago

Fastest way for inference?

#28 opened 3 months ago by

New activity in deepseek-ai/DeepSeek-R1 3 months ago

model-00078-of-000163.safetensors not marked safe?

#80 opened 3 months ago by

New activity in mistralai/Pixtral-Large-Instruct-2411 5 months ago

Upload transformers version

#3 opened 5 months ago by

New activity in huggingface/documentation-images 5 months ago

Upload Meta-Llama-3-8B-Instruct, seqlen = 512, python, w_ compile.png

#392 opened 5 months ago by

New activity in mistral-community/pixtral-12b 6 months ago

Update model weight

#13 opened 6 months ago by

Update hidden_act to silu

#14 opened 6 months ago by

New activity in rhymes-ai/Aria 7 months ago

llama.cpp support

#1 opened 7 months ago by

New activity in google/gemma-2-2b-jpn-it 7 months ago

tokenizer_config.json is different from gemma-2-2b-it

#8 opened 7 months ago by

New activity in mistral-community/pixtral-12b 7 months ago

How can i use the full 24GB model instead of this separated safetensors files?

#8 opened 7 months ago by

New activity in meta-llama/Llama-3.2-11B-Vision-Instruct 7 months ago

hidden_activation vs hidden_act in config.json

#10 opened 7 months ago by

New activity in mistral-community/pixtral-12b-240910 7 months ago

How to use safetensors?

#13 opened 7 months ago by

New activity in mistral-community/pixtral-12b 7 months ago

lamma cpp ht to gguf not working

#2 opened 7 months ago by

New activity in meta-llama/Llama-3.1-405B-Instruct-FP8 9 months ago

8-kv-heads

#14 opened 9 months ago by

New activity in meta-llama/Llama-3.1-405B-FP8 9 months ago

Update config.json

#17 opened 9 months ago by

Config KV Heads should be 8 now?

#16 opened 9 months ago by