Prokhorov

Maverick17

AI & ML interests

None yet

Recent Activity

new activity 9 days ago

Qwen/Qwen3-30B-A3B:Waiting for the Qwen3-VL

new activity about 1 month ago

unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth-dynamic-bnb-4bit:OOM on 2xH100

Maverick17's activity

New activity in Qwen/Qwen3-30B-A3B 9 days ago

Waiting for the Qwen3-VL

#8 opened 9 days ago by

New activity in unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth-dynamic-bnb-4bit about 1 month ago

OOM on 2xH100

#3 opened about 1 month ago by

New activity in AskUI/PTA-1 2 months ago

Dataset and hyperparameters for training

#3 opened 2 months ago by

New activity in osunlp/UGround-V1-Data 3 months ago

Null byte error

#2 opened 3 months ago by

New activity in OPEA/deepseek-vl2-int4-sym-gptq-inc 4 months ago

Exception: data did not match any variant of untagged enum ModelWrapper at line 646524 column 3

#2 opened 4 months ago by

ValueError: Invalid modules, at least two modules detected as dependent, {shortest_module} and {longest_module}

#3 opened 4 months ago by

New activity in OS-Copilot/OS-Atlas-Pro-7B 5 months ago

How is the Agent supposed to work?

#2 opened 5 months ago by

New activity in allenai/Molmo-7B-D-0924 5 months ago

Text -> Point -> Segmentation

#30 opened 6 months ago by

New activity in showlab/ShowUI-2B 5 months ago

Agent Loop

#6 opened 5 months ago by

New activity in allenai/Molmo-7B-D-0924 5 months ago

How to finetune using DPO?

#31 opened 6 months ago by

How should I extract attention maps? Can you provide a specific example?

#33 opened 6 months ago by

New activity in allenai/Molmo-7B-D-0924 6 months ago

Any plans on when vllm will be supported?

#26 opened 7 months ago by

New activity in Maverick17/idefics3-llama-gui-dense-descriptions 6 months ago

Using truncation in idefics3 processor

#1 opened 6 months ago by

New activity in OpenGVLab/InternVL2-Llama3-76B 9 months ago

Expected all tensors to be on the same device, but found at least two devices, cuda:2 and cuda:0!

#6 opened 10 months ago by

New activity in HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1 about 1 year ago

Token indices sequence length is longer than the specified maximum sequence length for this model (4645 > 2048)

#5 opened about 1 year ago by

New activity in utahnlp/robertabase-structured-tuning-srl-conll2012 over 1 year ago

Usage

#1 opened almost 2 years ago by
