AlejandroOlmedo
/

DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-4bit-mlx

Text Generation

Generated from Trainer

text-generation-inference

4-bit precision

Model card Files Files and versions Community

DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-4bit-mlx

Ctrl+K

Ctrl+K

1 contributor

History: 13 commits

AlejandroOlmedo's picture

AlejandroOlmedo

Update README.md

9719729 verified 3 months ago

.gitattributes

1.57 kB

Upload tokenizer.json with huggingface_hub 4 months ago
README.md

2.82 kB

Update README.md 3 months ago
config.json

917 Bytes

Upload config.json with huggingface_hub 4 months ago
model.safetensors

4.28 GB
LFS

Upload model.safetensors with huggingface_hub 4 months ago
model.safetensors.index.json

51.7 kB

Upload model.safetensors.index.json with huggingface_hub 4 months ago
special_tokens_map.json

485 Bytes

Upload special_tokens_map.json with huggingface_hub 4 months ago
tokenizer.json

11.4 MB
LFS

Upload tokenizer.json with huggingface_hub 4 months ago
tokenizer_config.json

6.86 kB

Upload tokenizer_config.json with huggingface_hub 4 months ago