Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
2
Inference Providers
Select all
Cerebras
Fireworks
Replicate
Novita
fal
Together AI
Nebius AI Studio
Cohere
SambaNova
Hyperbolic
HF Inference API
Misc
Reset Misc
cpo
trl
Inference Endpoints
text-generation-inference
4-bit precision
custom_code
Eval Results
Misc with no match
Merge
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
120
Full-text search
Edit filters
Sort: Trending
Active filters:
cpo, trl
Clear all
Aratako/Llama-Gemma-2-27b-CPO_SimPO-iter1
Text Generation
•
Updated
Dec 15, 2024
•
2
•
1
Aratako/Llama-Gemma-2-27b-CPO_SimPO-iter2
Text Generation
•
Updated
Dec 16, 2024
•
2
•
1
Aratako/gemma-2-2b-axolotl-simpo-v1.0
Text Generation
•
Updated
Dec 10, 2024
•
2
Aratako/gemma-2-2b-axolotl-simpo-v1.0-merged
Text Generation
•
Updated
Dec 10, 2024
•
3
•
1
mradermacher/gemma-2-2b-axolotl-simpo-v1.0-merged-GGUF
Updated
Dec 11, 2024
•
14
mjhamar/Meta-Llama-3.1-8B-Instruct-cpo-beir
Text Generation
•
Updated
Dec 12, 2024
•
7
mjhamar/Meta-Llama-3.1-8B-Instruct-cpo-beir-4b
Text Generation
•
Updated
Dec 11, 2024
•
6
mradermacher/waka-0.5b-simpo-GGUF
Updated
Dec 26, 2024
•
13
williamlcn/simpotest
Text Generation
•
Updated
Feb 26
•
3
braginpawel/deepseek-14b-simpo-400ex-10ep-7th_iteration-merged
Text Generation
•
Updated
Mar 3
•
2
williamlcn/34337_simpo
Text Generation
•
Updated
Mar 5
•
2
williamlcn/34337_simpo_ds_notcot
Text Generation
•
Updated
Mar 5
•
4
williamlcn/34337_simpo2
Text Generation
•
Updated
Mar 6
•
3
braginpawel/deepseek-14b-simpo-2976ex-5ep-8th_iteration-merged
Text Generation
•
Updated
Mar 7
•
5
braginpawel/deepseek-14b-simpo-3376ex-4ep-9th_iteration-merged
Text Generation
•
Updated
Mar 8
•
3
mradermacher/Llama-Gemma-2-27b-CPO_SimPO-iter2-GGUF
Updated
Mar 9
•
37
mradermacher/Llama-Gemma-2-27b-CPO_SimPO-iter2-i1-GGUF
Updated
Mar 9
•
70
williamlcn/17718_simpo_16_1
Text Generation
•
Updated
Mar 9
•
3
williamlcn/17718_simpo_16_1_05e
Text Generation
•
Updated
Mar 10
•
5
williamlcn/17718_simpo_16_1_1e
Text Generation
•
Updated
Mar 10
•
3
mradermacher/Meta-Llama-3.1-8B-Instruct-cpo-beir-GGUF
Updated
Mar 13
•
145
williamlcn/17718_simpo_32_16_05e
Text Generation
•
Updated
Mar 13
•
5
williamlcn/17718_simpo_64_32_05e
Text Generation
•
Updated
Mar 13
•
3
williamlcn/17718_simpo_32_16_05e_0317
Text Generation
•
Updated
Mar 17
•
3
nomadrp/msimpo-10each
Updated
Mar 21
nomadrp/msimpo-10each-v1
Updated
Mar 22
nomadrp/cpo-simpo-10each
Updated
Mar 23
FluxiIA/Tucaninho_simpo
Text Generation
•
Updated
about 1 month ago
•
6
mradermacher/Tucaninho_simpo-GGUF
Updated
Mar 25
•
127
nomadrp/msimpo-30each-v1
Updated
Mar 27
Previous
1
2
3
4
Next