Active filters: dpo, trl
RyanYr/reflect_ministral8Bit_om2_sft-t2_lr.5-6_dpo-t2 • Text Generation • Updated • 4
QinLiuNLP/llama3-sudo-dpo-3epochs-medical-1e-5
danushkhanna/results • Text Generation • Updated • 5
AmberYifan/Mistral-7B-v0.1-sft-gen-dpo-10k • Text Generation • Updated • 2
AmberYifan/Mistral-7B-v0.1-sft-dpo-10k • Text Generation • Updated • 2
AmberYifan/Mistral-7B-v0.1-sft-spin-10k • Text Generation • Updated • 2
AmberYifan/Llama-2-7b-sft-spin-10k • Text Generation • Updated • 3
double-ai/DPO_AV_sigmoid_0.15_8f891f01b04bd9d8_checkpoint-154_2024-11-25_10-33-10_94ff3b49e75cec48 • Updated
juniorVision/lora_eager_Llama-3.1-Tulu-3-8B-SFT_nhn-dpo-v4-hq • Updated
Jise/flan-t5-hh-dpo-lora • Text2Text Generation • Updated • 5
Shahradmz/OLMo-1B-hf-DPO-constitution-1 • Updated
Avvvvva/M2 • Updated
Avvvvva/M3 • Updated
AmberYifan/Llama-2-7b-sft-gen-dpo-10k • Text Generation • Updated • 2
Avvvvva/M2-PairRM • Updated
Avvvvva/M3-PairRM • Updated
AmberYifan/Llama-2-7b-sft-dpo-10k • Text Generation • Updated • 2
Setpember/HH_pythia_DPO_props_epi_1 • Text Generation • Updated • 4
Setpember/HH_pythia_DPO_props_epi_2 • Text Generation • Updated • 4
Setpember/HH_pythia_DPO_props_epi_point5 • Text Generation • Updated • 4
Setpember/HH_pythia_DPO_props_epi_point1 • Text Generation • Updated • 4
HuggingFaceTB/SmolVLM-Instruct-DPO • Image-Text-to-Text • Updated • 176 • 19
yosefw/llama-3.2-180m-amharic-instruct-apo-2 • Text Generation • Updated • 2
Setpember/HH_GPT2_DPO_props_epi_point1 • Text Generation • Updated • 6
Setpember/HH_GPT2_DPO_props_epi_point5 • Text Generation • Updated • 6
Setpember/HH_GPT2_DPO_props_epi_1 • Text Generation • Updated • 6
Setpember/HH_GPT2_DPO_props_epi_2 • Text Generation • Updated • 6
yosefw/llama-3.2-180m-amharic-instruct-dpo • Text Generation • Updated • 2
tongliuphysics/zephyr-7b-ultra-p-0.01 • Text Generation • Updated • 4
tongliuphysics/zephyr-7b-ultra-p-0.04 • Text Generation • Updated • 4