Active filters: rlhf
sileod/deberta-v3-base-tasksource-nli • Zero-Shot Classification • Updated • 182k • 124
sileod/deberta-v3-large-tasksource-nli • Zero-Shot Classification • Updated • 5.34k • 36
sileod/mdeberta-v3-base-tasksource-nli • Zero-Shot Classification • Updated • 36 • 18
PKU-Alignment/beaver-7b-v1.0 • Reinforcement Learning • Updated • 35 • 11
PKU-Alignment/beaver-dam-7b • Updated • 2.33k • 8
fnlp/moss-rlhf-policy-model-7B-en
lightonai/alfred-40b-0723 • Text Generation • Updated • 11 • 46
TheBloke/NeuralHermes-2.5-Mistral-7B-GGUF • Updated • 1.42k • 51
simonveitner/MathHermes-2.5-Mistral-7B • Text Generation • Updated • 24 • 1
joey00072/ToxicHermes-2.5-Mistral-7B • Text Generation • Updated • 35 • 21
argilla/distilabeled-OpenHermes-2.5-Mistral-7B • Text Generation • Updated • 12 • 31
mlabonne/NeuralBeagle14-7B • Text Generation • Updated • 71 • 158
mlabonne/NeuralBeagle14-7B-GGUF • Updated • 718 • 47
argilla/CapybaraHermes-2.5-Mistral-7B • Updated • 23 • 69
tasksource/deberta-small-long-nli • Zero-Shot Classification • Updated • 40.9k • 42
TheBloke/CapybaraHermes-2.5-Mistral-7B-GGUF • Updated • 7.3k • 111
TheBloke/CapybaraHermes-2.5-Mistral-7B-GPTQ • Updated • 563 • 57
mradermacher/distilabeled-Hermes-2.5-Mistral-7B-GGUF • Updated • 42 • 1
mradermacher/beaver-7b-v3.0-GGUF • Reinforcement Learning • Updated • 405 • 1
stanfordnlp/SteamSHP-flan-t5-xl • Text2Text Generation • Updated • 42 • 43
stanfordnlp/SteamSHP-flan-t5-large • Text2Text Generation • Updated • 34 • 33
trl-lib/llama-7b-se-peft
sileod/deberta-v3-large-tasksource-rlhf-reward-model • Text Classification • Updated • 70 • 11
trl-lib/llama-7b-se-rl-peft • Updated • 103
trl-lib/llama-7b-se-rm-peft
toloka/gpt2-large-rl-prompt-writing • Text Generation • Updated • 13 • 3
AdamG012/chat-opt-1.3b-rlhf-actor-deepspeed • Text Generation • Updated • 17 • 5
AdamG012/chat-opt-1.3b-rlhf-critic-deepspeed • Text Generation • Updated • 11 • 3
AdamG012/chat-opt-1.3b-rlhf-actor-ema-deepspeed • Text Generation • Updated • 7 • 8
agi-css/socially-good-lm • Text Generation • Updated • 20 • 5