NousResearch/DeepHermes-Egregore-v2-RLAIF-8b-Atropos-GGUF Reinforcement Learning • Updated 3 days ago • 77 • 1