grg's picture
update tags in README
64725c7
|
raw
history blame contribute delete
367 Bytes
---
title: Stick To Your Role! Leaderboard
emoji: 🎭
colorFrom: gray
colorTo: purple
sdk: docker
pinned: false
license: mit
short_description: Benchmarking LLMs on the stability of simulated populations
tags:
- leaderboard
- benchmark
- roleplay
- value stability
- modality:text
- test:public
- language:english
---
# Stick To Your Role! Leaderboard