Commit History

Update app.py
7735526
Running

zhiminy commited on

Update app.py
1da2cba

zhiminy commited on

add Conversation Efficiency Index
deaae24

zhiminy commited on

Update app.py
17322b6

zhiminy commited on

Update app.py
946d041

zhiminy commited on

Update app.py
264fef1

zhiminy commited on

add qwen3
7ad0805

zhiminy commited on

make each round of chat colorful
6c27b5c

zhiminy commited on

Update app.py
5c0f8f0

zhiminy commited on

rename it FM4SE Leaderboard
9a67f03

zhiminy commited on

Update README.md
065faaf

zhiminy commited on

Update app.py
adfb223

zhiminy commited on

debug mode
dc8f79a

zhiminy commited on

Update README.md
46eba93

zhiminy commited on

Update app.py
0683204

zhiminy commited on

Update app.py
10d942e

zhiminy commited on

remove an obsolete model
ad40aea

zhiminy commited on

Update app.py
978f8be

zhiminy commited on

Update context_window.json
6156ee7

zhiminy commited on

Update context_window.json
ea88b1a

zhiminy commited on

add gpt-4.1, o4, o4-mini etc
c4ef19a

zhiminy commited on

Update app.py
1024d82

zhiminy commited on

use consistency score instead
5c3511e

zhiminy commited on

add instability score
19a995e

zhiminy commited on

Update app.py
1e0fb78

zhiminy commited on

add grok-3 and llama-4
49cb056

zhiminy commited on

remove obsolete models
c1d84c5

zhiminy commited on

add
55a2fb1

zhiminy commited on

Update context_window.json
520df9c

zhiminy commited on

Update app.py
5ce765e

zhiminy commited on

Update app.py
501e8a3

zhiminy commited on

add gemma-3
f59607e

zhiminy commited on

add claude thinking model
9bee307

zhiminy commited on

use gpt-4o-mini
229f990

zhiminy commited on

add guardrail for SE-related tasks
a5c143f

zhiminy commited on

Update app.py
3b4c3ae

zhiminy commited on

refine update logic
0f4d845

zhiminy commited on

change the leaderboard frequency up to 3 months
9371dc4

zhiminy commited on

Update README.md
daf78cf
unverified

zhiminy commited on

Update app.py
bc49057

zhiminy commited on

add grok models
ea8b371

zhiminy commited on

add mistral models
ab10f2f

zhiminy commited on

add new models
0caf7c8

zhiminy commited on

add deepseek-r1
61cd7b8

zhiminy commited on

update caption
ebff921

zhiminy commited on

remove temperature to use default instead
cd63a06

zhiminy commited on

refine this
541c881

zhiminy commited on

Update app.py
abec366

zhiminy commited on

Update app.py
24d120e

zhiminy commited on

fix the sync bug
b52baf5

zhiminy commited on