Spaces: nhop/L3Score
Niklas Hoepner committed 19 days ago

Commit

8f7a170

1 Parent(s): bba3d69

Back to old gradio sdk

Files changed (1)

README.md

@@ -13,12 +13,12 @@ description: >
   It uses log-probabilities of "Yes"/"No" tokens from a language model acting as a judge.
   Based on the SPIQA benchmark: https://arxiv.org/pdf/2407.09413
 sdk: gradio
-sdk_version: 4.44.1
+sdk_version: 3.19.1
 app_file: app.py
 pinned: false
 ---
-# Metric Card: L3Score
+# 🦢 Metric Card: L3Score
 ## 📌 Description
@@ -37,7 +37,7 @@ Answer in one word - Yes or No.
 The model's **log-probabilities** for "Yes" and "No" tokens are used to compute the score.
-### 🧮  Score Calculation
+### 🧮  Scoring Logic
 Let $ l_{\text{yes}}$ and $ l_{\text{no}}$ be the log-probabilities of "Yes" and "No", respectively.
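
The diff context ends before the README states the formula, so the following is only a minimal sketch of how a score could be derived from the two log-probabilities. It assumes the score is the probability of "Yes" renormalised over the two tokens, i.e. $\exp(l_{\text{yes}}) / (\exp(l_{\text{yes}}) + \exp(l_{\text{no}}))$, which may not cover every case the actual metric handles (for example, when one of the tokens is missing from the judge's returned log-probabilities). The function name `l3score_two_token` is hypothetical and not taken from the repository.

```python
import math


def l3score_two_token(l_yes: float, l_no: float) -> float:
    """Probability of "Yes" renormalised over the "Yes" and "No" tokens.

    Assumption: both log-probabilities are available from the judge model.
    """
    # Shift by the larger log-probability so exp() cannot underflow to 0/0.
    shift = max(l_yes, l_no)
    p_yes = math.exp(l_yes - shift)
    p_no = math.exp(l_no - shift)
    return p_yes / (p_yes + p_no)


if __name__ == "__main__":
    # A judge that puts probability 0.6 on "Yes" and 0.2 on "No".
    print(l3score_two_token(math.log(0.6), math.log(0.2)))
```

Under this assumption the result always lies between 0 and 1, and equal log-probabilities give 0.5.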