Commit 68395b5
1 Parent(s): d4b589f
Update app.py
Update paper link in UI
app.py CHANGED
@@ -209,7 +209,7 @@ with demo:
 gr.Markdown(
 "#### LLMs are pretty good at reporting their uncertainty. We just need to ask the right way.")
 gr.Markdown("Using our uncertainty metric informed by applying causal inference techniques in \
-[Selection
+[Exploiting Selection Bias on Underspecified Tasks in Large Language Models](https://arxiv.org/abs/2210.00131), \
 we are able to identify likely spurious correlations and exploit them in \
 the scenario of gender underspecified tasks. (Note that introspecting softmax probabilities alone is insufficient, as in the sentences \
 below, LLMs may report a softmax prob of ~0.9 despite the task being underspecified.)")
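The changed line sits inside a string literal that is continued with trailing backslashes, so the way the pieces join matters for how the link renders. A minimal sketch of that joining behavior, assuming the text of the "+" side of the hunk; the name `description` is hypothetical, the leading whitespace of each continued line is an assumption, and the closing parenthetical note is left out for brevity:

```python
# Sketch only: mirrors the "+" side of the hunk. `description` is a
# hypothetical name; in app.py the literal is passed to gr.Markdown(...).
description = "Using our uncertainty metric informed by applying causal inference techniques in \
[Exploiting Selection Bias on Underspecified Tasks in Large Language Models](https://arxiv.org/abs/2210.00131), \
we are able to identify likely spurious correlations and exploit them in \
the scenario of gender underspecified tasks."

# A backslash at the end of a line inside a string literal joins the two
# source lines without inserting a newline, so the Markdown link and the
# surrounding sentence stay in a single paragraph.
print(description)
```

Any indentation placed before a continued line would become part of the string, which is one reason the link line needs its own trailing `, \` to keep the sentence intact.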