Spaces:
Running
Running
Commit
·
a0648f8
1
Parent(s):
68395b5
Update app.py
Browse files
app.py
CHANGED
@@ -209,7 +209,7 @@ with demo:
|
|
209 |
gr.Markdown(
|
210 |
"#### LLMs are pretty good at reporting their uncertainty. We just need to ask the right way.")
|
211 |
gr.Markdown("Using our uncertainty metric informed by applying causal inference techniques in \
|
212 |
-
[
|
213 |
we are able to identify likely spurious correlations and exploit them in \
|
214 |
the scenario of gender underspecified tasks. (Note that introspecting softmax probabilities alone is insufficient, as in the sentences \
|
215 |
below, LLMs may report a softmax prob of ~0.9 despite the task being underspecified.)")
|
|
|
209 |
gr.Markdown(
|
210 |
"#### LLMs are pretty good at reporting their uncertainty. We just need to ask the right way.")
|
211 |
gr.Markdown("Using our uncertainty metric informed by applying causal inference techniques in \
|
212 |
+
[our ICLR paper under review](https://openreview.net/pdf?id=25VgHaPz0l4), \
|
213 |
we are able to identify likely spurious correlations and exploit them in \
|
214 |
the scenario of gender underspecified tasks. (Note that introspecting softmax probabilities alone is insufficient, as in the sentences \
|
215 |
below, LLMs may report a softmax prob of ~0.9 despite the task being underspecified.)")
|