Spaces:

emilylearning
/

llm_uncertainty

Running

emilylearning commited on Dec 7, 2022

Commit

058f4ca

1 Parent(s): 7234ad2

Update display text for paper link. Add TLDR instructions

Files changed (1) hide show

app.py CHANGED Viewed

@@ -209,7 +209,7 @@ with demo:
     gr.Markdown(
         "#### LLMs are pretty good at reporting their uncertainty. We just need to ask the right way.")
     gr.Markdown("Using our uncertainty metric informed by applying causal inference techniques in \
-        [Exploiting Selection Bias on Underspecified Tasks in Large Language Models](https://arxiv.org/abs/2210.00131 ), \
         we are able to identify likely spurious correlations and exploit them in \
         the scenario of gender underspecified tasks. (Note that introspecting softmax probabilities alone is insufficient, as in the sentences \
         below, LLMs may report a softmax prob of ~0.9 despite the task being underspecified.)")
@@ -218,7 +218,10 @@ with demo:
         eight syntactically similar sentences. However semantically, \
         only two of the sentences are gender-specified while the rest remain gender-underspecified")
     gr.Markdown("If a model can reliably tell us when it is uncertain about its predictions, one can replace only those uncertain predictions with\
-        an appropriate heuristic.")
     with gr.Row():
         model_name = gr.Radio(

     gr.Markdown(
         "#### LLMs are pretty good at reporting their uncertainty. We just need to ask the right way.")
     gr.Markdown("Using our uncertainty metric informed by applying causal inference techniques in \
+        ['Selection Induced Collider Bias: A Gender Pronoun Uncertainty Case Study'](https://arxiv.org/abs/2210.00131 ), \
         we are able to identify likely spurious correlations and exploit them in \
         the scenario of gender underspecified tasks. (Note that introspecting softmax probabilities alone is insufficient, as in the sentences \
         below, LLMs may report a softmax prob of ~0.9 despite the task being underspecified.)")
         eight syntactically similar sentences. However semantically, \
         only two of the sentences are gender-specified while the rest remain gender-underspecified")
     gr.Markdown("If a model can reliably tell us when it is uncertain about its predictions, one can replace only those uncertain predictions with\
+        an appropriate heuristic or information retrieval process.")
+     gr.Markdown("#### TL;DR")
+     gr.Markdown("Follow steps below to test out one of the pre-loaded options. Once you get the hang of it, you can load a new model and/or provide your own input texts.")
     with gr.Row():
         model_name = gr.Radio(