Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
geesimon 's Collections
MultiModal
Distillation
AGI
Inference
Prompt
Game
Finetune
Coding
Federation Learning
Eval
Safety
Video
Model

Safety

updated Nov 15, 2023
Upvote
-

  • Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?

    Paper • 2311.07587 • Published Nov 8, 2023 • 5
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs