Safety - a geesimon Collection

geesimon 's Collections

AGI

Prompt

Game

Coding

Federation Learning

Eval

Safety

Video

Model

Safety

updated Nov 15, 2023

Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?

Paper • 2311.07587 • Published Nov 8, 2023 • 5