Richard Ren

notrichardren

AI & ML interests

robustness, interpretability, probing, truthfulness

notrichardren's activity

liked 2 datasets almost 2 years ago

notrichardren/truthfulness_legacy

Viewer • Updated Jun 29, 2023 • 210k • 33 • 3

NeelNanda/counterfact-tracing

Viewer • Updated Nov 5, 2022 • 21.9k • 111 • 13