Richard Ren

notrichardren

AI & ML interests

robustness, interpretability, probing, truthfulness

Organizations

Center for AI Safety's profile picture Truthfulness & Deception Research Team's profile picture Robust Control's profile picture

notrichardren's activity