Charles Rathkopf
Charles Rathkopf
Home
Research
Projects
Publications
Talks
Contact
CV
AI Ethics
Is AI deception real?
I examine whether alignment faking in Claude 3 Opus constitutes genuine deception, arguing it represents shallow deception - genuine intentional behavior that differs systematically from human deception.
Jan 23, 2026 2:00 PM — 3:30 PM
Forschungszentrum Jülich
Deep learning models in science: some risks and opportunities
Under some conditions, we ought to trade interpretability for predictive power.
Jun 11, 2024
Jülich/Düsseldorf
Follow
Cite
×