Jehyeok Yeon

Oh, hey there! I’m Jehyeok Yeon, an incoming PhD candidate at the Max Planck Institute for Intelligent Systems, working with Maksym Andriushchenko. I focus on building intelligent systems that are both technically robust and socially responsible. I was previously at the University of Illinois Urbana-Champaign, where I was advised by Professor Gagandeep Singh at the FOCAL Lab.

My current research focuses on scalable oversight and “superalignment”. As AI models become more capable, I believe it’s more important than ever that we can keep evaluating and analyzing them, even if they become smarter than us. This could mean creating more difficult tasks with no ground truth answer, finding scalable ways to control and interpret models, or reverse-engineering their internal representations to guarantee their hidden objectives match our instructions.

Outside of research, I care a lot about writing, both creative and academic, and how it shapes the way we think. I spend a lot of time walking and thinking through ideas, and I try to catch theatre performances whenever I can. There’s something powerful about live storytelling that reminds me why I study intelligence in the first place. When I’m not writing or watching something live, I’m usually reading something. Moby Dick by Herman Melville is currently on my nightstand.

The fastest way to reach me is through tommy8289@gmail.com. Feel free to reach out if you have any interesting ideas to throw around!

news

May 5, 2026	FlowGuard accepted to ICML 2026 as a Spotlight paper!
Apr 13, 2026	Offered an NSF Graduate Research Fellowship!
Mar 2, 2026	LLMCert-T accepted to Agents in the Wild: Safety, Security, and Beyond Workshop at ICLR 2026!
Jan 7, 2026	Started a research internship at the Max Planck Institute for Intelligent Systems with Maksym Andriushchenko, working on scalable oversight and superalignment.
Dec 15, 2025	Graduated with my B.S. from the University of Illinois Urbana-Champaign!

latest posts

Feb 14, 2026	Research: Your Agent Can't Read the Code It's Running
Jan 17, 2026	Opinion: You Cannot Secure a System That Believes Everything It Reads
Jan 2, 2026	Research: We Need a Same-Origin Policy for AI Agents

selected publications

2026

Securing Multimodal AI through Internal Information Decomposition

Jehyeok Yeon, Hyeonjeong Ha, Qiusi Zhan, and 1 more author

Proceedings of the 43rd International Conference on Machine Learning (ICML 2026), 2026

Spotlight

2025

Certifying Robustness of Agent Tool-Selection Under Adversarial Attacks

Jehyeok Yeon, Isha Chaudhary, and Gagandeep Singh

ICLR 2026 Agentic AI in the Wild Workshop, 2025

TRAP: Targeted Redirecting of Agentic Preferences

Jehyeok Yeon*, Hangoo Kang*, and Gagandeep Singh

Proceedings of the 39th Conference on Neural Information Processing Systems (NeurIPS 2025), 2025