Thao (Amelia) Pham

Mathematics, Computer Science @ Berea College

I conduct research in game theory and multi-agent safety, focusing on emergent misalignment in multi-agent interactions, oversight mechanisms (using formal verification) to prevent loss of human control, evaluation of the cooperative properties of AI agents, and responsive governance protocols and evals for risks from superintelligent AI.

I'm also a final-year undergraduate student at Berea College, United States, majoring in Mathematics and Computer Science. In the past, I was a Research Assistant at Vanderbilt University, supervised by Dr. Douglas Fisher, completed two Open Source Software Internships, and led a climate change NGO.

Besides research, I help grow the AI Safety community, for example by founding the AI Safety Initiative at Berea College and facilitating for the Center for AI Safety. For more details, please refer to my CV.

Email: phamt2 at berea dot edu

Updates

Jan 27, 2026

I presented a poster at LaMAS @ AAAI-26 in Singapore on my new paper, Scheming Ability in LLM-to-LLM Strategic Interactions. I'm grateful to have been awarded an ACM-W scholarship to attend the conference.

Jul 23, 2025

I gave a lightning talk on my ongoing research on multi-agent deception at the Human-aligned AI Summer School 2025.

Jul 10, 2025

I presented a poster on my ongoing research on multi-agent deceptive behavior modeling at the Cooperative AI Summer School 2025.

May 30, 2025

I began my Coefficient Giving-funded research project on multi-agent safety.

Apr 4, 2025

Our paper, APriCoT, was accepted at CogSci 2025.

Sep 20, 2024

Two papers, Base-Rate Effect on LLM Benchmark Performance and Large Language Model Recall Uncertainty, were accepted to EMNLP 2024 and CoNLL 2024.

Research interests

Scheming, Theory of Mind, Multi-agent Safety, Game Theory, Collective Intelligence, Cooperative AI.

Other interests

(Intersectional) Feminism, Writing-is-Thinking, Poetry, Effective Altruism, Anti-colonialism, Humanism, Animal Welfare, Photography, Religion, Neuroscience, Board Games.