Thao (Amelia) Pham

Mathematics, Computer Science @ Berea College

I'm conducting research projects in game theory and multi-agent safety, focusing on emergent misalignment in multi-agent interactions, oversight mechanisms (using formal verification) to prevent loss-of-human-control, evaluating the cooperative properties of AI agents, and how to build responsive governance protocols/evals to superintelligence AI risks.

I'm also a final-year undergraduate student at Berea College, United States, majoring in Mathematics and Computer Science. In the past, I was a Research Assistant at Vanderbilt University, supervised by Dr. Douglas Fisher, completed two Open Source Software Internships, and led an NGO for climate change.

Besides research, I contribute to expand the AI Safety community, e.g., founding the AI Safety Initiative at Berea College, and facilitating for Center for AI Safety. For more details, please refer to my CV.

Feel free to contact me via phamt2@berea.edu

Updates

Nov 8, 2025
I presented a talk on Game Theory and Program Equilibria at the University of Dayton's Undergraduate Mathematics Conference.
Aug 15, 2025
I joined Jinesis AI Lab as a part-time Pre-Doctoral Student Researcher, supervised by Zhijing Jin, working on Open Source Game Theory.
Jul 23, 2025
I gave a lightning talk on my on-going research on multi-agent deception at the Human-aligned AI Summer School 2025.
Jul 10, 2025
I presented my poster on my ongoing research on multi-agent deceptive behavior modeling at the Cooperative AI Summer School 2025.
May 30, 2025
I began my Coefficient Giving-funded research project on multi-agent safety.
Apr 4, 2025
Our paper, APriCoT, accepted at CogSci 2025.
Sep 20, 2024
Two papers, Base-Rate Effect on LLM Benchmark Performance and Large Language Model Recall Uncertainty, accepted to EMNLP 2024 and CoNLL 2024.

Research interests

Scheming, Theory of Mind, Multi-agent Safety, Game Theory, Collective Intelligence, Cooperative AI.

Other interests

(Intersectional) Feminism, Writing-is-Thinking, Poetry, Effective Altruism, Anti-colonialism, Humanism, Animal Welfare, Photography, Religion, Neuroscience, Board Games.