Research
Selected publications and ongoing projects
2026
Preprint
GT-HarmBench: Benchmarking AI Safety Risks Through the Lens of Game Theory
*Equal Contribution. Under Review, Jan 2026
LaMAS @ AAAI
Scheming Ability in LLM-to-LLM Strategic Interactions
LaMAS @ AAAI-26, Jan 2026
2025
CogSci
Chain of Thought Still Thinks Fast: APriCoT Helps with Thinking Slow
Cognitive Science Society (CogSci), Apr 2025
2024
EMNLP
The Base-Rate Effect on LLM Benchmark Performance: Disambiguating Test-Taking Strategies From Benchmark Performance
Empirical Methods in Natural Language Processing (EMNLP), Sep 2024
CoNLL
Large Language Model Recall Uncertainty is Modulated by the Fan Effect
Computational Natural Language Learning (CoNLL), Sep 2024