
Hi, I'm Zhehao Zhang
I'm a PhD Student in Computer Science at The Ohio State University, working on Language Agents.
I am a first year PhD student in Computer Science & Engineering at
The Ohio State University and a member of the OSU NLP Lab, advised by Prof. Yu Su and closely collaborating with Prof. Huan Sun. Previously, I worked as a Research Intern at
Stanford SALT Lab,
Netflix,
Amazon,
Adobe Research, and
Microsoft Research Lab – Asia. I received my Master's degree from
Dartmouth College and Bachelor's degree in Artificial Intelligence Honor Class at
Shanghai Jiao Tong University.
The Ohio State University and a member of the OSU NLP Lab, advised by Prof. Yu Su and closely collaborating with Prof. Huan Sun. Previously, I worked as a Research Intern at
Stanford SALT Lab,
Netflix,
Amazon,
Adobe Research, and
Dartmouth College and Bachelor's degree in Artificial Intelligence Honor Class at
Shanghai Jiao Tong University.
My research interests lie in Language Agents, Agent Safety, (Recursive) Self-Evolving Agents, and LLM Alignment. I focus on developing methods to evaluate and improve the safety, robustness, and reliability of language agents and LLMs in real-world applications. I believe that agentic AI will drive the next industrial revolution, and I am excited to build agents that are not only capable but also safe, trustworthy, and continually self-improving.
Please feel free to contact me by email (zhang.16420@osu.edu) for collaboration opportunities!
News
2026 June
Joined
Netflix as a Machine Learning Intern, working on language agents for automatic video editing.
Netflix as a Machine Learning Intern, working on language agents for automatic video editing.
2026 May
🎉 Two papers, AutoElicit and Misaligned Action Detection, are accepted to ICML 2026!
2026 May
🏅 Recognized as a Gold Reviewer for ICML 2026.
2025 Nov
🎉 VipAct, on visual-perception enhancement via specialized VLM agent collaboration and tool-use, is accepted to AAAI 2026!
2025 Sept
✈️ Received the COLM 2025 Travel Grant.
2025 Sept
🎓 Started my PhD journey at the OSU NLP Group, working on language agents, supported by an OSU Fellowship.
2025 July
📰 Our FalseReject work is featured on the Amazon Science Blog and AI Era.
2025 July
🎉 FalseReject, a resource for improving contextual safety and mitigating over-refusals in LLMs via structured reasoning, is accepted to COLM 2025. See you in Montreal!
2025 Mar
🎉 Personalization of Large Language Models: A Survey is accepted to TMLR.
2024 Nov
Joined
Amazon as an Applied Scientist Intern in Seattle.
Amazon as an Applied Scientist Intern in Seattle.
2024 Sept 25
🎉 The DARG paper is accepted to NeurIPS 2024. See you in Vancouver!
2024 Sept 20
🎉 One first-author paper on vision language models for academic chart generation is accepted to EMNLP 2024. See you in Miami!
2024 Sept 5
🎤 Honored to give a talk on Recent Advances in Synthetic Data for Foundation Models at Stanford SALT Lab. Slides.
2024 June 25
📄 A new preprint is released! Please check DARG, a dynamic evaluation framework that augments current reasoning benchmarks from the level of reasoning graphs.
2024 Mar 13
🎉 One first-author paper on LLMs for hierarchical table analysis is accepted to NAACL 2024. See you in Mexico City!
2024 Mar
🎤 Honored to give a talk on Augmented Language Models at TRIP Lab at Dartmouth, hosted by Prof. Yaoqing Yang. Slides and Recording.
2024 Feb
I will join
Adobe Research as a Research Intern this summer. See you in San Jose and the Bay Area!
Adobe Research as a Research Intern this summer. See you in San Jose and the Bay Area!
2023 Dec 14
🎉 One first-author paper from my undergraduate is accepted to ICASSP 2024.
2023 Oct 27
🎉 The paper "Can Large Language Models Transform Computational Social Science?" is accepted to Computational Linguistics.
2023 Oct 7
🎉 Two first-author papers (CRT-QA and Hate Speech Detection) from my undergraduate are accepted to EMNLP 2023. See you in Singapore!
Featured Research Publications
Selected papers on language agents, agent safety, and the alignment and robustness of large language models. See my Google Scholar for the full list.