Zhehao Zhang

PhD Student in Computer Science & Engineering (Language Agent Safety + LLM Alignment)

I am a first year PhD student in Computer Science & Engineering at The Ohio State University, with research interests in Language Agent Safety and Robustness of Large Language Models and Alignment. My work focuses on evaluating and mitigating the refusal behavior of LLMs, developing methods to improve the safety and reliability of language models in real-world applications.
Previously I have worked as an Applied Scientist Intern at Amazon and have collaborated with researchers at Stanford SALT Lab, Adobe Research, and Microsoft Research Lab – Asia, working on cutting-edge NLP research and applications.

Education

2025 — Present
Ph.D. in Computer Science
The Ohio State University, Columbus, OH
Advisor: Yu Su, Co-advisor: Huan Sun
Research focus: Language Agent Safety and Robustness of LLMs, Alignment Advised by Prof. Yu Su and Prof. Huan Sun
2023 — 2024
M.S. in Computer Science
Dartmouth College, Hanover, NH
Research focus: Natural Language Processing
2019 — 2023
B.Eng. in Artificial Intelligence (Honor Class)
Shanghai Jiao Tong University, Shanghai, China
Honor Class in Artificial Intelligence

Industry Research Experience

Nov 2024 — Jun 2025
Amazon, Seattle, WA
Applied Scientist Intern, People eXperience and Technology (PXT) Central Science
Work as an applied scientist intern on evaluating and mitigating the refusal behavior of LLMs.
Summer 2024
Adobe Research, San Jose, CA
Research Intern, Adobe Research
Research in multi-modal large language models and visual perception enhancement.
2023
Microsoft Research Lab – Asia, Beijing, China
Research Intern, Data, Knowledge, and Intelligence Group
Research in hierarchical table analysis and complex reasoning question answering over tabular data.

Academic Research Experience

2025 — Present
The Ohio State University, Columbus, OH
PhD Student, OSU NLP Lab
Mentor: Yu Su
Research in Language Agent Safety and Robustness of LLMs, Alignment. Advised by Prof. Yu Su and Prof. Huan Sun.
2024
Stanford University, Stanford, CA
Research Intern, Social and Language Technologies (SALT) Lab
Mentor: Diyi Yang
Research in Natural Language Processing, focusing on synthetic data and dynamic evaluation of large language models.

Honors and Awards

2025
Graduate Fellowship
awarded by Ohio State University
2025
COLM 2025 Travel Grant
2025
ICLR Notable Reviewer
2023-2025
Merit Scholarship
awarded by Dartmouth College
2019-2023
Zhiyuan Honor Scholarship and Merit Scholarship
awarded by SJTU

Publications

For the most up-to-date list of publications, please refer to my Google Scholar profile.

Selected: Latest & Greatest

Falsereject: A resource for improving contextual safety and mitigating over-refusals in llms via structured reasoning
Zhehao Zhang, Weijie Xu, Fanyou Wu, Chandan K Reddy
Conference on Language Modeling (COLM). 2025.
Project PDF Code BibTeX
DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph
Zhehao Zhang, Jiaao Chen, Diyi Yang
Advances in Neural Information Processing Systems (NeurIPS). Vancouver, Canada, 2024.
Project PDF Code BibTeX
VipAct: Visual-Perception Enhancement via Specialized VLM Agent Collaboration and Tool-use
Zhehao Zhang, Ryan A. Rossi, Tong Yu, Franck Dernoncourt, Ruiyi Zhang, Jiuxiang Gu, Sungchul Kim, Xiang Chen, Zichao Wang, Nedim Lipka
arXiv preprint (arXiv). 2024.
Project PDF BibTeX
Is GPT-4V (ision) All You Need for Automating Academic Data Visualization? Exploring Vision-Language Models' Capability in Reproducing Academic Charts
Zhehao Zhang, Weicheng Ma, Soroush Vosoughi
Findings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP). Miami, FL, USA, 2024.
Project PDF Code BibTeX
Personalization of Large Language Models: A Survey
Zhehao Zhang, Ryan A. Rossi, Branislav Kveton, Yijia Shao, Diyi Yang, Hamed Zamani, Franck Dernoncourt, Joe Barrow, Tong Yu, Sungchul Kim, Ruiyi Zhang, Jiuxiang Gu, Tyler Derr, Hongjie Chen, Junda Wu, Xiang Chen, Zichao Wang, Subrata Mitra, Nedim Lipka, Nesreen Ahmed, Yu Wang
Transactions on Machine Learning Research (TMLR). 2025.
Project PDF BibTeX
E5: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, Exhibit, and Extrapolate
Zhehao Zhang, Yan Gao, Jian-Guang Lou
Proceedings of the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL). Mexico City, Mexico, 2024.
Project PDF Code BibTeX
CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular Data
Zhehao Zhang, Xitao Li, Yan Gao, Jian-Guang Lou
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP). Singapore, 2023.
Project PDF Code BibTeX
Mitigating Biases in Hate Speech Detection from A Causal Perspective
Zhehao Zhang, Jiaao Chen, Diyi Yang
Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP). Singapore, 2023.
Project PDF Code BibTeX
Can Large Language Models Transform Computational Social Science?
Caleb Ziems, William Held, Omar Shaikh, Jiaao Chen, Zhehao Zhang, Diyi Yang
Computational Linguistics (CL). 2023.
Project PDF Code BibTeX

Conference

C6
Falsereject: A resource for improving contextual safety and mitigating over-refusals in llms via structured reasoning
Zhehao Zhang, Weijie Xu, Fanyou Wu, Chandan K Reddy
Conference on Language Modeling (COLM). 2025.
Project PDF Code BibTeX
C5
DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph
Zhehao Zhang, Jiaao Chen, Diyi Yang
Advances in Neural Information Processing Systems (NeurIPS). Vancouver, Canada, 2024.
Project PDF Code BibTeX
C4
Is GPT-4V (ision) All You Need for Automating Academic Data Visualization? Exploring Vision-Language Models' Capability in Reproducing Academic Charts
Zhehao Zhang, Weicheng Ma, Soroush Vosoughi
Findings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP). Miami, FL, USA, 2024.
Project PDF Code BibTeX
C3
E5: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, Exhibit, and Extrapolate
Zhehao Zhang, Yan Gao, Jian-Guang Lou
Proceedings of the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL). Mexico City, Mexico, 2024.
Project PDF Code BibTeX
C2
CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular Data
Zhehao Zhang, Xitao Li, Yan Gao, Jian-Guang Lou
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP). Singapore, 2023.
Project PDF Code BibTeX
C1
Mitigating Biases in Hate Speech Detection from A Causal Perspective
Zhehao Zhang, Jiaao Chen, Diyi Yang
Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP). Singapore, 2023.
Project PDF Code BibTeX

Journal

J2
Personalization of Large Language Models: A Survey
Zhehao Zhang, Ryan A. Rossi, Branislav Kveton, Yijia Shao, Diyi Yang, Hamed Zamani, Franck Dernoncourt, Joe Barrow, Tong Yu, Sungchul Kim, Ruiyi Zhang, Jiuxiang Gu, Tyler Derr, Hongjie Chen, Junda Wu, Xiang Chen, Zichao Wang, Subrata Mitra, Nedim Lipka, Nesreen Ahmed, Yu Wang
Transactions on Machine Learning Research (TMLR). 2025.
Project PDF BibTeX
J1
Can Large Language Models Transform Computational Social Science?
Caleb Ziems, William Held, Omar Shaikh, Jiaao Chen, Zhehao Zhang, Diyi Yang
Computational Linguistics (CL). 2023.
Project PDF Code BibTeX

Preprint

1
VipAct: Visual-Perception Enhancement via Specialized VLM Agent Collaboration and Tool-use
Zhehao Zhang, Ryan A. Rossi, Tong Yu, Franck Dernoncourt, Ruiyi Zhang, Jiuxiang Gu, Sungchul Kim, Xiang Chen, Zichao Wang, Nedim Lipka
arXiv preprint (arXiv). 2024.
Project PDF BibTeX

Service

Reviewer EMNLP 2023, 2024; NeurIPS 2023, 2024, 2025; NAACL 2024; ACL 2024, 2025; COLM 2024
CIKM 2024, 2025; ICLR 2025; COLING 2025; IJCAI 2025; IEEE TNNLS Journal
Volunteer EMNLP 2023; NAACL 2024

References

Prof. Yu Su Associate Professor, Computer Science & Engineering The Ohio State University ysu@cse.ohio-state.edu

Prof. Huan Sun Associate Professor, Computer Science & Engineering The Ohio State University sun.397@osu.edu

Prof. Diyi Yang Assistant Professor, Computer Science Stanford University diyiy@stanford.edu

Dr. Ryan Rossi Principal Research Scientist Adobe Research rrossi@adobe.com