Kaijie Zhu

kaijiezhu@ucsb.edu

CA, US

I’m a second-year Ph.D student at UCSB, fortunately advised by Prof. William Wang and Prof. Wenbo Guo.

My research interests span evaluating, training, and securing AI agent systems.

Reasoning and Agent: training (TermiGen arXiv, rePIRL ICML 2026), benchmarking (DevOps-Gym, ICLR 2026), defending indirect prompt injection (MELON, ICML 2025)
Dynamic Evaluation of LLMs: DyVal series (ICLR’24 Spotlight, ICML’24), PromptBench (JMLR MLOSS)
Adversarial Robustness: LLM prompt robustness (PromptRobust CCS’24 LAMPS), CNN robustness (RiFT ICCV’23)

news

May 1, 2026	rePIRL is accepted by ICML 2026.
Jan 26, 2026	DevOps-Gym is accepted by ICLR 2026.
May 1, 2025	MELON is accepted by ICML 2025.
Feb 25, 2025	Hosting the AAAI 2025 Tutorial on Evaluating Large Language Models: Challenges and Methods with Prof. Jindong Wang, Dr. Linyi Yang, Prof. Yue Feng, and Prof. Yue Zhang.
Jan 20, 2025	Selected to present a talk at the KAUST Rising Stars in AI Symposium 2025.

TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents

Kaijie Zhu, Yuzhou Nie, Yijiang Li, and 10 more authors

arXiv, 2026

arXiv
rePIRL: Learn PRM with Inverse RL for LLM Reasoning

Xian Wu, Kaijie Zhu, Wenbo Guo, and 1 more author

ICML, 2026
DevOps-Gym: Benchmarking AI Agents in Software DevOps Cycle

Yuheng Tang, Kaijie Zhu, and others

ICLR, 2026

Website
MELON: Indirect Prompt Injection Defense via Masked Re-execution and Tool Comparison

Kaijie Zhu, Xianjun Yang, Jindong Wang, and 2 more authors

ICML, 2025
DyVal: Graph-informed Dynamic Evaluation of Large Language Models

Kaijie Zhu, Jiaao Chen, Jindong Wang, and 3 more authors

ICLR (Spotlight), 2024
PromptBench: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts

Kaijie Zhu, Jindong Wang, Jiaheng Zhou, and 8 more authors

CCS LAMPS Workshop, 2023
Improving Generalization of Adversarial Training via Robust Critical Fine-Tuning

Kaijie Zhu, Xixu Hu, Jindong Wang, and 2 more authors

ICCV, 2023
DyVal 2: Dynamic Evaluation of Large Language Models by Meta Probing Agents

Kaijie Zhu, Jindong Wang, Qinlin Zhao, and 2 more authors

ICML, 2024