Publications
Publications by category in reverse chronological order. Generated by jekyll-scholar.
2026
- TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents. arXiv, 2026
- rePIRL: Learn PRM with Inverse RL for LLM Reasoning. Submitted to ICML, 2026
2025
- AgentOrca: A Dual-System Framework to Evaluate Language Agents on Operational Routine and Constraint Adherence. ACL, 2025
- MELON: Indirect Prompt Injection Defense via Masked Re-execution and Tool Comparison. ICML, 2025
2024
- PromptBench: A Unified Library for Evaluation of Large Language Models. JMLR MLOSS, 2024
- DyVal: Graph-informed Dynamic Evaluation of Large Language Models. ICLR (Spotlight), 2024
- Emotionprompt: Leveraging psychology for large language models enhancement via emotional stimulus. ICML, 2024
- CompeteAI: Understanding the Competition Behaviors in Large Language Model-based Agents. ICML (Oral), 2024
- DyVal 2: Dynamic Evaluation of Large Language Models by Meta Probing Agents. ICML, 2024
- AgentReview: Exploring Peer Review Dynamics with LLM Agents. In The 2024 Conference on Empirical Methods in Natural Language Processing, 2024
2023
- PromptBench: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts. CCS LAMPS Workshop, 2023
- Improving Generalization of Adversarial Training via Robust Critical Fine-Tuning. ICCV, 2023
- A survey on evaluation of large language models. ACM TIST, 2023