Publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2025
-
MELON: Indirect Prompt Injection Defense via Masked Re-execution and Tool ComparisonICML, 2025
2024
-
DyVal: Graph-informed Dynamic Evaluation of Large Language ModelsICLR (Spotlight), 2024 - Emotionprompt: Leveraging psychology for large language models enhancement via emotional stimulusICML, 2024
- CompeteAI: Understanding the Competition Behaviors in Large Language Model-based AgentsICML (Oral), 2024
-
DyVal 2: Dynamic Evaluation of Large Language Models by Meta Probing AgentsICML, 2024 - AgentReview: Exploring Peer Review Dynamics with LLM AgentsIn The 2024 Conference on Empirical Methods in Natural Language Processing, 2024
2023
-
PromptBench: Towards Evaluating the Robustness of Large Language Models on Adversarial PromptsCCS LAMPS Workshop, 2023 -
Improving Generalization of Adversarial Training via Robust Critical Fine-TuningICCV, 2023 - A survey on evaluation of large language modelsACM TIST, 2023