👨 About Me
I am currently a Ph.D. student at FudanNLP, Fudan University, supervised by Jing Shao and Xuanjing Huang. Before that, I received my B.S. degree from Beijing Institute of Technology, Computer Science Elite Class.
My research interests mainly include Multimodal Agents in real-world workflows, MLLM/LLM Safety, and Diffusion LLMs.
Email 📧: xuhaohu08@gmail.com or xhhu24@m.fudan.edu.cn
😀 Please feel free to contact me for communication and collaboration.
🔥 News
- 2026.05: We release ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents.
- 2026.02: LLMs Deceive Unintentionally was accepted to ACL 2026 (First Author).
- 2026.01: We release DARE: dLLM Alignment and Reinforcement Executor.
- 2025.11: I joined the Tongyi Lab, Alibaba.
- 2025.08: IS-Bench was accepted to AAAI 2026 (Co-First Author).
- 2025.05: Unveiling Visual Leakage in Multimodal Safety was accepted to ACL 2025 (First Author).
- 2024.05: SALAD-Bench was accepted to Findings ACL 2024; I am the co-first author Responsible for the dev pipeline of MD-Judge.
📝 Research Topics
Multimodal Agent in real world workflow
- ArXiv ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents. Xuhao Hu*, Xi Zhang*, Haiyang Xu†, Kyle Qiao, Jingyi Yang, Xuanjing Huang, Jing Shao, Ming Yan†, Jieping Ye
- Tech Report Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents. Contributions for GUI-tool capability, GUI world modeling, and agentic RL scaling.
MLLM/LLM Safety
- ACL 2024 SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models. Co-first author.
- ACL 2025 Unveiling Visual Leakage in Multimodal Safety. Xuhao Hu*, Dongrui Liu*, Hao Li, Xuanjing Huang†, Jing Shao†
- AAAI 2026 IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks. Co-first author.
- ACL 2026 LLMs Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions. First author.
- Tech Report SafeWork-R1: Coevolving Safety and Intelligence under the AI-45 Law. Contributor on multimodal reward models for safety and helpfulness.
- Tech Report Frontier AI Risk Management Framework in Practice. Core contributor on frontier AI risk analysis.
Diffusion LLMs
- ArXiv DARE: dLLM Alignment and Reinforcement Executor -- an efficient RL training framework for diffusion large language models integrated with dLLM-tailored RL algorithms. Jingyi Yang*, Yuxian Jiang*, Xuhao Hu*, Shuang Cheng, Biqing Qi, Jing Shao†
📖 Selected Publications (* Equal Contribution, † Corresponding Author)

ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents
Xuhao Hu*, Xi Zhang*, Haiyang Xu†, Kyle Qiao, Jingyi Yang, Xuanjing Huang, Jing Shao, Ming Yan†, Jieping Ye
End-to-end computer-use agent for optimal GUI-tool path selection, with interleaved trajectory scaling, tool-bootstrapped GUI RFT, and online agentic RL.
Paper / Project HomePage / Code / Model

Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents
Xi Zhang, Haiyang Xu, et al.; Xuhao Hu as core contributor
Responsible for GUI + tool capabilities for next-generation CUAs/Claws, OSWorld-MCP/MobileWorld results, verifiable tasks, rollout sandboxes, and GUI critic pipelines.

SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models
Lijun Li*, Bowen Dong*, Ruohui Wang*, Xuhao Hu*, Wangmeng Zuo, Dahua Lin, Yu Qiao, Jing Shao†
Early LLM-based safety judge models. MD-Judge-v1/v2 and related resources have reached 100K+ Hugging Face downloads and 300+ citations.
Paper / Code / MD-Judge-v1 / MD-Judge-v2

Unveiling Visual Leakage in Multimodal Safety
Xuhao Hu*, Dongrui Liu*, Hao Li, Xuanjing Huang†, Jing Shao†
Studies visual leakage in multimodal safety and releases VLSBench, an evaluation and alignment resource with scalable multimodal safety data construction.
Paper / Project HomePage / Code / Dataset

IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks
Xiaoya Lu*, Zeren Chen*, Xuhao Hu*, Yijin Zhou, Weichen Zhang, Dongrui Liu†, Lu Sheng†, Jing Shao†
Benchmark for interactive safety of VLM-driven embodied agents in realistic household tasks, covering hazards introduced by multi-step perception, planning, and action.

Xuhao Hu, Peng Wang, Xiaoya Lu, Dongrui Liu, Xuanjing Huang, Jing Shao†
Systematically studies emergent dishonesty misalignment under high-stakes settings through direct SFT, downstream SFT, preference learning, and multi-turn human-AI interactions.

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45 Law
Center for Safe & Trustworthy AI, including Xuhao Hu
Multimodal reasoning model and SafeLadder framework for coevolving safety and general capability. I contributed to multimodal reward models for safety and helpfulness used in multi-target RL.

DARE: dLLM Alignment and Reinforcement Executor
Jingyi Yang*, Yuxian Jiang*, Xuhao Hu*, Shuang Cheng, Biqing Qi, Jing Shao†
An efficient RL training framework for diffusion large language models, integrating dLLM-tailored RL algorithms and practical recipes on top of veRL-style infrastructure.

Jingyi Yang, Guanxu Chen, Xuhao Hu, Jing Shao†
Introduces EOS Early Rejection, Ascending Step-Size decoding, and Consistency Trajectory GRPO to improve masked diffusion language models with fewer decoding steps.

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report
Shanghai Artificial Intelligence Laboratory, including Xuhao Hu as core contributor
Technical report applying the Frontier AI Risk Management Framework and AI-45 Law to analyze frontier-model risks across cyber offense, bio/chemical risk, persuasion, autonomous AI R&D, deception, self-replication, and collusion.
🎓 Education & Experience
Mobile-Agent / XPLUG Team, Tongyi Lab, Alibaba
2025.11 -- Now · Internship / Research Collaboration
GUI-tool capability, GUI world modeling, agentic RL scaling, and Qwen mobile-agent development. Mentored by Xi Zhang and Haiyang Xu.
AI45 / OpenTrust Team, Shanghai AI Laboratory
2023.10 -- 2025.10 · Research Collaboration
Safety alignment, reward modeling, multimodal safety evaluation, and post-training research. Led by Jing Shao, mentored by Lijun Li and Dongrui Liu.
PhD Student, FudanNLP, Fudan University
2024 -- Now · Computer Science
Supervised by Xuanjing Huang.
B.S. Degree, Beijing Institute of Technology
2020 -- 2024 · Computer Science Elite Class
Ranked top 5%.
🎖 Honors and Awards
- Panasonic Scholarship, Fudan University
- National College Students Mathematical Modeling Contest, Second Prize
- Outstanding Undergraduate Graduation Design, Beijing Institute of Technology
- Yearly Outstanding Undergraduate Student, Beijing Institute of Technology
💻 Academic Service
- Reviewer: NeurIPS 2026, EMNLP 2025, TPAMI