👨 About Me

I am currently a Ph.D. student at FudanNLP, Fudan University, supervised by Jing Shao and Xuanjing Huang. Before that, I received my B.S. degree from Beijing Institute of Technology, Computer Science Elite Class.

My research interests mainly include Multimodal Agents in real-world workflows, MLLM/LLM Safety, and Diffusion LLMs.

Email 📧: xuhaohu08@gmail.com or xhhu24@m.fudan.edu.cn

😀 Please feel free to contact me for communication and collaboration.

🔥 News

2026.05: We release ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents.
2026.02: LLMs Deceive Unintentionally was accepted to ACL 2026 (First Author).
2026.01: We release DARE: dLLM Alignment and Reinforcement Executor.
2025.11: I joined the Tongyi Lab, Alibaba.
2025.08: IS-Bench was accepted to AAAI 2026 (Co-First Author).
2025.05: Unveiling Visual Leakage in Multimodal Safety was accepted to ACL 2025 (First Author).
2024.05: SALAD-Bench was accepted to Findings ACL 2024; I am the co-first author Responsible for the dev pipeline of MD-Judge.

📝 Research Topics

Multimodal Agent in real world workflow

ArXiv ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents. Xuhao Hu^*, Xi Zhang^*, Haiyang Xu^†, Kyle Qiao, Jingyi Yang, Xuanjing Huang, Jing Shao, Ming Yan^†, Jieping Ye
Tech Report Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents. Contributions for GUI-tool capability, GUI world modeling, and agentic RL scaling.

MLLM/LLM Safety

ACL 2024 SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models. Co-first author.
ACL 2025 Unveiling Visual Leakage in Multimodal Safety. First Author.
AAAI 2026 IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks. Co-first author.
ACL 2026 LLMs Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions. First author.
Tech Report SafeWork-R1: Coevolving Safety and Intelligence under the AI-45 Law. Contributor for safety and helpfulness.
Tech Report Frontier AI Risk Management Framework in Practice. Core contributor on frontier AI risk analysis.

Diffusion LLMs

ArXiv DARE: dLLM Alignment and Reinforcement Executor -- an efficient RL training framework for diffusion large language models integrated with dLLM-tailored RL algorithms. Jingyi Yang^*, Yuxian Jiang^*, Xuhao Hu^*, Shuang Cheng, Biqing Qi, Jing Shao^†

📖 Selected Publications (* Equal Contribution, † Corresponding Author)

ArXiv

ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents

Xuhao Hu^*, Xi Zhang^*, Haiyang Xu^†, Kyle Qiao, Jingyi Yang, Xuanjing Huang, Jing Shao, Ming Yan^†, Jieping Ye

End-to-end computer-use agent for optimal GUI-tool path selection, with interleaved trajectory scaling, tool-bootstrapped GUI RFT, and online agentic RL.

Paper / Project HomePage / Code / Model

Tech Report

Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents

Xi Zhang, Haiyang Xu, et al.; Xuhao Hu as core contributor

Responsible for GUI + tool capabilities for next-generation CUAs/Claws, OSWorld-MCP/MobileWorld results, verifiable tasks, rollout sandboxes, and GUI critic pipelines.

Paper / Code / X-PLUG

ACL 2024

SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models

Lijun Li^*, Bowen Dong^*, Ruohui Wang^*, Xuhao Hu^*, Wangmeng Zuo, Dahua Lin, Yu Qiao, Jing Shao^†

Early LLM-based safety judge models. MD-Judge-v1/v2 and related resources have reached 100K+ Hugging Face downloads and 300+ citations.

Paper / Code / MD-Judge-v1 / MD-Judge-v2

ACL 2025

Unveiling Visual Leakage in Multimodal Safety

Xuhao Hu^*, Dongrui Liu^*, Hao Li, Xuanjing Huang^†, Jing Shao^†

Studies visual leakage in multimodal safety and releases VLSBench, an evaluation and alignment resource with scalable multimodal safety data construction.

Paper / Project HomePage / Code / Dataset

AAAI 2026

IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks

Xiaoya Lu^*, Zeren Chen^*, Xuhao Hu^*, Yijin Zhou, Weichen Zhang, Dongrui Liu^†, Lu Sheng^†, Jing Shao^†

Benchmark for interactive safety of VLM-driven embodied agents in realistic household tasks, covering hazards introduced by multi-step perception, planning, and action.

Paper / Code

ACL 2026

LLMs Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions

Xuhao Hu, Peng Wang, Xiaoya Lu, Dongrui Liu, Xuanjing Huang, Jing Shao^†

Systematically studies emergent dishonesty misalignment under high-stakes settings through direct SFT, downstream SFT, preference learning, and multi-turn human-AI interactions.

Paper / Code

Tech Report

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45 Law

Center for Safe & Trustworthy AI, including Xuhao Hu

Multimodal reasoning model and SafeLadder framework for coevolving safety and general capability. I contributed to multimodal reward models for safety and helpfulness used in multi-target RL.

Paper / Project HomePage

ArXiv

DARE: dLLM Alignment and Reinforcement Executor

Jingyi Yang^*, Yuxian Jiang^*, Xuhao Hu^*, Shuang Cheng, Biqing Qi, Jing Shao^†

An efficient RL training framework for diffusion large language models, integrating dLLM-tailored RL algorithms and practical recipes on top of veRL-style infrastructure.

Paper / Code

ArXiv

Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Step

Jingyi Yang, Guanxu Chen, Xuhao Hu, Jing Shao^†

Introduces EOS Early Rejection, Ascending Step-Size decoding, and Consistency Trajectory GRPO to improve masked diffusion language models with fewer decoding steps.

Paper / Code

Tech Report

Frontier AI risk management report title page

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report

Shanghai Artificial Intelligence Laboratory, including Xuhao Hu as core contributor

Technical report applying the Frontier AI Risk Management Framework and AI-45 Law to analyze frontier-model risks across cyber offense, bio/chemical risk, persuasion, autonomous AI R&D, deception, self-replication, and collusion.

Paper