👨 About Me

I am currently a Ph.D. student at FudanNLP, Fudan University, supervised by Jing Shao and Xuanjing Huang. Before that, I received my B.S. degree from Beijing Institute of Technology, Computer Science Elite Class.

My research interests mainly include Multimodal Agents in real-world workflows, MLLM/LLM Safety, and Diffusion LLMs.

Email 📧: xuhaohu08@gmail.com or xhhu24@m.fudan.edu.cn

😀 Please feel free to contact me for communication and collaboration.

🔥 News

  • 2026.05: We release ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents.
  • 2026.02: LLMs Deceive Unintentionally was accepted to ACL 2026 (First Author).
  • 2026.01: We release DARE: dLLM Alignment and Reinforcement Executor.
  • 2025.11: I joined the Tongyi Lab, Alibaba.
  • 2025.08: IS-Bench was accepted to AAAI 2026 (Co-First Author).
  • 2025.05: Unveiling Visual Leakage in Multimodal Safety was accepted to ACL 2025 (First Author).
  • 2024.05: SALAD-Bench was accepted to Findings ACL 2024; I am the co-first author Responsible for the dev pipeline of MD-Judge.

📝 Research Topics

Multimodal Agent in real world workflow

MLLM/LLM Safety

Diffusion LLMs

  • ArXiv DARE: dLLM Alignment and Reinforcement Executor -- an efficient RL training framework for diffusion large language models integrated with dLLM-tailored RL algorithms. Jingyi Yang*, Yuxian Jiang*, Xuhao Hu*, Shuang Cheng, Biqing Qi, Jing Shao

📖 Selected Publications (* Equal Contribution, † Corresponding Author)

ArXiv
ToolCUA teaser

ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents

Xuhao Hu*, Xi Zhang*, Haiyang Xu, Kyle Qiao, Jingyi Yang, Xuanjing Huang, Jing Shao, Ming Yan, Jieping Ye

End-to-end computer-use agent for optimal GUI-tool path selection, with interleaved trajectory scaling, tool-bootstrapped GUI RFT, and online agentic RL.

Paper / Project HomePage / Code / Model

Tech Report
Mobile-Agent logo

Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents

Xi Zhang, Haiyang Xu, et al.; Xuhao Hu as core contributor

Responsible for GUI + tool capabilities for next-generation CUAs/Claws, OSWorld-MCP/MobileWorld results, verifiable tasks, rollout sandboxes, and GUI critic pipelines.

Paper / Code / X-PLUG

ACL 2024
SALAD-Bench overview

SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models

Lijun Li*, Bowen Dong*, Ruohui Wang*, Xuhao Hu*, Wangmeng Zuo, Dahua Lin, Yu Qiao, Jing Shao

Early LLM-based safety judge models. MD-Judge-v1/v2 and related resources have reached 100K+ Hugging Face downloads and 300+ citations.

Paper / Code / MD-Judge-v1 / MD-Judge-v2

ACL 2025
VLSBench teaser

Unveiling Visual Leakage in Multimodal Safety

Xuhao Hu*, Dongrui Liu*, Hao Li, Xuanjing Huang, Jing Shao

Studies visual leakage in multimodal safety and releases VLSBench, an evaluation and alignment resource with scalable multimodal safety data construction.

Paper / Project HomePage / Code / Dataset

AAAI 2026
IS-Bench motivation

IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks

Xiaoya Lu*, Zeren Chen*, Xuhao Hu*, Yijin Zhou, Weichen Zhang, Dongrui Liu, Lu Sheng, Jing Shao

Benchmark for interactive safety of VLM-driven embodied agents in realistic household tasks, covering hazards introduced by multi-step perception, planning, and action.

Paper / Code

ACL 2026
LLMs Deceive Unintentionally teaser

LLMs Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions

Xuhao Hu, Peng Wang, Xiaoya Lu, Dongrui Liu, Xuanjing Huang, Jing Shao

Systematically studies emergent dishonesty misalignment under high-stakes settings through direct SFT, downstream SFT, preference learning, and multi-turn human-AI interactions.

Paper / Code

Tech Report
SafeWork-R1 evolution figure

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45 Law

Center for Safe & Trustworthy AI, including Xuhao Hu

Multimodal reasoning model and SafeLadder framework for coevolving safety and general capability. I contributed to multimodal reward models for safety and helpfulness used in multi-target RL.

Paper / Project HomePage

ArXiv
DARE logo

DARE: dLLM Alignment and Reinforcement Executor

Jingyi Yang*, Yuxian Jiang*, Xuhao Hu*, Shuang Cheng, Biqing Qi, Jing Shao

An efficient RL training framework for diffusion large language models, integrating dLLM-tailored RL algorithms and practical recipes on top of veRL-style infrastructure.

Paper / Code

ArXiv
CTRLD overview figure

Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Step

Jingyi Yang, Guanxu Chen, Xuhao Hu, Jing Shao

Introduces EOS Early Rejection, Ascending Step-Size decoding, and Consistency Trajectory GRPO to improve masked diffusion language models with fewer decoding steps.

Paper / Code

Tech Report
Frontier AI risk management report title page

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report

Shanghai Artificial Intelligence Laboratory, including Xuhao Hu as core contributor

Technical report applying the Frontier AI Risk Management Framework and AI-45 Law to analyze frontier-model risks across cyber offense, bio/chemical risk, persuasion, autonomous AI R&D, deception, self-replication, and collusion.

Paper

🎓 Education & Experience

Mobile-Agent / XPLUG Team, Tongyi Lab, Alibaba

2025.11 -- Now · Internship / Research Collaboration

GUI-tool capability, GUI world modeling, agentic RL scaling, and Qwen mobile-agent development. Mentored by Xi Zhang and Haiyang Xu.

AI45 / OpenTrust Team, Shanghai AI Laboratory

2023.10 -- 2025.10 · Research Collaboration

Safety alignment, reward modeling, multimodal safety evaluation, and post-training research. Led by Jing Shao, mentored by Lijun Li and Dongrui Liu.

PhD Student, FudanNLP, Fudan University

2024 -- Now · Computer Science

Supervised by Xuanjing Huang.

B.S. Degree, Beijing Institute of Technology

2020 -- 2024 · Computer Science Elite Class

Ranked top 5%.

🎖 Honors and Awards

  • Panasonic Scholarship, Fudan University
  • National College Students Mathematical Modeling Contest, Second Prize
  • Outstanding Undergraduate Graduation Design, Beijing Institute of Technology
  • Yearly Outstanding Undergraduate Student, Beijing Institute of Technology

💻 Academic Service

  • Reviewer: NeurIPS 2026, EMNLP 2025, TPAMI

🌏 Visitors