I am a second-year master student at the School of AI, Beihang University (BUAA), supervised by Prof. Lei Sha.

My previous research focused on the safety alignment of LLM and VLM, and I am now seeking a PhD position for 2027 Fall.

πŸ”₯ News

  • 2025.08: Β πŸŽ‰πŸŽ‰ Two papers (LARF and DIffusionAttacker) are accepted by EMNLP 2025 and DIffusionAttacker is selected as Oral Presentation.
  • 2025.03: Β πŸŽ‰πŸŽ‰ Two papers (ActorBreaker and VLSBench) are accepted by ACL 2025 and ActorBreaker is selected as Outstanding Paper.
  • 2024.09: Β πŸŽ‰πŸŽ‰ ASETF is accepted by EMNLP 2024 and selected as Oral Presentation.

πŸ“ Publications

EMNLP 2025
Layer-Aware Representation Filtering: Purifying Finetuning Data to Preserve LLM Safety Alignment

Layer-Aware Representation Filtering: Purifying Finetuning Data to Preserve LLM Safety Alignment

Hao Li*, Lijun Li*, Zhenghao Lu, Xianyi Wei, Rui Li, Jing Shao, Lei Sha

EMNLP 2025 Oral
DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak

DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak

Hao Wang, Hao Li, Junda Zhu, Xinyuan Wang, Chengwei Pan, Minlie Huang, Lei Sha

ACL 2025 Outstanding Paper
LLMs know their vulnerabilities: Uncover Safety Gaps through Natural Distribution Shifts

LLMs know their vulnerabilities: Uncover Safety Gaps through Natural Distribution Shifts

Qibing Ren*, Hao Li*, Dongrui Liu, Zhanxu Xie, Xiaoya Lu, Yu Qiao, Lei Sha, Junchi Yan, Lizhuang Ma, Jing Shao

ACL 2025
VLSBench: Unveiling Visual Leakage in Multimodal Safety

VLSBench: Unveiling Visual Leakage in Multimodal Safety

Xuhao Hu, Dongrui Liu, Hao Li, Xuanjing Huang, Jing Shao

EMNLP 2024 Oral
ASETF: A Novel Method for Jailbreak Attack on LLMs through Translate Suffix Embeddings

πŸ“ Preprints

πŸ“– Educations

  • 2024.09 - present, Master, Beihang University, Beijing.
  • 2020.09 - 2024.06, Bachelor, Beihang University, Beijing.

πŸŽ– Selected Honors and Awards

  • 2025, National Scholarship in China.
  • 2023, Special Prize (Top 1) in β€œChallenge Cup” Competition of Science Achievement in China.

πŸ’» Internships

  • 2025.08 - present, VLM post-training & evaluation, BAAI, Beijing
  • 2024.07 – 2025.07, LLM and VLM safety, Shanghai AI Lab, Beijing and Shanghai