Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Page Not Found

Page not found. Your pixels are in another canvas.

Haomin Zhuang

Ph.D. student at the University of Notre Dame working on LLM agents, safety, reasoning, and human-agent collaboration

Jupyter notebook markdown generator

Posts

Future Blog Post

less than 1 minute read

Published: January 01, 2199

This post will show up by default. To disable scheduling of future posts, edit config.yml and set future: false.

Blog Post number 4

less than 1 minute read

Published: August 14, 2015

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 3

less than 1 minute read

Published: August 14, 2014

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 2

less than 1 minute read

Published: August 14, 2013

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 1

less than 1 minute read

Published: August 14, 2012

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

portfolio

[CVPR Workshop’23] A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion

Haomin Zhuang, Yihua Zhang, and Sijia Liu

[ICLR’24] Backdoor Federated Learning by Poisoning Backdoor-Critical Layers

Haomin Zhuang, Mingxian Yu, Hao Wang, Yang Hua, Jian Li, Xu Yuan

[ACL’25] SEUF: Is Unlearning One Expert Enough for Mixture-of-Experts LLMs?

Haomin Zhuang, Yihua Zhang, Kehan Guo, Jinghan Jia, Gaowen Liu, Sijia Liu, Xiangliang Zhang

[Preprint] Exploring Multi-Temperature Strategies for Token- and Rollout-Level Control in RLVR

Haomin Zhuang, Yujun Zhou, Taicheng Guo, Yue Huang, Fangxu Liu, Kai Song, Xiangliang Zhang

[Preprint] SenseMath: Do LLMs Have Number Sense? Evaluating Shortcut Use, Judgment, and Generation

Haomin Zhuang, Xiangqi Wang, Yili Shen, Ying Cheng, Xiangliang Zhang

[Preprint] Reliable Control-Point Selection for Steering Reasoning in Large Language Models

Haomin Zhuang, Hojun Yoo, Xiaonan Luo, Kehan Guo, Xiangliang Zhang

[Preprint] AgentTrap: Measuring Runtime Trust Failures in Third-Party Agent Skills

Haomin Zhuang, Hanwen Xing, Yujun Zhou, Yuchen Ma, Yue Huang, Yili Shen, Yufei Han, Xiangliang Zhang

[CAIS’26 Demo] AgentClick: A Human-in-the-Loop Review UI for Autonomous Agents

Haomin Zhuang, Hanwen Xing, Xiangliang Zhang

publications

SEUF: Is Unlearning One Expert Enough for Mixture-of-Experts LLMs?

Published in ACL 2025 (63rd Annual Meeting of the Association for Computational Linguistics), 2025

We study machine unlearning in Mixture-of-Experts LLMs and show that the dynamic routing nature introduces unique challenges. SEUF targets specific experts for knowledge removal while stabilizing router behavior, achieving up to 5% improvement in forgetting quality and 35% in model utility while modifying only 0.06% of parameters.

Recommended citation: Haomin Zhuang, Yihua Zhang, Kehan Guo, Jinghan Jia, Gaowen Liu, Sijia Liu, Xiangliang Zhang. (2025). "SEUF: Is Unlearning One Expert Enough for Mixture-of-Experts LLMs?" ACL 2025. https://aclanthology.org/2025.acl-long.424.pdf

Exploring Multi-Temperature Strategies for Token- and Rollout-Level Control in RLVR

Published in arXiv, 2025

We propose multi-temperature sampling strategies for reinforcement learning from verifiable rewards (RLVR), applying higher temperatures to reasoning tokens to encourage exploration while retaining lower temperatures for knowledge tokens to maintain factual correctness, demonstrating improvements across reasoning benchmarks.

Recommended citation: Haomin Zhuang, Yujun Zhou, Taicheng Guo, Yue Huang, Fangxu Liu, Kai Song, Xiangliang Zhang. (2025). "Exploring Multi-Temperature Strategies for Token- and Rollout-Level Control in RLVR." arXiv:2510.08892. https://arxiv.org/pdf/2510.08892

Reliable Control-Point Selection for Steering Reasoning in Large Language Models

Published in arXiv, 2026

We find that 93.3% of keyword-detected reasoning boundaries are behaviorally unstable. Our stability filtering method retains only genuine behavioral signals, achieving 0.784 accuracy on MATH-500 (+5.0 over SEAL) with cross-model transfer.

Recommended citation: Haomin Zhuang, Hojun Yoo, Xiaonan Luo, Kehan Guo, Xiangliang Zhang. (2026). "Reliable Control-Point Selection for Steering Reasoning in Large Language Models." arXiv:2604.02113. https://arxiv.org/abs/2604.02113

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.

Haomin Zhuang

Sitemap

Pages

Posts

portfolio

publications

talks

teaching