Sitemap

A list of all the posts and pages found on the site. For you robots out there, an XML version is available for digesting as well.

Pages

Posts

Future Blog Post

less than 1 minute read

Published:

This post will show up by default. To disable scheduling of future posts, edit _config.yml and set future: false.

Blog Post number 4

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 3

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 2

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 1

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Portfolio

Publications

SEUF: Is Unlearning One Expert Enough for Mixture-of-Experts LLMs?

Published in ACL 2025 (63rd Annual Meeting of the Association for Computational Linguistics), 2025

We study machine unlearning in Mixture-of-Experts LLMs and show that the dynamic routing nature introduces unique challenges. SEUF targets specific experts for knowledge removal while stabilizing router behavior, achieving up to 5% improvement in forgetting quality and 35% in model utility while modifying only 0.06% of parameters.

Recommended citation: Haomin Zhuang, Yihua Zhang, Kehan Guo, Jinghan Jia, Gaowen Liu, Sijia Liu, Xiangliang Zhang. (2025). "SEUF: Is Unlearning One Expert Enough for Mixture-of-Experts LLMs?" ACL 2025. https://aclanthology.org/2025.acl-long.424.pdf

Exploring Multi-Temperature Strategies for Token- and Rollout-Level Control in RLVR

Published in arXiv, 2025

We propose multi-temperature sampling strategies for reinforcement learning from verifiable rewards (RLVR), applying higher temperatures to reasoning tokens to encourage exploration while retaining lower temperatures for knowledge tokens to maintain factual correctness, demonstrating improvements across reasoning benchmarks.

Recommended citation: Haomin Zhuang, Yujun Zhou, Taicheng Guo, Yue Huang, Fangxu Liu, Kai Song, Xiangliang Zhang. (2025). "Exploring Multi-Temperature Strategies for Token- and Rollout-Level Control in RLVR." arXiv:2510.08892. https://arxiv.org/pdf/2510.08892
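The token-level idea summarized in this entry, a higher sampling temperature for reasoning tokens and a lower one for knowledge tokens, can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the `is_reasoning_token` flag, and the temperature values 1.2 and 0.6 are all assumptions chosen for illustration, and how a token is classified as reasoning or knowledge is left out entirely.

```python
import math
import random


def softmax_with_temperature(logits, temperature):
    """Turn logits into probabilities after dividing by a temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_token(logits, is_reasoning_token, rng,
                 t_reasoning=1.2, t_knowledge=0.6):
    """Sample one token index, picking the temperature by token type.

    The two default temperatures are hypothetical values, not taken
    from the paper.
    """
    temperature = t_reasoning if is_reasoning_token else t_knowledge
    probs = softmax_with_temperature(logits, temperature)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def entropy(probs):
    """Shannon entropy in nats; higher means a flatter distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


# Toy logits over a four-token vocabulary (made up for the example).
logits = [2.0, 1.0, 0.5, 0.1]
hot = softmax_with_temperature(logits, 1.2)   # reasoning-token setting
cold = softmax_with_temperature(logits, 0.6)  # knowledge-token setting

rng = random.Random(0)
token = sample_token(logits, is_reasoning_token=True, rng=rng)
```

In this toy setting the higher temperature flattens the distribution, so sampling explores more alternatives, while the lower temperature concentrates probability on the top-scoring token.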

Reliable Control-Point Selection for Steering Reasoning in Large Language Models

Published in arXiv, 2026

We find that 93.3% of keyword-detected reasoning boundaries are behaviorally unstable. Our stability filtering method retains only genuine behavioral signals, achieving 0.784 accuracy on MATH-500 (+5.0 over SEAL) with cross-model transfer.

Recommended citation: Haomin Zhuang, Hojun Yoo, Xiaonan Luo, Kehan Guo, Xiangliang Zhang. (2026). "Reliable Control-Point Selection for Steering Reasoning in Large Language Models." arXiv:2604.02113. https://arxiv.org/abs/2604.02113

Talks

Teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.