Junbin Qiu (邱俊斌)

Qjbtiger - Overview

A Ph.D. student majoring in artificial intelligence at Hong Kong University of Science and Technology (Guangzhou). - Qjbtiger

I am currently a first-year Ph.D. student in the Artificial Intelligence Thrust, Information Hub, at The Hong Kong University of Science and Technology (Guangzhou), under the supervision of Prof. Yao Shu. Prior to joining HKUST(GZ), I received my master's and bachelor's degree in Physics from Sun Yat-sen University, where I was advised by Prof. Haiping Huang. My research focuses on developing theoretically grounded and practically efficient algorithms for modern AI systems, with an emphasis on zeroth-order optimization, efficient post-training of large language models, post-training quantization, and on-device AI. My recent work has been published in venues including ICML, Physical Review Research, Physical Review E.

Research Interests

Variance-reduced zeroth-order optimization for black-box objectives and on-device fine-tuning of large language models.

Post-training quantization for deploying large models on edge devices.

Efficient post-training methods for LLMs in both online and offline settings.

On-device AI algorithms that balance performance, efficiency, and privacy.

Manuscripts Under Review

APEX: Accuracy Projection from Verifier-Labeled Experience for Offline RLVR Junbin Qiu, Yao Shu Submitted to NeurIPS 2026

Predicting Quantization Price for Selecting PTQ Configurations Before Deployment Junbin Qiu, Jian Mu, Weitong Zhang, Yao Shu Submitted to NeurIPS 2026

The Cost of Mismatch: Noise Amplification in Zeroth-Order Reinforcement Learning Lianming Chen, Junbin Qiu, Chenxing Wei, Yao Shu, Kai He Submitted to NeurIPS 2026

Selected Publications

Revisiting Zeroth-Order Hessian Approximation: A Single-Step Policy Optimization Lens Junbin Qiu, Zhaowei Hong, Renzhe Xu, Yao Shu ICML 2026

Zeroth-Order Optimization is Secretly Single-Step Policy Optimization Junbin Qiu, Xiangda Yan, Yongjie Yang, Yao Shu ICML 2025 Workshop

An optimization-based equilibrium measure describes non-equilibrium steady state dynamics: application to edge of chaos Junbin Qiu, Haiping Huang Communications in Theoretical Physics, 2024

Meta predictive learning model of languages in neural circuits Chan Li, Junbin Qiu, Haiping Huang Physical Review E, 2024

Equivalence between belief propagation instability and transition to replica symmetry breaking in perceptron learning systems Yang Zhao, Junbin Qiu, Mingshan Xie, Haiping Huang Physical Review Research, 2022

Education

Ph.D. student in Artificial Intelligence (2025.9 - now) The Hong Kong University of Science and Technology (Guangzhou)

Master's degree in Statistical Physics (2021.9 - 2024.6) Sun Yat-sen University

Bachelor's degree in Physics (2017.9 - 2021.6) Sun Yat-sen University

Activities

Talk: A language model based on free energy minimization Swarma pattern study group, 2024.6.15

Contact

I am open to research discussions and collaboration opportunities around theoretical machine learning, zeroth-order optimization, efficient LLM post-training, and model deployment.

Email: jqiu236 [at] connect.hkust-gz.edu.cn