Publications

publications in reversed chronological order.

2026

  1. Optimal Regret for Policy Optimization in Average Reward MDPs Without Mixing
    William Powell, Jeongyeol Kwon, Qiaomin Xie, and Hanbaek Lyu
    In Reinforcement Learning Conference (RLC) 2026
  2. Lyapunov-Based Sample Complexity Analysis for Weakly-Coupled MDPs
    Tianhao Wu, Matthew Zurek, Weina Wang, and Qiaomin Xie
    In Conference on Learning Theory (COLT) 2026
  3. Wasserstein-p Central Limit Theorem Rates: From Local Dependence to Markov Chains
    Yixuan Zhang, and Qiaomin Xie
    In ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems 2026
    Best Student Paper
    Best Paper Award Finalist
  4. Prelimit Coupling and Steady-State Convergence of Constant-stepsize Nonsmooth Contractive Stochastic Approximation
    Yixuan Zhang, Dongyan (Lucy) Huo, Yudong Chen, and Qiaomin Xie
    Operations Research 2026
  5. Bias and Extrapolation in Markovian Linear Stochastic Approximation with Constant Stepsizes
    Dongyan Huo, Yudong Chen, and Qiaomin Xie
    Mathematics of Operations Research 2026

2025

  1. Offline Actor-Critic for Average Reward MDPs
    William Powell, Jeongyeol Kwon, Qiaomin Xie, and Hanbaek Lyu
    In Advances in Neural Information Processing Systems (NeurIPS) 2025
  2. Contextual Online Pricing with (Biased) Offline Data
    Yixuan Zhang, Ruihao Zhu, and Qiaomin Xie
    In Advances in Neural Information Processing Systems (NeurIPS) 2025
  3. Unichain and aperiodicity are sufficient for asymptotic optimality of average-reward restless bandits
    Yige Hong, Qiaomin Xie, Yudong Chen, and Weina Wang
    Mathematics of Operations Research 2025
  4. Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
    Subhojyoti Mukherjee, Josiah P Hanna, Qiaomin Xie, and Robert Nowak
    In Reinforcement Learning Conference (RLC) 2025
  5. Multi-task Representation Learning for Fixed Budget Pure-Exploration in Linear and Bilinear Bandits
    Subhojyoti Mukherjee, Qiaomin Xie, and Robert Nowak
    In Reinforcement Learning Conference (RLC) 2025
  6. Stable Offline Value Function Learning with Bisimulation-based Representations
    Brahma S. Pavse, Yudong Chen, Qiaomin Xie, and Josiah P. Hanna
    In International Conference on Machine Learning (ICML) 2025
  7. A Piecewise Lyapunov Analysis of Sub-quadratic SGD: Applications to Robust and Quantile Regression
    Yixuan Zhang, Dongyan (Lucy) Huo, Yudong Chen, and Qiaomin Xie
    In ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems 2025
  8. Two-Timescale Linear Stochastic Approximation: Constant Stepsizes Go a Long Way
    Jeongyeol Kwon, Luke Dotson, Yudong Chen, and Qiaomin Xie
    In International Conference on Artificial Intelligence and Statistics (AISTATS) 2025
  9. Coupling-based Convergence Diagnostic and Stepsize Scheme for Stochastic Gradient Descent
    Xiang Li, and Qiaomin Xie
    In AAAI Conference on Artificial Intelligence 2025

2024

  1. The Collusion of Memory and Nonlinearity in Stochastic Approximation With Constant Stepsize
    Dongyan (Lucy) Huo, Yixuan Zhang, Yudong Chen, and Qiaomin Xie
    In Advances in Neural Information Processing Systems (NeurIPS), Spotlight 2024
  2. Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation
    Yixuan Zhang, and Qiaomin Xie
    In Reinforcement Learning Conference (RLC) 2024
  3. Inception: Efficiently Computable Misinformation Attacks on Markov Games
    Jeremy McMahan, Young Wu, Yudong Chen, Xiaojin Zhu, and Qiaomin Xie
    In Reinforcement Learning Conference (RLC) 2024
  4. Roping in Uncertainty: Robustness and Regularization in Markov Games
    Jeremy McMahan, Giovanni Artiglio, and Qiaomin Xie
    In International Conference on Machine Learning (ICML) 2024
  5. Minimally Modifying a Markov Game to Achieve Any Nash Equilibrium and Value
    Young Wu, Jeremy McMahan, Yiding Chen, Yudong Chen, Xiaojin Zhu, and Qiaomin Xie
    In International Conference on Machine Learning (ICML) 2024
  6. Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces
    Brahma S. Pavse, Matthew Zurek, Yudong Chen, Qiaomin Xie, and Josiah P. Hanna
    In International Conference on Machine Learning (ICML) 2024
  7. Near-Optimal Stochastic Bin-Packing in Large Service Systems with Time-Varying Item Sizes
    Yige Hong, Qiaomin Xie, and Weina Wang
    In ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems 2024
  8. Prelimit Coupling and Steady-State Convergence of Constant-stepsize Nonsmooth Contractive Stochastic Approximation
    Yixuan Zhang, Dongyan (Lucy) Huo, Yudong Chen, and Qiaomin Xie
    In ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems 2024
  9. SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits
    Subhojyoti Mukherjee, Qiaomin Xie, Josiah Hanna, and Robert Nowak
    In International Conference on Artificial Intelligence and Statistics (AISTATS) 2024
  10. Stochastic Methods in Variational Inequalities: Ergodicity, Bias and Refinements
    Emmanouil-Vasileios Vlatakis-Gkaragkounis, Angeliki Giannou, Yudong Chen, and Qiaomin Xie
    In International Conference on Artificial Intelligence and Statistics (AISTATS), Oral 2024
  11. Data Poisoning to Fake a Nash Equilibrium in Markov Games
    Young Wu, Jeremy McMahan, Xiaojin Zhu, and Qiaomin Xie
    In AAAI Conference on Artificial Intelligence 2024
  12. Effectiveness of Constant Stepsize in Markovian LSA and Statistical Inference
    Dongyan (Lucy) Huo, Yudong Chen, and Qiaomin Xie
    In AAAI Conference on Artificial Intelligence 2024
  13. Exact Policy Recovery in Offline RL with Both Heavy-Tailed Rewards and Data Corruption
    Yiding Chen, Xuezhou Zhang, Qiaomin Xie, and Xiaojin Zhu
    In AAAI Conference on Artificial Intelligence 2024
  14. Optimal Attack and Defense for Reinforcement Learning
    Jeremy McMahan, Young Wu, Xiaojin Zhu, and Qiaomin Xie
    In AAAI Conference on Artificial Intelligence 2024

2023

  1. Multi-task Representation Learning for Pure Exploration in Bilinear Bandits
    Subhojyoti Mukherjee, Qiaomin Xie, Josiah Hanna, and Robert Nowak
    In Advances in Neural Information Processing Systems (NeurIPS) 2023
  2. Restless Bandits with Average Reward: Breaking the Uniform Global Attractor Assumption
    Yige Hong, Qiaomin Xie, Yudong Chen, and Weina Wang
    In Advances in Neural Information Processing Systems (NeurIPS), Spotlight, 2023
  3. Distributed Threshold-based Offloading for Heterogeneous Mobile Edge Computing
    Xudong Qin, Qiaomin Xie, and Bin Li
    In International Conference on Distributed Computing Systems (ICDCS) 2023
  4. Sharper Model-free Reinforcement Learning for Average-reward Markov Decision Processes
    Zihan Zhang, and Qiaomin Xie
    In Conference on Learning Theory (COLT) 2023
  5. Bias and Extrapolation in Markovian Linear Stochastic Approximation with Constant Stepsizes
    Dongyan (Lucy) Huo, Yudong Chen, and Qiaomin Xie
    In ACM Sigmetrics 2023
  6. Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium
    Qiaomin Xie, Yudong Chen, Zhaoran Wang, and Zhuoran Yang
    Mathematics of Operations Research 2023
  7. Reward Poisoning Attacks on Offline Multi-Agent Reinforcement Learning
    Young Wu, Jermey McMahan, Xiaojin Zhu, and Qiaomin Xie
    In AAAI Conference on Artificial Intelligence 2023

2022

  1. RL-QN: A Reinforcement Learning Framework for Optimal Control of Queueing Systems
    Bai Liu, Qiaomin Xie, and Eytan Modiano
    ACM Transactions on Modeling and Performance Evaluation of Computing Systems 2022
  2. ORSuite: Benchmarking Suite for Sequential Operations Models
    Christopher Archer, Siddhartha Banerjee, Mayleen Cortez, Carrie Rucker, Sean R. Sinclair, Max Solberg, Qiaomin Xie, and Christina Lee Yu
    SIGMETRICS Performance Evaluation Review 2022
  3. Nonasymptotic Analysis of Monte Carlo Tree Search
    Devavrat Shah, Qiaomin Xie, and Zhi Xu
    Operations Research 2022

2021

  1. Learning While Playing in Mean-Field Games: Convergence and Optimality
    Qiaomin Xie, Zhuoran Yang, Zhaoran Wang, and Andreea Minca
    In International Conference on Machine Learning (ICML) 2021
  2. Zero queueing for multi-server jobs
    Weina Wang, Qiaomin Xie, and Mor Harchol-Balter
    In ACM Sigmetrics 2021

2020

  1. Dynamic Regret of Policy Optimization in Non-Stationary Environments
    Yingjie Fei, Zhuoran Yang, Zhaoran Wang, and Qiaomin Xie
    In Advances in Neural Information Processing Systems (NeurIPS) 2020
  2. POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis
    Weichao Mao, Kaiqing Zhang, Qiaomin Xie, and Tamer Basar
    In Advances in Neural Information Processing Systems (NeurIPS) 2020
  3. Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret
    Yingjie Fei, Zhuoran Yang, Yudong Chen, Zhaoran Wang, and Qiaomin Xie
    In Advances in Neural Information Processing Systems (NeurIPS) 2020
  4. Stable Reinforcement Learning with Unbounded State Space
    Devavrat Shah, Qiaomin Xie, and Zhi Xu
    In Learning for Dynamics and Control (L4DC) 2020
  5. On Reinforcement Learning for Turn-based Zero-sum Markov Games
    Devavrat Shah, Varun Somani, Qiaomin Xie, and Zhi Xu
    In Proceedings of the 2020 ACM-IMS on Foundations of Data Science Conference 2020
  6. Learning zero-sum simultaneous-move Markov games using function approximation and correlated equilibrium
    Qiaomin Xie, Yudong Chen, Zhaoran Wang, and Zhuoran Yang
    In Conference on Learning Theory 2020
  7. Non-asymptotic analysis of Monte Carlo tree search
    Devavrat Shah, Qiaomin Xie, and Zhi Xu
    In ACM Sigmetrics 2020
  8. Greed works—online algorithms for unrelated machine stochastic scheduling
    Varun Gupta, Benjamin Moseley, Marc Uetz, and Qiaomin Xie
    Mathematics of operations research 2020

2019

  1. Reinforcement learning for optimal control of queueing systems
    Bai Liu, Qiaomin Xie, and Eytan Modiano
    In 2019 57th Annual Allerton Conference on Communication, Control, and Computing (Allerton) 2019

2018

  1. Q-learning with nearest neighbors
    Devavrat Shah, and Qiaomin Xie
    In Advances in Neural Information Processing Systems (NeurIPS) 2018

2017

  1. Stochastic online scheduling on unrelated machines
    Varun Gupta, Benjamin Moseley, Marc Uetz, and Qiaomin Xie
    In International Conference on Integer Programming and Combinatorial Optimization 2017
  2. Centralized Congestion Control and Scheduling in a Datacenter
    Devavrat Shah, and Qiaomin Xie
    arXiv preprint arXiv:1710.02548 2017

2016

  1. Scheduling with Multi-level Data Locality: Throughput and Heavy-Traffic Optimality
    Qiaomin Xie, Ali Yekkehkhany, and Yi Lu
    In 2016 IEEE Conference on Computer Communications (INFOCOM) 2016
  2. Pandas: robust locality-aware scheduling with stochastic delay optimality
    Qiaomin Xie, Mayank Pundir, Yi Lu, Cristina L Abad, and Roy H Campbell
    IEEE/ACM Transactions on Networking 2016

2015

  1. Power of d Choices for Large-Scale Bin Packing: A Loss Model
    Qiaomin Xie, Xiaobo Dong, Yi Lu, and R Srikant
    In ACM Sigmetrics 2015
  2. Priority algorithm for near-data scheduling: Throughput and heavy-traffic optimality
    Qiaomin Xie, and Yi Lu
    In 2015 IEEE Conference on Computer Communications (INFOCOM) 2015

2012

  1. Degree-guided map-reduce task assignment with data locality constraint
    Qiaomin Xie, and Yi Lu
    In 2012 IEEE International Symposium on Information Theory Proceedings 2012

2011

  1. Join-idle-queue: A novel load balancing algorithm for dynamically scalable web services
    Yi Lu, Qiaomin Xie, Gabriel Kliot, Alan Geller, James R Larus, and Albert Greenberg
    Performance Evaluation
    International Symposium on Computer Performance, Modeling, Measurements, and Evaluation (IFIP Performance)
    2011
    Best Paper Award