Publications

Publications in reverse chronological order.

  1. Bias and Extrapolation in Markovian Linear Stochastic Approximation with Constant Stepsizes
    Dongyan Huo, Yudong Chen, and Qiaomin Xie
    Mathematics of Operations Research 2026
  2. Wasserstein-p Central Limit Theorem Rates: From Local Dependence to Markov Chains
    Yixuan Zhang and Qiaomin Xie
    2026
  3. Offline Actor-Critic for Average Reward MDPs
    William Powell, Jeongyeol Kwon, Qiaomin Xie, and Hanbaek Lyu
    In Advances in Neural Information Processing Systems (NeurIPS) 2025
  4. Contextual Online Pricing with (Biased) Offline Data
    Yixuan Zhang, Ruihao Zhu, and Qiaomin Xie
    In Advances in Neural Information Processing Systems (NeurIPS) 2025
  5. Unichain and aperiodicity are sufficient for asymptotic optimality of average-reward restless bandits
    Yige Hong, Qiaomin Xie, Yudong Chen, and Weina Wang
    Mathematics of Operations Research 2025
  6. Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
    Subhojyoti Mukherjee, Josiah P Hanna, Qiaomin Xie, and Robert Nowak
    In Reinforcement Learning Conference (RLC) 2025
  7. Multi-task Representation Learning for Fixed Budget Pure-Exploration in Linear and Bilinear Bandits
    Subhojyoti Mukherjee, Qiaomin Xie, and Robert Nowak
    In Reinforcement Learning Conference (RLC) 2025
  8. Stable Offline Value Function Learning with Bisimulation-based Representations
    Brahma S. Pavse, Yudong Chen, Qiaomin Xie, and Josiah P. Hanna
    In International Conference on Machine Learning (ICML) 2025
  9. A Piecewise Lyapunov Analysis of Sub-quadratic SGD: Applications to Robust and Quantile Regression
    Yixuan Zhang, Dongyan (Lucy) Huo, Yudong Chen, and Qiaomin Xie
    In ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems 2025
  10. Two-Timescale Linear Stochastic Approximation: Constant Stepsizes Go a Long Way
    Jeongyeol Kwon, Luke Dotson, Yudong Chen, and Qiaomin Xie
    In International Conference on Artificial Intelligence and Statistics (AISTATS) 2025
  11. Coupling-based Convergence Diagnostic and Stepsize Scheme for Stochastic Gradient Descent
    Xiang Li and Qiaomin Xie
    In AAAI Conference on Artificial Intelligence 2025
  12. The Collusion of Memory and Nonlinearity in Stochastic Approximation With Constant Stepsize
    Dongyan (Lucy) Huo, Yixuan Zhang, Yudong Chen, and Qiaomin Xie
    In Advances in Neural Information Processing Systems (NeurIPS), Spotlight 2024
  13. Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation
    Yixuan Zhang and Qiaomin Xie
    In Reinforcement Learning Conference (RLC) 2024
  14. Inception: Efficiently Computable Misinformation Attacks on Markov Games
    Jeremy McMahan, Young Wu, Yudong Chen, Xiaojin Zhu, and Qiaomin Xie
    In Reinforcement Learning Conference (RLC) 2024
  15. Roping in Uncertainty: Robustness and Regularization in Markov Games
    Jeremy McMahan, Giovanni Artiglio, and Qiaomin Xie
    In International Conference on Machine Learning (ICML) 2024
  16. Minimally Modifying a Markov Game to Achieve Any Nash Equilibrium and Value
    Young Wu, Jeremy McMahan, Yiding Chen, Yudong Chen, Xiaojin Zhu, and Qiaomin Xie
    In International Conference on Machine Learning (ICML) 2024
  17. Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces
    Brahma S. Pavse, Matthew Zurek, Yudong Chen, Qiaomin Xie, and Josiah P. Hanna
    In International Conference on Machine Learning (ICML) 2024
  18. Near-Optimal Stochastic Bin-Packing in Large Service Systems with Time-Varying Item Sizes
    Yige Hong, Qiaomin Xie, and Weina Wang
    In ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems 2024
  19. Prelimit Coupling and Steady-State Convergence of Constant-stepsize Nonsmooth Contractive Stochastic Approximation
    Yixuan Zhang, Dongyan (Lucy) Huo, Yudong Chen, and Qiaomin Xie
    In ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems 2024
  20. SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits
    Subhojyoti Mukherjee, Qiaomin Xie, Josiah Hanna, and Robert Nowak
    In International Conference on Artificial Intelligence and Statistics (AISTATS) 2024
  21. Stochastic Methods in Variational Inequalities: Ergodicity, Bias and Refinements
    Emmanouil-Vasileios Vlatakis-Gkaragkounis, Angeliki Giannou, Yudong Chen, and Qiaomin Xie
    In International Conference on Artificial Intelligence and Statistics (AISTATS), Oral 2024
  22. Data Poisoning to Fake a Nash Equilibrium in Markov Games
    Young Wu, Jeremy McMahan, Xiaojin Zhu, and Qiaomin Xie
    In AAAI Conference on Artificial Intelligence 2024
  23. Effectiveness of Constant Stepsize in Markovian LSA and Statistical Inference
    Dongyan (Lucy) Huo, Yudong Chen, and Qiaomin Xie
    In AAAI Conference on Artificial Intelligence 2024
  24. Exact Policy Recovery in Offline RL with Both Heavy-Tailed Rewards and Data Corruption
    Yiding Chen, Xuezhou Zhang, Qiaomin Xie, and Xiaojin Zhu
    In AAAI Conference on Artificial Intelligence 2024
  25. Optimal Attack and Defense for Reinforcement Learning
    Jeremy McMahan, Young Wu, Xiaojin Zhu, and Qiaomin Xie
    In AAAI Conference on Artificial Intelligence 2024
  26. Multi-task Representation Learning for Pure Exploration in Bilinear Bandits
    Subhojyoti Mukherjee, Qiaomin Xie, Josiah Hanna, and Robert Nowak
    In Advances in Neural Information Processing Systems (NeurIPS) 2023
  27. Restless Bandits with Average Reward: Breaking the Uniform Global Attractor Assumption
    Yige Hong, Qiaomin Xie, Yudong Chen, and Weina Wang
    In Advances in Neural Information Processing Systems (NeurIPS), Spotlight 2023
  28. Distributed Threshold-based Offloading for Heterogeneous Mobile Edge Computing
    Xudong Qin, Qiaomin Xie, and Bin Li
    In International Conference on Distributed Computing Systems (ICDCS) 2023
  29. Sharper Model-free Reinforcement Learning for Average-reward Markov Decision Processes
    Zihan Zhang and Qiaomin Xie
    In Conference on Learning Theory (COLT) 2023
  30. Bias and Extrapolation in Markovian Linear Stochastic Approximation with Constant Stepsizes
    Dongyan (Lucy) Huo, Yudong Chen, and Qiaomin Xie
    In ACM SIGMETRICS 2023
  31. Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium
    Qiaomin Xie, Yudong Chen, Zhaoran Wang, and Zhuoran Yang
    Mathematics of Operations Research 2023
  32. Reward Poisoning Attacks on Offline Multi-Agent Reinforcement Learning
    Young Wu, Jeremy McMahan, Xiaojin Zhu, and Qiaomin Xie
    In AAAI Conference on Artificial Intelligence 2023
  33. RL-QN: A Reinforcement Learning Framework for Optimal Control of Queueing Systems
    Bai Liu, Qiaomin Xie, and Eytan Modiano
    ACM Transactions on Modeling and Performance Evaluation of Computing Systems 2022
  34. ORSuite: Benchmarking Suite for Sequential Operations Models
    Christopher Archer, Siddhartha Banerjee, Mayleen Cortez, Carrie Rucker, Sean R. Sinclair, Max Solberg, Qiaomin Xie, and Christina Lee Yu
    SIGMETRICS Performance Evaluation Review 2022
  35. Nonasymptotic Analysis of Monte Carlo Tree Search
    Devavrat Shah, Qiaomin Xie, and Zhi Xu
    Operations Research 2022
  36. Learning While Playing in Mean-Field Games: Convergence and Optimality
    Qiaomin Xie, Zhuoran Yang, Zhaoran Wang, and Andreea Minca
    In International Conference on Machine Learning (ICML) 2021
  37. Zero queueing for multi-server jobs
    Weina Wang, Qiaomin Xie, and Mor Harchol-Balter
    In ACM SIGMETRICS 2021
  38. Dynamic Regret of Policy Optimization in Non-Stationary Environments
    Yingjie Fei, Zhuoran Yang, Zhaoran Wang, and Qiaomin Xie
    In Advances in Neural Information Processing Systems (NeurIPS) 2020
  39. POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis
    Weichao Mao, Kaiqing Zhang, Qiaomin Xie, and Tamer Basar
    In Advances in Neural Information Processing Systems (NeurIPS) 2020
  40. Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret
    Yingjie Fei, Zhuoran Yang, Yudong Chen, Zhaoran Wang, and Qiaomin Xie
    In Advances in Neural Information Processing Systems (NeurIPS) 2020
  41. Stable Reinforcement Learning with Unbounded State Space
    Devavrat Shah, Qiaomin Xie, and Zhi Xu
    In Learning for Dynamics and Control (L4DC) 2020
  42. On Reinforcement Learning for Turn-based Zero-sum Markov Games
    Devavrat Shah, Varun Somani, Qiaomin Xie, and Zhi Xu
    In Proceedings of the 2020 ACM-IMS on Foundations of Data Science Conference 2020
  43. Learning zero-sum simultaneous-move Markov games using function approximation and correlated equilibrium
    Qiaomin Xie, Yudong Chen, Zhaoran Wang, and Zhuoran Yang
    In Conference on Learning Theory (COLT) 2020
  44. Non-asymptotic analysis of Monte Carlo tree search
    Devavrat Shah, Qiaomin Xie, and Zhi Xu
    In ACM SIGMETRICS 2020
  45. Greed works—online algorithms for unrelated machine stochastic scheduling
    Varun Gupta, Benjamin Moseley, Marc Uetz, and Qiaomin Xie
    Mathematics of Operations Research 2020
  46. Reinforcement learning for optimal control of queueing systems
    Bai Liu, Qiaomin Xie, and Eytan Modiano
    In 2019 57th Annual Allerton Conference on Communication, Control, and Computing (Allerton) 2019
  47. Q-learning with nearest neighbors
    Devavrat Shah and Qiaomin Xie
    In Advances in Neural Information Processing Systems (NeurIPS) 2018
  48. Stochastic online scheduling on unrelated machines
    Varun Gupta, Benjamin Moseley, Marc Uetz, and Qiaomin Xie
    In International Conference on Integer Programming and Combinatorial Optimization 2017
  49. Centralized Congestion Control and Scheduling in a Datacenter
    Devavrat Shah and Qiaomin Xie
    arXiv preprint arXiv:1710.02548 2017
  50. Scheduling with Multi-level Data Locality: Throughput and Heavy-Traffic Optimality
    Qiaomin Xie, Ali Yekkehkhany, and Yi Lu
    In 2016 IEEE Conference on Computer Communications (INFOCOM) 2016
  51. Pandas: robust locality-aware scheduling with stochastic delay optimality
    Qiaomin Xie, Mayank Pundir, Yi Lu, Cristina L Abad, and Roy H Campbell
    IEEE/ACM Transactions on Networking 2016
  52. Power of d Choices for Large-Scale Bin Packing: A Loss Model
    Qiaomin Xie, Xiaobo Dong, Yi Lu, and R Srikant
    In ACM SIGMETRICS 2015
  53. Priority algorithm for near-data scheduling: Throughput and heavy-traffic optimality
    Qiaomin Xie and Yi Lu
    In 2015 IEEE Conference on Computer Communications (INFOCOM) 2015
  54. Degree-guided map-reduce task assignment with data locality constraint
    Qiaomin Xie and Yi Lu
    In 2012 IEEE International Symposium on Information Theory Proceedings 2012
  55. Join-idle-queue: A novel load balancing algorithm for dynamically scalable web services
    Yi Lu, Qiaomin Xie, Gabriel Kliot, Alan Geller, James R Larus, and Albert Greenberg
    Performance Evaluation 2011