Publications

publications in reversed chronological order.

  1. Unichain and aperiodicity are sufficient for asymptotic optimality of average-reward restless bandits
    Yige Hong, Qiaomin Xie, Yudong Chen, and Weina Wang
    arXiv preprint arXiv:2402.05689 2024
  2. The Collusion of Memory and Nonlinearity in Stochastic Approximation With Constant Stepsize
    Dongyan Huo, Yixuan Zhang, Yudong Chen, and Qiaomin Xie
    In Advances in Neural Information Processing Systems (NeurIPS), Spotlight, 2024
  3. Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
    Subhojyoti Mukherjee, Josiah P Hanna, Qiaomin Xie, and Robert Nowak
    arXiv preprint arXiv:2406.05064 2024
  4. Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation
    Yixuan Zhang, and Qiaomin Xie
    In Reinforcement Learning Conference (RLC) 2024
  5. Inception: Efficiently Computable Misinformation Attacks on Markov Games
    Jeremy McMahan, Young Wu, Yudong Chen, Xiaojin Zhu, and Qiaomin Xie
    In Reinforcement Learning Conference (RLC) 2024
  6. Roping in Uncertainty: Robustness and Regularization in Markov Games
    Jeremy McMahan, Giovanni Artiglio, and Qiaomin Xie
    In International Conference on Machine Learning (ICML) 2024
  7. Minimally Modifying a Markov Game to Achieve Any Nash Equilibrium and Value
    Young Wu, Jeremy McMahan, Yiding Chen, Yudong Chen, Xiaojin Zhu, and Qiaomin Xie
    In International Conference on Machine Learning (ICML) 2024
  8. Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces
    Brahma S. Pavse, Matthew Zurek, Yudong Chen, Qiaomin Xie, and Josiah P. Hanna
    In International Conference on Machine Learning (ICML) 2024
  9. Near-Optimal Stochastic Bin-Packing in Large Service Systems with Time-Varying Item Sizes
    Yige Hong, Qiaomin Xie, and Weina Wang
    In ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems 2024
  10. Prelimit Coupling and Steady-State Convergence of Constant-stepsize Nonsmooth Contractive Stochastic Approximation
    Yixuan Zhang, Lucy Huo, Yudong Chen, and Qiaomin Xie
    In ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems 2024
  11. SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits
    Subhojyoti Mukherjee, Qiaomin Xie, Josiah Hanna, and Robert Nowak
    In International Conference on Artificial Intelligence and Statistics (AISTATS) 2024
  12. Stochastic Methods in Variational Inequalities: Ergodicity, Bias and Refinements
    Emmanouil-Vasileios Vlatakis-Gkaragkounis, Angeliki Giannou, Yudong Chen, and Qiaomin Xie
    In International Conference on Artificial Intelligence and Statistics (AISTATS), Oral 2024
  13. Data Poisoning to Fake a Nash Equilibrium in Markov Games
    Young Wu, Jeremy McMahan, Xiaojin Zhu, and Qiaomin Xie
    In AAAI Conference on Artificial Intelligence 2024
  14. Effectiveness of Constant Stepsize in Markovian LSA and Statistical Inference
    Dongyan Huo, Yudong Chen, and Qiaomin Xie
    In AAAI Conference on Artificial Intelligence 2024
  15. Exact Policy Recovery in Offline RL with Both Heavy-Tailed Rewards and Data Corruption
    Yiding Chen, Xuezhou Zhang, Qiaomin Xie, and Xiaojin Zhu
    In AAAI Conference on Artificial Intelligence 2024
  16. Optimal Attack and Defense for Reinforcement Learning
    Jeremy McMahan, Young Wu, Xiaojin Zhu, and Qiaomin Xie
    In AAAI Conference on Artificial Intelligence 2024
  17. Multi-task Representation Learning for Pure Exploration in Bilinear Bandits
    Subhojyoti Mukherjee, Qiaomin Xie, Josiah Hanna, and Robert Nowak
    In Advances in Neural Information Processing Systems (NeurIPS) 2023
  18. Restless Bandits with Average Reward: Breaking the Uniform Global Attractor Assumption
    Yige Hong, Qiaomin Xie, Yudong Chen, and Weina Wang
    In Advances in Neural Information Processing Systems (NeurIPS), Spotlight, 2023
  19. Distributed Threshold-based Offloading for Heterogeneous Mobile Edge Computing
    Xudong Qin, Qiaomin Xie, and Bin Li
    In International Conference on Distributed Computing Systems (ICDCS) 2023
  20. Sharper Model-free Reinforcement Learning for Average-reward Markov Decision Processes
    Zihan Zhang, and Qiaomin Xie
    In Conference on Learning Theory (COLT) 2023
  21. Bias and Extrapolation in Markovian Linear Stochastic Approximation with Constant Stepsizes
    Dongyan Huo, Yudong Chen, and Qiaomin Xie
    In ACM Sigmetrics 2023
  22. Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium
    Qiaomin Xie, Yudong Chen, Zhaoran Wang, and Zhuoran Yang
    Mathematics of Operations Research 2023
  23. Reward Poisoning Attacks on Offline Multi-Agent Reinforcement Learning
    Young Wu, Jermey McMahan, Xiaojin Zhu, and Qiaomin Xie
    In AAAI Conference on Artificial Intelligence 2023
  24. RL-QN: A Reinforcement Learning Framework for Optimal Control of Queueing Systems
    Bai Liu, Qiaomin Xie, and Eytan Modiano
    ACM Transactions on Modeling and Performance Evaluation of Computing Systems 2022
  25. ORSuite: Benchmarking Suite for Sequential Operations Models
    Christopher Archer, Siddhartha Banerjee, Mayleen Cortez, Carrie Rucker, Sean R. Sinclair, Max Solberg, Qiaomin Xie, and Christina Lee Yu
    SIGMETRICS Performance Evaluation Review 2022
  26. Nonasymptotic Analysis of Monte Carlo Tree Search
    Devavrat Shah, Qiaomin Xie, and Zhi Xu
    Operations Research 2022
  27. Learning While Playing in Mean-Field Games: Convergence and Optimality
    Qiaomin Xie, Zhuoran Yang, Zhaoran Wang, and Andreea Minca
    In International Conference on Machine Learning (ICML) 2021
  28. Zero queueing for multi-server jobs
    Weina Wang, Qiaomin Xie, and Mor Harchol-Balter
    In ACM Sigmetrics 2021
  29. Dynamic Regret of Policy Optimization in Non-Stationary Environments
    Yingjie Fei, Zhuoran Yang, Zhaoran Wang, and Qiaomin Xie
    In Advances in Neural Information Processing Systems (NeurIPS) 2020
  30. POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis
    Weichao Mao, Kaiqing Zhang, Qiaomin Xie, and Tamer Basar
    In Advances in Neural Information Processing Systems (NeurIPS) 2020
  31. Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret
    Yingjie Fei, Zhuoran Yang, Yudong Chen, Zhaoran Wang, and Qiaomin Xie
    In Advances in Neural Information Processing Systems (NeurIPS) 2020
  32. Stable Reinforcement Learning with Unbounded State Space
    Devavrat Shah, Qiaomin Xie, and Zhi Xu
    In Learning for Dynamics and Control (L4DC) 2020
  33. On Reinforcement Learning for Turn-based Zero-sum Markov Games
    Devavrat Shah, Varun Somani, Qiaomin Xie, and Zhi Xu
    In Proceedings of the 2020 ACM-IMS on Foundations of Data Science Conference 2020
  34. Learning zero-sum simultaneous-move Markov games using function approximation and correlated equilibrium
    Qiaomin Xie, Yudong Chen, Zhaoran Wang, and Zhuoran Yang
    In Conference on Learning Theory 2020
  35. Non-asymptotic analysis of Monte Carlo tree search
    Devavrat Shah, Qiaomin Xie, and Zhi Xu
    In ACM Sigmetrics 2020
  36. Greed works—online algorithms for unrelated machine stochastic scheduling
    Varun Gupta, Benjamin Moseley, Marc Uetz, and Qiaomin Xie
    Mathematics of operations research 2020
  37. Reinforcement learning for optimal control of queueing systems
    Bai Liu, Qiaomin Xie, and Eytan Modiano
    In 2019 57th Annual Allerton Conference on Communication, Control, and Computing (Allerton) 2019
  38. Q-learning with nearest neighbors
    Devavrat Shah, and Qiaomin Xie
    In Advances in Neural Information Processing Systems (NeurIPS) 2018
  39. Stochastic online scheduling on unrelated machines
    Varun Gupta, Benjamin Moseley, Marc Uetz, and Qiaomin Xie
    In International Conference on Integer Programming and Combinatorial Optimization 2017
  40. Centralized Congestion Control and Scheduling in a Datacenter
    Devavrat Shah, and Qiaomin Xie
    arXiv preprint arXiv:1710.02548 2017
  41. Scheduling with Multi-level Data Locality: Throughput and Heavy-Traffic Optimality
    Qiaomin Xie, Ali Yekkehkhany, and Yi Lu
    In 2016 IEEE Conference on Computer Communications (INFOCOM) 2016
  42. Pandas: robust locality-aware scheduling with stochastic delay optimality
    Qiaomin Xie, Mayank Pundir, Yi Lu, Cristina L Abad, and Roy H Campbell
    IEEE/ACM Transactions on Networking 2016
  43. Power of d Choices for Large-Scale Bin Packing: A Loss Model
    Qiaomin Xie, Xiaobo Dong, Yi Lu, and R Srikant
    In ACM Sigmetrics 2015
  44. Priority algorithm for near-data scheduling: Throughput and heavy-traffic optimality
    Qiaomin Xie, and Yi Lu
    In 2015 IEEE Conference on Computer Communications (INFOCOM) 2015
  45. Degree-guided map-reduce task assignment with data locality constraint
    Qiaomin Xie, and Yi Lu
    In 2012 IEEE International Symposium on Information Theory Proceedings 2012
  46. Join-idle-queue: A novel load balancing algorithm for dynamically scalable web services
    Yi Lu, Qiaomin Xie, Gabriel Kliot, Alan Geller, James R Larus, and Albert Greenberg
    Performance Evaluation 2011