Publications
publications in reversed chronological order.
-
Unichain and aperiodicity are sufficient for asymptotic optimality of average-reward restless banditsarXiv preprint arXiv:2402.05689 2024
-
The Collusion of Memory and Nonlinearity in Stochastic Approximation With Constant StepsizeIn Advances in Neural Information Processing Systems (NeurIPS), Spotlight, 2024
-
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit LearningarXiv preprint arXiv:2406.05064 2024
-
Constant Stepsize Q-learning: Distributional Convergence, Bias and ExtrapolationIn Reinforcement Learning Conference (RLC) 2024
-
Inception: Efficiently Computable Misinformation Attacks on Markov GamesIn Reinforcement Learning Conference (RLC) 2024
-
Roping in Uncertainty: Robustness and Regularization in Markov GamesIn International Conference on Machine Learning (ICML) 2024
-
Minimally Modifying a Markov Game to Achieve Any Nash Equilibrium and ValueIn International Conference on Machine Learning (ICML) 2024
-
Learning to Stabilize Online Reinforcement Learning in Unbounded State SpacesIn International Conference on Machine Learning (ICML) 2024
-
Near-Optimal Stochastic Bin-Packing in Large Service Systems with Time-Varying Item SizesIn ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems 2024
-
Prelimit Coupling and Steady-State Convergence of Constant-stepsize Nonsmooth Contractive Stochastic ApproximationIn ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems 2024
-
SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic BanditsIn International Conference on Artificial Intelligence and Statistics (AISTATS) 2024
-
Stochastic Methods in Variational Inequalities: Ergodicity, Bias and RefinementsIn International Conference on Artificial Intelligence and Statistics (AISTATS), Oral 2024
-
Data Poisoning to Fake a Nash Equilibrium in Markov GamesIn AAAI Conference on Artificial Intelligence 2024
-
Effectiveness of Constant Stepsize in Markovian LSA and Statistical InferenceIn AAAI Conference on Artificial Intelligence 2024
-
Exact Policy Recovery in Offline RL with Both Heavy-Tailed Rewards and Data CorruptionIn AAAI Conference on Artificial Intelligence 2024
-
Optimal Attack and Defense for Reinforcement LearningIn AAAI Conference on Artificial Intelligence 2024
-
Multi-task Representation Learning for Pure Exploration in Bilinear BanditsIn Advances in Neural Information Processing Systems (NeurIPS) 2023
-
Restless Bandits with Average Reward: Breaking the Uniform Global Attractor AssumptionIn Advances in Neural Information Processing Systems (NeurIPS), Spotlight, 2023
-
Distributed Threshold-based Offloading for Heterogeneous Mobile Edge ComputingIn International Conference on Distributed Computing Systems (ICDCS) 2023
-
Sharper Model-free Reinforcement Learning for Average-reward Markov Decision ProcessesIn Conference on Learning Theory (COLT) 2023
-
Bias and Extrapolation in Markovian Linear Stochastic Approximation with Constant StepsizesIn ACM Sigmetrics 2023
-
Reward Poisoning Attacks on Offline Multi-Agent Reinforcement LearningIn AAAI Conference on Artificial Intelligence 2023
-
Learning While Playing in Mean-Field Games: Convergence and OptimalityIn International Conference on Machine Learning (ICML) 2021
-
Stable Reinforcement Learning with Unbounded State SpaceIn Learning for Dynamics and Control (L4DC) 2020
-
On Reinforcement Learning for Turn-based Zero-sum Markov GamesIn Proceedings of the 2020 ACM-IMS on Foundations of Data Science Conference 2020
-
Learning zero-sum simultaneous-move Markov games using function approximation and correlated equilibriumIn Conference on Learning Theory 2020
-
Greed works—online algorithms for unrelated machine stochastic schedulingMathematics of operations research 2020
-
Reinforcement learning for optimal control of queueing systemsIn 2019 57th Annual Allerton Conference on Communication, Control, and Computing (Allerton) 2019
-
Stochastic online scheduling on unrelated machinesIn International Conference on Integer Programming and Combinatorial Optimization 2017
-
Centralized Congestion Control and Scheduling in a DatacenterarXiv preprint arXiv:1710.02548 2017
-
Scheduling with Multi-level Data Locality: Throughput and Heavy-Traffic OptimalityIn 2016 IEEE Conference on Computer Communications (INFOCOM) 2016
-
Pandas: robust locality-aware scheduling with stochastic delay optimalityIEEE/ACM Transactions on Networking 2016
-
Priority algorithm for near-data scheduling: Throughput and heavy-traffic optimalityIn 2015 IEEE Conference on Computer Communications (INFOCOM) 2015
-
Degree-guided map-reduce task assignment with data locality constraintIn 2012 IEEE International Symposium on Information Theory Proceedings 2012
-
Join-idle-queue: A novel load balancing algorithm for dynamically scalable web servicesPerformance Evaluation 2011