Showing 1–50 of 80 results for author: Wierman, A

  1. arXiv:2409.20534  [pdf, other]

    cs.LG math.OC

    End-to-End Conformal Calibration for Optimization Under Uncertainty

    Authors: Christopher Yeh, Nicolas Christianson, Alan Wu, Adam Wierman, Yisong Yue

    Abstract: Machine learning can significantly improve performance for decision-making under uncertainty in a wide range of domains. However, ensuring robustness guarantees requires well-calibrated uncertainty estimates, which can be difficult to achieve in high-capacity prediction models such as deep neural networks. Moreover, in high-dimensional settings, there may be many valid uncertainty estimates, each…

    Submitted 30 September, 2024; originally announced September 2024.

  2. arXiv:2409.20067  [pdf, ps, other]

    cs.LG cs.GT cs.MA stat.ML

    Breaking the Curse of Multiagency in Robust Multi-Agent Reinforcement Learning

    Authors: Laixi Shi, Jingchu Gai, Eric Mazumdar, Yuejie Chi, Adam Wierman

    Abstract: Standard multi-agent reinforcement learning (MARL) algorithms are vulnerable to sim-to-real gaps. To address this, distributionally robust Markov games (RMGs) have been proposed to enhance robustness in MARL by optimizing the worst-case performance when game dynamics shift within a prescribed uncertainty set. Solving RMGs remains under-explored, from problem formulation to the development of sampl…

    Submitted 7 October, 2024; v1 submitted 30 September, 2024; originally announced September 2024.

  3. arXiv:2409.01447  [pdf, other]

    cs.LG cs.GT

    Last-Iterate Convergence of Payoff-Based Independent Learning in Zero-Sum Stochastic Games

    Authors: Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman Ozdaglar, Adam Wierman

    Abstract: In this paper, we consider two-player zero-sum matrix and stochastic games and develop learning dynamics that are payoff-based, convergent, rational, and symmetric between the two players. Specifically, the learning dynamics for matrix games are based on the smoothed best-response dynamics, while the learning dynamics for stochastic games build upon those for matrix games, with additional incorpor…

    Submitted 4 September, 2024; v1 submitted 2 September, 2024; originally announced September 2024.

    Comments: A preliminary version [arXiv:2303.03100] of this paper, with a subset of the results that are presented here, was presented at NeurIPS 2023

  4. arXiv:2408.07831  [pdf, other]

    cs.DS cs.DC cs.LG

    CarbonClipper: Optimal Algorithms for Carbon-Aware Spatiotemporal Workload Management

    Authors: Adam Lechowicz, Nicolas Christianson, Bo Sun, Noman Bashir, Mohammad Hajiesmaili, Adam Wierman, Prashant Shenoy

    Abstract: We study carbon-aware spatiotemporal workload management, which seeks to address the growing environmental impact of data centers. We formalize this as an online problem called spatiotemporal online allocation with deadline constraints ($\mathsf{SOAD}$), in which an online player completes a workload (e.g., a batch compute job) by moving and scheduling the workload across a network subject to a de…

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 50 pages, 21 figures

  5. arXiv:2406.15788  [pdf, other]

    cs.LG

    Distributionally Robust Constrained Reinforcement Learning under Strong Duality

    Authors: Zhengfei Zhang, Kishan Panaganti, Laixi Shi, Yanan Sui, Adam Wierman, Yisong Yue

    Abstract: We study the problem of Distributionally Robust Constrained RL (DRC-RL), where the goal is to maximize the expected reward subject to environmental distribution shifts and constraints. This setting captures situations where training and testing environments differ, and policies must satisfy constraints motivated by safety or limited budgets. Despite significant progress toward algorithm design for…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Accepted at the Reinforcement Learning Conference (RLC) 2024; 28 pages, 4 figures

  6. arXiv:2405.20860  [pdf, other]

    cs.LG

    Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation

    Authors: Shangding Gu, Laixi Shi, Yuhao Ding, Alois Knoll, Costas Spanos, Adam Wierman, Ming Jin

    Abstract: Safe reinforcement learning (RL) is crucial for deploying RL agents in real-world applications, as it aims to maximize long-term rewards while satisfying safety constraints. However, safe RL often suffers from sample inefficiency, requiring extensive interactions with the environment to learn a safe policy. We propose Efficient Safe Policy Optimization (ESPO), a novel approach that enhances the ef…

    Submitted 31 May, 2024; originally announced May 2024.

  7. arXiv:2405.19811  [pdf, ps, other]

    cs.LG cs.MA

    Approximate Global Convergence of Independent Learning in Multi-Agent Systems

    Authors: Ruiyang Jin, Zaiwei Chen, Yiheng Lin, Jie Song, Adam Wierman

    Abstract: Independent learning (IL), despite being a popular approach in practice to achieve scalability in large-scale multi-agent systems, usually lacks global convergence guarantees. In this paper, we study two representative algorithms, independent $Q$-learning and independent natural actor-critic, within value-based and policy-based frameworks, and provide the first finite-sample analysis for approxima…

    Submitted 30 May, 2024; originally announced May 2024.

  8. arXiv:2405.13858  [pdf, other]

    cs.DC cs.AR cs.ET cs.LG

    Carbon Connect: An Ecosystem for Sustainable Computing

    Authors: Benjamin C. Lee, David Brooks, Arthur van Benthem, Udit Gupta, Gage Hills, Vincent Liu, Benjamin Pierce, Christopher Stewart, Emma Strubell, Gu-Yeon Wei, Adam Wierman, Yuan Yao, Minlan Yu

    Abstract: Computing is at a moment of profound opportunity. Emerging applications -- such as capable artificial intelligence, immersive virtual realities, and pervasive sensor systems -- drive unprecedented demand for computing. Despite recent advances toward net zero carbon emissions, the computing industry's gross energy usage continues to rise at an alarming rate, outpacing the growth of new energy instal…

    Submitted 21 August, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  9. arXiv:2405.09859  [pdf, other]

    cs.DS

    Risk-Sensitive Online Algorithms

    Authors: Nicolas Christianson, Bo Sun, Steven Low, Adam Wierman

    Abstract: We study the design of risk-sensitive online algorithms, in which risk measures are used in the competitive analysis of randomized online algorithms. We introduce the CVaR$_δ$-competitive ratio ($δ$-CR) using the conditional value-at-risk of an algorithm's cost, which measures the expectation of the $(1-δ)$-fraction of worst outcomes against the offline optimal cost, and use this measure to study…

    Submitted 24 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: Accepted for presentation at the Conference on Learning Theory (COLT) 2024. Updated with an additional reference and minor edits

  10. arXiv:2405.05468  [pdf, ps, other]

    cs.LG stat.ML

    Model-Free Robust $φ$-Divergence Reinforcement Learning Using Both Offline and Online Data

    Authors: Kishan Panaganti, Adam Wierman, Eric Mazumdar

    Abstract: The robust $φ$-regularized Markov Decision Process (RRMDP) framework focuses on designing control policies that are robust against parameter uncertainties due to mismatches between the simulator (nominal) model and real-world settings. This work makes two important contributions. First, we propose a model-free algorithm called Robust $φ$-regularized fitted Q-iteration (RPQ) for learning an $ε$-opt…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: To appear in the proceedings of the International Conference on Machine Learning (ICML) 2024

  11. arXiv:2404.18909  [pdf, other]

    cs.LG cs.MA stat.ML

    Sample-Efficient Robust Multi-Agent Reinforcement Learning in the Face of Environmental Uncertainty

    Authors: Laixi Shi, Eric Mazumdar, Yuejie Chi, Adam Wierman

    Abstract: To overcome the sim-to-real gap in reinforcement learning (RL), learned policies must maintain robustness against environmental uncertainties. While robust RL has been widely studied in single-agent regimes, in multi-agent environments, the problem remains understudied -- despite the fact that the problems posed by environmental uncertainties are often exacerbated by strategic interactions. This w…

    Submitted 8 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted by International Conference on Machine Learning, 2024

  12. arXiv:2402.14012  [pdf, other]

    cs.DS cs.LG

    Chasing Convex Functions with Long-term Constraints

    Authors: Adam Lechowicz, Nicolas Christianson, Bo Sun, Noman Bashir, Mohammad Hajiesmaili, Adam Wierman, Prashant Shenoy

    Abstract: We introduce and study a family of online metric problems with long-term constraints. In these problems, an online player makes decisions $\mathbf{x}_t$ in a metric space $(X,d)$ to simultaneously minimize their hitting cost $f_t(\mathbf{x}_t)$ and switching cost as determined by the metric. Over the time horizon $T$, the player must satisfy a long-term demand constraint…

    Submitted 12 July, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted to ICML 2024. 31 pages, 12 figures

  13. arXiv:2312.04905  [pdf, ps, other]

    cs.LG cs.MA

    Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games

    Authors: Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman Ozdaglar, Adam Wierman

    Abstract: We consider two-player zero-sum stochastic games and propose a two-timescale $Q$-learning algorithm with function approximation that is payoff-based, convergent, rational, and symmetric between the two players. In two-timescale $Q$-learning, the fast-timescale iterates are updated in the spirit of stochastic gradient descent and the slow-timescale iterates (which we use to compute the policies) ar…

    Submitted 8 December, 2023; originally announced December 2023.

  14. arXiv:2311.01698  [pdf, other]

    cs.LG cs.CR cs.MA

    Adversarial Attacks on Cooperative Multi-agent Bandits

    Authors: Jinhang Zuo, Zhiyao Zhang, Xuchuang Wang, Cheng Chen, Shuai Li, John C. S. Lui, Mohammad Hajiesmaili, Adam Wierman

    Abstract: Cooperative multi-agent multi-armed bandits (CMA2B) consider the collaborative efforts of multiple agents in a shared multi-armed bandit game. We study latent vulnerabilities exposed by this collaboration and consider adversarial attacks on a few agents with the goal of influencing the decisions of the rest. More specifically, we study adversarial attacks on CMA2B in both homogeneous settings, whe…

    Submitted 3 November, 2023; originally announced November 2023.

  15. arXiv:2311.01568  [pdf, other]

    cs.LG

    Anytime-Competitive Reinforcement Learning with Policy Prior

    Authors: Jianyi Yang, Pengfei Li, Tongxin Li, Adam Wierman, Shaolei Ren

    Abstract: This paper studies the problem of Anytime-Competitive Markov Decision Process (A-CMDP). Existing works on Constrained Markov Decision Processes (CMDPs) aim to optimize the expected reward while constraining the expected cost over random dynamics, but the cost in a specific episode can still be unsatisfactorily high. In contrast, the goal of A-CMDP is to optimize the expected reward while guarantee…

    Submitted 2 February, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted by NeurIPS 2023

  16. arXiv:2311.00181  [pdf, other]

    math.OC cs.DS cs.LG math.PR

    Best of Both Worlds Guarantees for Smoothed Online Quadratic Optimization

    Authors: Neelkamal Bhuyan, Debankur Mukherjee, Adam Wierman

    Abstract: We study the smoothed online quadratic optimization (SOQO) problem where, at each round $t$, a player plays an action $x_t$ in response to a quadratic hitting cost and an additional squared $\ell_2$-norm cost for switching actions. This problem class has strong connections to a wide range of application domains including smart grid management, adaptive control, and data center management, where sw…

    Submitted 23 March, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: 48 pages, 9 figures

  17. arXiv:2310.20598  [pdf, other]

    cs.DS cs.LG

    Online Conversion with Switching Costs: Robust and Learning-Augmented Algorithms

    Authors: Adam Lechowicz, Nicolas Christianson, Bo Sun, Noman Bashir, Mohammad Hajiesmaili, Adam Wierman, Prashant Shenoy

    Abstract: We introduce and study online conversion with switching costs, a family of online problems that capture emerging problems at the intersection of energy and sustainability. In this problem, an online player attempts to purchase (alternatively, sell) fractional shares of an asset during a fixed time horizon with length $T$. At each time step, a cost function (alternatively, price function) is reveal…

    Submitted 13 January, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: Accepted to SIGMETRICS / Performance '24. 47 pages, 9 figures

  18. arXiv:2310.20098  [pdf, other]

    cs.LG cs.DS math.OC

    Robust Learning for Smoothed Online Convex Optimization with Feedback Delay

    Authors: Pengfei Li, Jianyi Yang, Adam Wierman, Shaolei Ren

    Abstract: We study a challenging form of Smoothed Online Convex Optimization, a.k.a. SOCO, including multi-step nonlinear switching costs and feedback delay. We propose a novel machine learning (ML) augmented online algorithm, Robustness-Constrained Learning (RCL), which combines untrusted ML predictions with a trusted expert online algorithm via constrained projection to robustify the ML prediction. Specif…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023

  19. arXiv:2310.11558  [pdf, other]

    cs.LG cs.DS

    Online Algorithms with Uncertainty-Quantified Predictions

    Authors: Bo Sun, Jerry Huang, Nicolas Christianson, Mohammad Hajiesmaili, Adam Wierman, Raouf Boutaba

    Abstract: The burgeoning field of algorithms with predictions studies the problem of using possibly imperfect machine learning predictions to improve online algorithm performance. While nearly all existing algorithms in this framework make no assumptions on prediction quality, a number of methods providing uncertainty quantification (UQ) on machine learning models have been developed in recent years, which…

    Submitted 3 June, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

  20. arXiv:2309.14648  [pdf, other]

    math.OC cs.LG math.ST

    Learning the Uncertainty Sets for Control Dynamics via Set Membership: A Non-Asymptotic Analysis

    Authors: Yingying Li, Jing Yu, Lauren Conger, Taylan Kargin, Adam Wierman

    Abstract: This paper studies uncertainty set estimation for unknown linear systems. Uncertainty sets are crucial for the quality of robust control since they directly influence the conservativeness of the control design. Departing from the confidence region analysis of least squares estimation, this paper focuses on set membership estimation (SME). Though good numerical performances have attracted applicati…

    Submitted 9 June, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: ICML 2024

  21. arXiv:2307.10524  [pdf, other]

    cs.LG cs.PF

    Beyond Black-Box Advice: Learning-Augmented Algorithms for MDPs with Q-Value Predictions

    Authors: Tongxin Li, Yiheng Lin, Shaolei Ren, Adam Wierman

    Abstract: We study the tradeoff between consistency and robustness in the context of a single-trajectory time-varying Markov Decision Process (MDP) with untrusted machine-learned advice. Our work departs from the typical approach of treating advice as coming from black-box sources by instead considering a setting where additional information about how the advice is generated is available. We prove a first-o…

    Submitted 28 October, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: 32 pages, NeurIPS 2023

  22. arXiv:2307.05494  [pdf, other]

    cs.AI cs.CY

    Towards Environmentally Equitable AI via Geographical Load Balancing

    Authors: Pengfei Li, Jianyi Yang, Adam Wierman, Shaolei Ren

    Abstract: Fueled by the soaring popularity of large language and foundation models, the accelerated growth of artificial intelligence (AI) models' enormous environmental footprint has come under increased scrutiny. While many approaches have been proposed to make AI more energy-efficient and environmentally friendly, environmental inequity -- the fact that AI's environmental footprint can be disproportionat…

    Submitted 2 May, 2024; v1 submitted 20 June, 2023; originally announced July 2023.

    Comments: Accepted by ACM e-Energy 2024

  23. arXiv:2306.10158  [pdf, other]

    cs.LG math.OC

    Learning-Augmented Decentralized Online Convex Optimization in Networks

    Authors: Pengfei Li, Jianyi Yang, Adam Wierman, Shaolei Ren

    Abstract: This paper studies decentralized online convex optimization in a networked multi-agent system and proposes a novel algorithm, Learning-Augmented Decentralized Online optimization (LADO), for individual agents to select actions only based on local online information. LADO leverages a baseline policy to safeguard online actions for worst-case robustness guarantees, while staying close to the machine…

    Submitted 23 September, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

  24. arXiv:2305.17071  [pdf, other]

    cs.LG cs.CR cs.IR

    Adversarial Attacks on Online Learning to Rank with Click Feedback

    Authors: Jinhang Zuo, Zhiyao Zhang, Zhiyong Wang, Shuai Li, Mohammad Hajiesmaili, Adam Wierman

    Abstract: Online learning to rank (OLTR) is a sequential decision-making problem where a learning agent selects an ordered list of items and receives feedback through user clicks. Although potential attacks against OLTR algorithms may cause serious losses in real-world applications, little is known about adversarial attacks on OLTR. This paper studies attack strategies against multiple variants of OLTR. Our…

    Submitted 26 May, 2023; originally announced May 2023.

  25. arXiv:2303.17551  [pdf, other]

    cs.DS cs.DC

    The Online Pause and Resume Problem: Optimal Algorithms and An Application to Carbon-Aware Load Shifting

    Authors: Adam Lechowicz, Nicolas Christianson, Jinhang Zuo, Noman Bashir, Mohammad Hajiesmaili, Adam Wierman, Prashant Shenoy

    Abstract: We introduce and study the online pause and resume problem. In this problem, a player attempts to find the $k$ lowest (alternatively, highest) prices in a sequence of fixed length $T$, which is revealed sequentially. At each time step, the player is presented with a price and decides whether to accept or reject it. The player incurs a switching cost whenever their decision changes in consecutive t…

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: 34 pages, 12 figures

    Journal ref: Proc. ACM Meas. Anal. Comput. Syst. Volume 7, Issue 3, Article 45 (December 2023), 32 pages

  26. arXiv:2303.17110  [pdf, other]

    cs.LG cs.AI stat.ML

    Contextual Combinatorial Bandits with Probabilistically Triggered Arms

    Authors: Xutong Liu, Jinhang Zuo, Siwei Wang, John C. S. Lui, Mohammad Hajiesmaili, Adam Wierman, Wei Chen

    Abstract: We study contextual combinatorial bandits with probabilistically triggered arms (C$^2$MAB-T) under a variety of smoothness conditions that capture a wide range of applications, such as contextual cascading bandits and contextual influence maximization bandits. Under the triggering probability modulated (TPM) condition, we devise the C$^2$-UCB-T algorithm and propose a novel analysis that achieves…

    Submitted 14 June, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: Accepted in the 40th International Conference on Machine Learning (ICML), 2023

  27. arXiv:2303.04865  [pdf, other]

    cs.LG cs.AI cs.MA

    Convergence Rates for Localized Actor-Critic in Networked Markov Potential Games

    Authors: Zhaoyi Zhou, Zaiwei Chen, Yiheng Lin, Adam Wierman

    Abstract: We introduce a class of networked Markov potential games in which agents are associated with nodes in a network. Each agent has its own local potential function, and the reward of each agent depends only on the states and actions of the agents within a neighborhood. In this context, we propose a localized actor-critic algorithm. The algorithm is scalable since each agent uses only local informatio…

    Submitted 8 July, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

  28. arXiv:2303.03100  [pdf, ps, other]

    cs.GT cs.LG

    A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games

    Authors: Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman Ozdaglar, Adam Wierman

    Abstract: We study two-player zero-sum stochastic games, and propose a form of independent learning dynamics called Doubly Smoothed Best-Response dynamics, which integrates a discrete and doubly smoothed variant of the best-response dynamics into temporal-difference (TD)-learning and minimax value iteration. The resulting dynamics are payoff-based, convergent, rational, and symmetric among players. Our main…

    Submitted 3 March, 2023; originally announced March 2023.

  29. arXiv:2211.17116  [pdf, other]

    cs.LG cs.AI cs.MA math.OC

    Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning

    Authors: Yizhou Zhang, Guannan Qu, Pan Xu, Yiheng Lin, Zaiwei Chen, Adam Wierman

    Abstract: We study a multi-agent reinforcement learning (MARL) problem where the agents interact over a given network. The goal of the agents is to cooperatively maximize the average of their entropy-regularized long-term rewards. To overcome the curse of dimensionality and to reduce communication, we propose a Localized Policy Iteration (LPI) algorithm that provably learns a near-globally-optimal policy us…

    Submitted 30 November, 2022; originally announced November 2022.

  30. arXiv:2209.11934  [pdf, ps, other]

    cs.DS

    The Online Knapsack Problem with Departures

    Authors: Bo Sun, Lin Yang, Mohammad Hajiesmaili, Adam Wierman, John C. S. Lui, Don Towsley, Danny H. K. Tsang

    Abstract: The online knapsack problem is a classic online resource allocation problem in networking and operations research. Its basic version studies how to pack online arriving items of different sizes and values into a capacity-limited knapsack. In this paper, we study a general version that includes item departures, while also considering multiple knapsacks and multi-dimensional item sizes. We design a…

    Submitted 15 March, 2023; v1 submitted 24 September, 2022; originally announced September 2022.

  31. arXiv:2206.11780  [pdf, other]

    cs.LG cs.DS math.OC stat.ML

    Chasing Convex Bodies and Functions with Black-Box Advice

    Authors: Nicolas Christianson, Tinashe Handina, Adam Wierman

    Abstract: We consider the problem of convex function chasing with black-box advice, where an online decision-maker aims to minimize the total cost of making and switching between decisions in a normed vector space, aided by black-box advice such as the decisions of a machine-learned algorithm. The decision-maker seeks cost comparable to the advice when it performs well, known as $\textit{consistency}$, whil…

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: Accepted to COLT 2022

  32. arXiv:2206.01704  [pdf, ps, other]

    cs.LG eess.SY math.OC stat.ML

    KCRL: Krasovskii-Constrained Reinforcement Learning with Guaranteed Stability in Nonlinear Dynamical Systems

    Authors: Sahin Lale, Yuanyuan Shi, Guannan Qu, Kamyar Azizzadenesheli, Adam Wierman, Anima Anandkumar

    Abstract: Learning a dynamical system requires stabilizing the unknown dynamics to avoid state blow-ups. However, current reinforcement learning (RL) methods lack stabilization guarantees, which limits their applicability for the control of safety-critical systems. We propose a model-based RL framework with formal stability guarantees, Krasovskii Constrained RL (KCRL), that adopts Krasovskii's family of Lya…

    Submitted 3 June, 2022; originally announced June 2022.

  33. arXiv:2206.01341  [pdf, other]

    cs.LG eess.SY stat.ML

    Equipping Black-Box Policies with Model-Based Advice for Stable Nonlinear Control

    Authors: Tongxin Li, Ruixiao Yang, Guannan Qu, Yiheng Lin, Steven Low, Adam Wierman

    Abstract: Machine-learned black-box policies are ubiquitous for nonlinear control problems. Meanwhile, crude model information is often available for these problems from, e.g., linear approximations of nonlinear dynamics. We study the problem of equipping a black-box control policy with model-based advice for nonlinear control on a single trajectory. We first show a general negative result that a naive conv…

    Submitted 2 June, 2022; originally announced June 2022.

    Comments: 33 pages, 7 figures

  34. arXiv:2204.05551  [pdf, other]

    math.OC cs.LG eess.SY math.DS

    Near-Optimal Distributed Linear-Quadratic Regulator for Networked Systems

    Authors: Sungho Shin, Yiheng Lin, Guannan Qu, Adam Wierman, Mihai Anitescu

    Abstract: This paper studies the trade-off between the degree of decentralization and the performance of a distributed controller in a linear-quadratic control setting. We study a system of interconnected agents over a graph and a distributed controller, called $κ$-distributed control, which lets the agents make control decisions based on the state information within distance $κ$ on the underlying graph. Th…

    Submitted 11 September, 2022; v1 submitted 12 April, 2022; originally announced April 2022.

  35. arXiv:2203.04503  [pdf, other]

    cs.GT math.OC

    An Energy Sharing Mechanism Considering Network Constraints and Market Power Limitation

    Authors: Yue Chen, Changhong Zhao, Steven H. Low, Adam Wierman

    Abstract: As the number of prosumers with distributed energy resources (DERs) grows, the conventional centralized operation scheme may suffer from conflicting interests, privacy concerns, and incentive inadequacy. In this paper, we propose an energy sharing mechanism to address the above challenges. It takes into account network constraints and fairness among prosumers. In the proposed energy sharing market…

    Submitted 27 June, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: 23 pages, 14 figures

  36. arXiv:2202.07086  [pdf, other]

    cs.GT cs.MA math.OC

    Price Cycles in Ridesharing Platforms

    Authors: Chenkai Yu, Hongyao Ma, Adam Wierman

    Abstract: In ridesharing platforms such as Uber and Lyft, it is observed that drivers sometimes collaboratively go offline when the price is low, and then return after the price has risen due to the perceived lack of supply. This collective strategy leads to cyclic fluctuations in prices and available drivers, resulting in poor reliability and social welfare. We study a continuous time, non-atomic model and…

    Submitted 14 February, 2022; originally announced February 2022.

  37. arXiv:2202.03519  [pdf, other]

    cs.LG cs.DS

    Smoothed Online Optimization with Unreliable Predictions

    Authors: Daan Rutten, Nico Christianson, Debankur Mukherjee, Adam Wierman

    Abstract: We examine the problem of smoothed online optimization, where a decision maker must sequentially choose points in a normed vector space to minimize the sum of per-round, non-convex hitting costs and the costs of switching decisions between rounds. The decision maker has access to a black-box oracle, such as a machine learning model, that provides untrusted and potentially inaccurate predictions of…

    Submitted 26 October, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: 38 pages, 4 figures

    MSC Class: 68Q25 (Primary) 68T20; 68W27 (Secondary)

    ACM Class: G.1.6; I.2.8; F.2.2

  38. arXiv:2111.00095  [pdf, other]

    cs.LG eess.SY math.OC

    Online Optimization with Feedback Delay and Nonlinear Switching Cost

    Authors: Weici Pan, Guanya Shi, Yiheng Lin, Adam Wierman

    Abstract: We study a variant of online optimization in which the learner receives $k$-round $\textit{delayed feedback}$ about hitting cost and there is a multi-step nonlinear switching cost, i.e., costs depend on multiple previous actions in a nonlinear manner. Our main result shows that a novel Iterative Regularized Online Balanced Descent (iROBD) algorithm has a constant, dimension-free competitive ratio…

    Submitted 29 October, 2021; originally announced November 2021.

  39. arXiv:2109.01556  [pdf, other]

    cs.LG cs.DS

    Pareto-Optimal Learning-Augmented Algorithms for Online Conversion Problems

    Authors: Bo Sun, Russell Lee, Mohammad Hajiesmaili, Adam Wierman, Danny H. K. Tsang

    Abstract: This paper leverages machine-learned predictions to design competitive algorithms for online conversion problems with the goal of improving the competitive ratio when predictions are accurate (i.e., consistency), while also guaranteeing a worst-case competitive ratio regardless of the prediction quality (i.e., robustness). We unify the algorithmic design of both integral and fractional conversion…

    Submitted 3 September, 2021; originally announced September 2021.

  40. arXiv:2106.08872  [pdf, other]

    cs.DC

    Enabling Sustainable Clouds: The Case for Virtualizing the Energy System

    Authors: Noman Bashir, Tian Guo, Mohammad Hajiesmaili, David Irwin, Prashant Shenoy, Ramesh Sitaraman, Abel Souza, Adam Wierman

    Abstract: Cloud platforms' growing energy demand and carbon emissions are raising concern about their environmental sustainability. The current approach to enabling sustainable clouds focuses on improving energy-efficiency and purchasing carbon offsets. These approaches have limits: many cloud data centers already operate near peak efficiency, and carbon offsets cannot scale to near zero carbon where there…

    Submitted 16 June, 2021; originally announced June 2021.

  41. arXiv:2106.02156  [pdf, other]

    cs.NI

    Trading Throughput for Freshness: Freshness-Aware Traffic Engineering and In-Network Freshness Control

    Authors: Shih-Hao Tseng, SooJean Han, Adam Wierman

    Abstract: In addition to traditional concerns such as throughput and latency, freshness is becoming increasingly important. To stay fresh, applications stream status updates among their components. Existing studies propose the metric age of information (AoI) to gauge the freshness and design systems to achieve low AoI. Despite active research in this area, existing results are not applicable to general wire…

    Submitted 14 December, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

  42. arXiv:2105.14262  [pdf, other]

    cs.SI cs.GT

    The Privacy Paradox and Optimal Bias-Variance Trade-offs in Data Acquisition

    Authors: Guocheng Liao, Yu Su, Juba Ziani, Adam Wierman, Jianwei Huang

    Abstract: While users claim to be concerned about privacy, often they do little to protect their privacy in their online actions. One prominent explanation for this "privacy paradox" is that when an individual shares her data, it is not just her privacy that is compromised; the privacy of other individuals with correlated data is also compromised. This information leakage encourages oversharing of data and…

    Submitted 29 May, 2021; originally announced May 2021.

  43. arXiv:2104.14134  [pdf, other]

    math.OC cs.LG eess.SY

    Stable Online Control of Linear Time-Varying Systems

    Authors: Guannan Qu, Yuanyuan Shi, Sahin Lale, Anima Anandkumar, Adam Wierman

    Abstract: Linear time-varying (LTV) systems are widely used for modeling real-world dynamical systems due to their generality and simplicity. Providing stability guarantees for LTV systems is one of the central problems in control theory. However, existing approaches that guarantee stability typically lead to significantly sub-optimal cumulative control cost in online settings where only current or short-te…

    Submitted 29 April, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

    Comments: 3rd Annual Learning for Dynamics & Control Conference (L4DC)

  44. arXiv:2012.05361  [pdf, ps, other]

    cs.DS math.NA

    Data-driven Competitive Algorithms for Online Knapsack and Set Cover

    Authors: Ali Zeynali, Bo Sun, Mohammad Hajiesmaili, Adam Wierman

    Abstract: The design of online algorithms has tended to focus on algorithms with worst-case guarantees, e.g., bounds on the competitive ratio. However, it is well-known that such algorithms are often overly pessimistic, performing sub-optimally on non-worst-case inputs. In this paper, we develop an approach for data-driven design of online algorithms that maintain near-optimal worst-case guarantees while al…

    Submitted 9 December, 2020; originally announced December 2020.

  45. arXiv:2010.00412  [pdf, other]

    cs.DS

    Competitive Algorithms for the Online Multiple Knapsack Problem with Application to Electric Vehicle Charging

    Authors: Bo Sun, Ali Zeynali, Tongxin Li, Mohammad Hajiesmaili, Adam Wierman, Danny H. K. Tsang

    Abstract: We introduce and study a general version of the fractional online knapsack problem with multiple knapsacks, heterogeneous constraints on which items can be assigned to which knapsack, and rate-limiting constraints on the assignment of items to knapsacks. This problem generalizes variations of the knapsack problem and of the one-way trading problem that have previously been treated separately, and…

    Submitted 17 October, 2020; v1 submitted 1 October, 2020; originally announced October 2020.

  46. arXiv:2006.07476  [pdf, other]

    math.OC cs.LG eess.SY

    Combining Model-Based and Model-Free Methods for Nonlinear Control: A Provably Convergent Policy Gradient Approach

    Authors: Guannan Qu, Chenkai Yu, Steven Low, Adam Wierman

    Abstract: Model-free learning-based control methods have seen great success recently. However, such methods typically suffer from poor sample complexity and limited convergence guarantees. This is in sharp contrast to classical model-based control, which has a rich theory but typically requires strong modeling assumptions. In this paper, we combine the two approaches to achieve the best of both worlds. We c…

    Submitted 12 June, 2020; originally announced June 2020.

  47. arXiv:2006.06626  [pdf, other]

    math.OC cs.AI cs.LG cs.MA eess.SY

    Scalable Multi-Agent Reinforcement Learning for Networked Systems with Average Reward

    Authors: Guannan Qu, Yiheng Lin, Adam Wierman, Na Li

    Abstract: It has long been recognized that multi-agent reinforcement learning (MARL) faces significant scalability issues due to the fact that the size of the state and action spaces are exponentially large in the number of agents. In this paper, we identify a rich class of networked MARL problems where the model exhibits a local dependence structure that allows it to be solved in a scalable manner. Specifi…

    Submitted 11 June, 2020; originally announced June 2020.

  48. arXiv:2006.06555  [pdf, ps, other]

    cs.LG cs.MA stat.ML

    Multi-Agent Reinforcement Learning in Stochastic Networked Systems

    Authors: Yiheng Lin, Guannan Qu, Longbo Huang, Adam Wierman

    Abstract: We study multi-agent reinforcement learning (MARL) in a stochastic network of agents. The objective is to find localized policies that maximize the (discounted) global reward. In general, scalability is a challenge in this setting because the size of the global state/action space can be exponential in the number of agents. Scalable algorithms are only known in cases where dependencies are static,…

    Submitted 1 November, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

  49. arXiv:2004.14639  [pdf, other]

    cs.PF cs.DC

    Communication-Aware Scheduling of Precedence-Constrained Tasks on Related Machines

    Authors: Yu Su, Xiaoqi Ren, Shai Vardi, Adam Wierman

    Abstract: Scheduling precedence-constrained tasks is a classical problem that has been studied for more than fifty years. However, little progress has been made in the setting where there are communication delays between tasks. Results for the case of identical machines were derived nearly thirty years ago, and yet no results for related machines have followed. In this work, we propose a new scheduler, Gene…

    Submitted 30 April, 2020; originally announced April 2020.

  50. arXiv:2002.08908  [pdf, ps, other]

    cs.PF cs.DC math.PR

    Asymptotically Optimal Load Balancing in Large-scale Heterogeneous Systems with Multiple Dispatchers

    Authors: Xingyu Zhou, Ness Shroff, Adam Wierman

    Abstract: We consider the load balancing problem in large-scale heterogeneous systems with multiple dispatchers. We introduce a general framework called Local-Estimation-Driven (LED). Under this framework, each dispatcher keeps local (possibly outdated) estimates of queue lengths for all the servers, and the dispatching decision is made purely based on these local estimates. The local estimates are updated…

    Submitted 20 February, 2020; originally announced February 2020.

    Comments: 2 figures