(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–15 of 15 results for author: Zaman, M A u

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.06528  [pdf, ps, other

    math.OC cs.IT eess.SY

    Semantic Communication in Multi-team Dynamic Games: A Mean Field Perspective

    Authors: Shubham Aggarwal, Muhammad Aneeq uz Zaman, Melih Bastopcu, Tamer Başar

    Abstract: Coordinating communication and control is a key component in the stability and performance of networked multi-agent systems. While single user networked control systems have gained a lot of attention within this domain, in this work, we address the more challenging problem of large population multi-team dynamic games. In particular, each team constitutes two decision makers (namely, the sensor and… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Submitted to IEEE for possible publication

  2. arXiv:2406.13992  [pdf, ps, other

    cs.MA eess.SY

    Robust Cooperative Multi-Agent Reinforcement Learning:A Mean-Field Type Game Perspective

    Authors: Muhammad Aneeq uz Zaman, Mathieu Laurière, Alec Koppel, Tamer Başar

    Abstract: In this paper, we study the problem of robust cooperative multi-agent reinforcement learning (RL) where a large number of cooperative agents with distributed information aim to learn policies in the presence of \emph{stochastic} and \emph{non-stochastic} uncertainties whose distributions are respectively known and unknown. Focusing on policy optimization that accounts for both types of uncertainti… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted for publication in L4DC 2024

  3. arXiv:2404.02898  [pdf, ps, other

    cs.IT cs.GT cs.NI eess.SY

    A Mean Field Game Model for Timely Computation in Edge Computing Systems

    Authors: Shubham Aggarwal, Muhammad Aneeq uz Zaman, Melih Bastopcu, Sennur Ulukus, Tamer Başar

    Abstract: We consider the problem of task offloading in multi-access edge computing (MEC) systems constituting $N$ devices assisted by an edge server (ES), where the devices can split task execution between a local processor and the ES. Since the local task execution and communication with the ES both consume power, each device must judiciously choose between the two. We model the problem as a large populat… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Submitted to IEEE for possible publication

  4. arXiv:2404.00045  [pdf, ps, other

    cs.GT cs.AI cs.LG cs.MA

    Policy Optimization finds Nash Equilibrium in Regularized General-Sum LQ Games

    Authors: Muhammad Aneeq uz Zaman, Shubham Aggarwal, Melih Bastopcu, Tamer Başar

    Abstract: In this paper, we investigate the impact of introducing relative entropy regularization on the Nash Equilibria (NE) of General-Sum $N$-agent games, revealing the fact that the NE of such games conform to linear Gaussian policies. Moreover, it delineates sufficient conditions, contingent upon the adequacy of entropy regularization, for the uniqueness of the NE within the game. As Policy Optimizatio… ▽ More

    Submitted 25 March, 2024; originally announced April 2024.

  5. arXiv:2403.11345  [pdf, other

    cs.LG cs.AI cs.GT cs.MA

    Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective

    Authors: Muhammad Aneeq uz Zaman, Alec Koppel, Mathieu Laurière, Tamer Başar

    Abstract: We address in this paper Reinforcement Learning (RL) among agents that are grouped into teams such that there is cooperation within each team but general-sum (non-zero sum) competition across different teams. To develop an RL method that provably achieves a Nash equilibrium, we focus on a linear-quadratic structure. Moreover, to tackle the non-stationarity induced by multi-agent interactions in th… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  6. arXiv:2309.15423  [pdf, other

    cs.GT eess.SY

    Prosumers Participation in Markets: A Scalar-Parameterized Function Bidding Approach

    Authors: Abdullah Alawad, Muhammad Aneeq uz Zaman, Khaled Alshehri, Tamer Başar

    Abstract: In uniform-price markets, suppliers compete to supply a resource to consumers, resulting in a single market price determined by their competition. For sufficient flexibility, producers and consumers prefer to commit to a function as their strategies, indicating their preferred quantity at any given market price. Producers and consumers may wish to act as both, i.e., prosumers. In this paper, we ex… ▽ More

    Submitted 14 March, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: Corrected typos in the figures

  7. arXiv:2303.09515  [pdf, ps, other

    eess.SY cs.GT cs.SI math.OC

    Large Population Games on Constrained Unreliable Networks

    Authors: Shubham Aggarwal, Muhammad Aneeq uz Zaman, Melih Bastopcu, Tamer Başar

    Abstract: This paper studies an $N$--agent cost-coupled game where the agents are connected via an unreliable capacity constrained network. Each agent receives state information over that network which loses packets with probability $p$. A Base station (BS) actively schedules agent communications over the network by minimizing a weighted Age of Information (WAoI) based cost function under a capacity limit… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: Submitted to IEEE for possible publication

  8. arXiv:2209.12888  [pdf, ps, other

    eess.SY cs.IT cs.NI math.OC

    Weighted Age of Information based Scheduling for Large Population Games on Networks

    Authors: Shubham Aggarwal, Muhammad Aneeq uz Zaman, Melih Bastopcu, Tamer Başar

    Abstract: In this paper, we consider a discrete-time multi-agent system involving $N$ cost-coupled networked rational agents solving a consensus problem and a central Base Station (BS), scheduling agent communications over a network. Due to a hard bandwidth constraint on the number of transmissions through the network, at most $R_d < N$ agents can concurrently access their state information through the netw… ▽ More

    Submitted 26 December, 2022; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: This work has been submitted to IEEE for possible publication

  9. arXiv:2208.11639  [pdf, ps, other

    cs.LG cs.GT cs.MA

    Oracle-free Reinforcement Learning in Mean-Field Games along a Single Sample Path

    Authors: Muhammad Aneeq uz Zaman, Alec Koppel, Sujay Bhatt, Tamer Başar

    Abstract: We consider online reinforcement learning in Mean-Field Games (MFGs). Unlike traditional approaches, we alleviate the need for a mean-field oracle by developing an algorithm that approximates the Mean-Field Equilibrium (MFE) using the single sample path of the generic agent. We call this {\it Sandbox Learning}, as it can be used as a warm-start for any agent learning in a multi-agent non-cooperati… ▽ More

    Submitted 11 April, 2023; v1 submitted 24 August, 2022; originally announced August 2022.

    Comments: Accepted for publication in AISTATS 2023

  10. arXiv:2203.05686  [pdf, other

    eess.SY cs.MA math.OC

    Linear Quadratic Mean-Field Games with Communication Constraints

    Authors: Shubham Aggarwal, Muhammad Aneeq uz Zaman, Tamer Başar

    Abstract: In this paper, we study a large population game with heterogeneous dynamics and cost functions solving a consensus problem. Moreover, the agents have communication constraints which appear as: (1) an Additive-White Gaussian Noise (AWGN) channel, and (2) asynchronous data transmission via a fixed scheduling policy. Since the complexity of solving the game increases with the number of agents, we use… ▽ More

    Submitted 25 August, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

    Comments: Accepted in American Control Conference 2022

  11. arXiv:2109.14461  [pdf, other

    eess.SY cs.MA

    Adversarial Linear-Quadratic Mean-Field Games over Multigraphs

    Authors: Muhammad Aneeq uz Zaman, Sujay Bhatt, Tamer Başar

    Abstract: In this paper, we propose a game between an exogenous adversary and a network of agents connected via a multigraph. The multigraph is composed of (1) a global graph structure, capturing the virtual interactions among the agents, and (2) a local graph structure, capturing physical/local interactions among the agents. The aim of each agent is to achieve consensus with the other agents in a decentral… ▽ More

    Submitted 3 October, 2021; v1 submitted 29 September, 2021; originally announced September 2021.

    Comments: Accepted at 2021 IEEE Conference on Decision and Control (CDC)

  12. arXiv:2009.04350  [pdf, ps, other

    eess.SY cs.GT cs.LG

    Reinforcement Learning in Non-Stationary Discrete-Time Linear-Quadratic Mean-Field Games

    Authors: Muhammad Aneeq uz Zaman, Kaiqing Zhang, Erik Miehling, Tamer Başar

    Abstract: In this paper, we study large population multi-agent reinforcement learning (RL) in the context of discrete-time linear-quadratic mean-field games (LQ-MFGs). Our setting differs from most existing work on RL for MFGs, in that we consider a non-stationary MFG over an infinite horizon. We propose an actor-critic algorithm to iteratively compute the mean-field equilibrium (MFE) of the LQ-MFG. There a… ▽ More

    Submitted 1 October, 2020; v1 submitted 9 September, 2020; originally announced September 2020.

    Comments: To appear in CDC 2020

  13. arXiv:2007.01334  [pdf, other

    math.OC cs.AI eess.SY

    Multi-agent Planning for thermalling gliders using multi level graph-search

    Authors: Muhammad Aneeq uz Zaman, Aamer Iqbal Bhatti

    Abstract: This paper solves a path planning problem for a group of gliders. The gliders are tasked with visiting a set of interest points. The gliders have limited range but are able to increase their range by visiting special points called thermals. The problem addressed in this paper is of path planning for the gliders such that, the total number of interest points visited by the gliders is maximized. Thi… ▽ More

    Submitted 2 July, 2020; originally announced July 2020.

  14. A Case Study to Identify the Hindrances to Widespread Adoption of Electric Vehicles in Qatar

    Authors: Amith Khandakar, Annaufal Rizqullah, Anas Ashraf Abdou Berbar, Mohammad Rafi Ahmed, Atif Iqbal, Muhammad E. H. Chowdhury, S. M. Ashfaq Uz Zaman

    Abstract: The adoption of electric vehicles (EVs) have proven to be a crucial factor to decreasing the emission of greenhouse gases (GHG) into the atmosphere. However, there are various hurdles that impede people from purchasing EVs. For example, long charging time, short driving range, cost and insufficient charging infrastructures available, etc. This article reports the public perception of EV-adoption u… ▽ More

    Submitted 27 June, 2020; originally announced June 2020.

    Comments: 22 pages, 5 Figures, 5 tables

    Journal ref: Energies 2020, 13(15), 3994

  15. arXiv:2003.13195  [pdf, other

    eess.SY cs.MA

    Approximate Equilibrium Computation for Discrete-Time Linear-Quadratic Mean-Field Games

    Authors: Muhammad Aneeq uz Zaman, Kaiqing Zhang, Erik Miehling, Tamer Başar

    Abstract: While the topic of mean-field games (MFGs) has a relatively long history, heretofore there has been limited work concerning algorithms for the computation of equilibrium control policies. In this paper, we develop a computable policy iteration algorithm for approximating the mean-field equilibrium in linear-quadratic MFGs with discounted cost. Given the mean-field, each agent faces a linear-quadra… ▽ More

    Submitted 6 April, 2020; v1 submitted 29 March, 2020; originally announced March 2020.

    Comments: This paper has been accepted in ACC 2020