(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 169 results for author: Topcu, U

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.02860  [pdf, other

    cs.GT

    Nash Equilibrium in Games on Graphs with Incomplete Preferences

    Authors: Abhishek N. Kulkarni, Jie Fu, Ufuk Topcu

    Abstract: Games with incomplete preferences are an important model for studying rational decision-making in scenarios where players face incomplete information about their preferences and must contend with incomparable outcomes. We study the problem of computing Nash equilibrium in a subclass of two-player games played on graphs where each player seeks to maximally satisfy their (possibly incomplete) prefer… ▽ More

    Submitted 11 August, 2024; v1 submitted 5 August, 2024; originally announced August 2024.

    Comments: 14 page, 6 figure, under development

  2. arXiv:2406.07556  [pdf

    cs.CY

    Community Driven Approaches to Research in Technology & Society CCC Workshop Report

    Authors: Suresh Venkatasubramanian, Timnit Gebru, Ufuk Topcu, Haley Griffin, Leah Namisa Rosenbloom, Nasim Sonboli

    Abstract: Based on our workshop activities, we outlined three ways in which research can support community needs: (1) Mapping the ecosystem of both the players and ecosystem and harm landscapes, (2) Counter-Programming, which entails using the same surveillance tools that communities are subjected to observe the entities doing the surveilling, effectively protecting people from surveillance, and conducting… ▽ More

    Submitted 21 March, 2024; originally announced June 2024.

  3. arXiv:2406.03565  [pdf, other

    cs.GT cs.MA eess.SY

    Second-Order Algorithms for Finding Local Nash Equilibria in Zero-Sum Games

    Authors: Kushagra Gupta, Xinjie Liu, Ufuk Topcu, David Fridovich-Keil

    Abstract: Zero-sum games arise in a wide variety of problems, including robust optimization and adversarial learning. However, algorithms deployed for finding a local Nash equilibrium in these games often converge to non-Nash stationary points. This highlights a key challenge: for any algorithm, the stability properties of its underlying dynamical system can cause non-Nash points to be potential attractors.… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  4. arXiv:2405.14173  [pdf, other

    cs.AI cs.HC

    Human-Agent Cooperation in Games under Incomplete Information through Natural Language Communication

    Authors: Shenghui Chen, Daniel Fried, Ufuk Topcu

    Abstract: Developing autonomous agents that can strategize and cooperate with humans under information asymmetry is challenging without effective communication in natural language. We introduce a shared-control game, where two players collectively control a token in alternating turns to achieve a common objective under incomplete information. We formulate a policy synthesis problem for an autonomous agent i… ▽ More

    Submitted 1 June, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: with appendix

  5. arXiv:2405.08954  [pdf, other

    cs.RO

    Zero-Shot Transfer of Neural ODEs

    Authors: Tyler Ingebrand, Adam J. Thorpe, Ufuk Topcu

    Abstract: Autonomous systems often encounter environments and scenarios beyond the scope of their training data, which underscores a critical challenge: the need to generalize and adapt to unseen scenarios in real time. This challenge necessitates new mathematical and algorithmic tools that enable adaptation and zero-shot transfer. To this end, we leverage the theory of function encoders, which enables zero… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  6. arXiv:2404.00923  [pdf, other

    cs.CV cs.AI cs.RO

    MM3DGS SLAM: Multi-modal 3D Gaussian Splatting for SLAM Using Vision, Depth, and Inertial Measurements

    Authors: Lisong C. Sun, Neel P. Bhatt, Jonathan C. Liu, Zhiwen Fan, Zhangyang Wang, Todd E. Humphreys, Ufuk Topcu

    Abstract: Simultaneous localization and mapping is essential for position tracking and scene understanding. 3D Gaussian-based map representations enable photorealistic reconstruction and real-time rendering of scenes using multiple posed cameras. We show for the first time that using 3D Gaussians for map representation with unposed camera images and inertial measurements can enable accurate SLAM. Our method… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Project Webpage: https://vita-group.github.io/MM3DGS-SLAM

  7. arXiv:2403.17233  [pdf, other

    eess.SY cs.LG

    Active Learning of Dynamics Using Prior Domain Knowledge in the Sampling Process

    Authors: Kevin S. Miller, Adam J. Thorpe, Ufuk Topcu

    Abstract: We present an active learning algorithm for learning dynamics that leverages side information by explicitly incorporating prior domain knowledge into the sampling process. Our proposed algorithm guides the exploration toward regions that demonstrate high empirical discrepancy between the observed data and an imperfect prior model of the dynamics derived from side information. Through numerical exp… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  8. arXiv:2403.12279  [pdf, other

    cs.RO

    Scalable Networked Feature Selection with Randomized Algorithm for Robot Navigation

    Authors: Vivek Pandey, Arash Amini, Guangyi Liu, Ufuk Topcu, Qiyu Sun, Kostas Daniilidis, Nader Motee

    Abstract: We address the problem of sparse selection of visual features for localizing a team of robots navigating an unknown environment, where robots can exchange relative position measurements with neighbors. We select a set of the most informative features by anticipating their importance in robots localization by simulating trajectories of robots over a prediction horizon. Through theoretical proofs, w… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  9. arXiv:2403.10705  [pdf, other

    cs.SI

    Susceptibility of Communities against Low-Credibility Content in Social News Websites

    Authors: Yigit Ege Bayiz, Arash Amini, Radu Marculescu, Ufuk Topcu

    Abstract: Social news websites, such as Reddit, have evolved into prominent platforms for sharing and discussing news. A key issue on social news websites sites is the formation of echo chambers, which often lead to the spread of highly biased or uncredible news. We develop a method to identify communities within a social news website that are prone to uncredible or highly biased news. We employ a user embe… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 11 pages, 2 figures, Under review in ICWSM 2024

  10. arXiv:2403.10384  [pdf, other

    cs.GT cs.MA eess.SY

    Coordination in Noncooperative Multiplayer Matrix Games via Reduced Rank Correlated Equilibria

    Authors: Jaehan Im, Yue Yu, David Fridovich-Keil, Ufuk Topcu

    Abstract: Coordination in multiplayer games enables players to avoid the lose-lose outcome that often arises at Nash equilibria. However, designing a coordination mechanism typically requires the consideration of the joint actions of all players, which becomes intractable in large-scale games. We develop a novel coordination mechanism, termed reduced rank correlated equilibria, which reduces the number of j… ▽ More

    Submitted 12 June, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

  11. arXiv:2402.10938  [pdf, other

    cs.CL cs.SI

    News Source Credibility Assessment: A Reddit Case Study

    Authors: Arash Amini, Yigit Ege Bayiz, Ashwin Ram, Radu Marculescu, Ufuk Topcu

    Abstract: In the era of social media platforms, identifying the credibility of online content is crucial to combat misinformation. We present the CREDiBERT (CREDibility assessment using Bi-directional Encoder Representations from Transformers), a source credibility assessment model fine-tuned for Reddit submissions focusing on political discourse as the main contribution. We adopt a semi-supervised training… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 12 pages; 3 figures

  12. arXiv:2402.08902  [pdf, other

    cs.RO cs.GT cs.LG cs.MA eess.SY

    Auto-Encoding Bayesian Inverse Games

    Authors: Xinjie Liu, Lasse Peters, Javier Alonso-Mora, Ufuk Topcu, David Fridovich-Keil

    Abstract: When multiple agents interact in a common environment, each agent's actions impact others' future decisions, and noncooperative dynamic games naturally capture this coupling. In interactive motion planning, however, agents typically do not have access to a complete model of the game, e.g., due to unknown objectives of other players. Therefore, we consider the inverse game problem, in which some pr… ▽ More

    Submitted 15 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  13. arXiv:2402.08570  [pdf, other

    cs.RO cs.AI cs.LG

    Online Foundation Model Selection in Robotics

    Authors: Po-han Li, Oyku Selin Toprak, Aditya Narayanan, Ufuk Topcu, Sandeep Chinchali

    Abstract: Foundation models have recently expanded into robotics after excelling in computer vision and natural language processing. The models are accessible in two ways: open-source or paid, closed-source options. Users with access to both face a problem when deciding between effective yet costly closed-source models and free but less powerful open-source alternatives. We call it the model selection probl… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  14. arXiv:2402.07069  [pdf, other

    cs.LG cs.AI cs.CL

    Using Large Language Models to Automate and Expedite Reinforcement Learning with Reward Machine

    Authors: Shayan Meshkat Alsadat, Jean-Raphael Gaglione, Daniel Neider, Ufuk Topcu, Zhe Xu

    Abstract: We present LARL-RM (Large language model-generated Automaton for Reinforcement Learning with Reward Machine) algorithm in order to encode high-level knowledge into reinforcement learning using automaton to expedite the reinforcement learning. Our method uses Large Language Models (LLM) to obtain high-level domain-specific knowledge using prompt engineering instead of providing the reinforcement le… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

  15. arXiv:2401.17173  [pdf, other

    cs.LG cs.AI

    Zero-Shot Reinforcement Learning via Function Encoders

    Authors: Tyler Ingebrand, Amy Zhang, Ufuk Topcu

    Abstract: Although reinforcement learning (RL) can solve many challenging sequential decision making problems, achieving zero-shot transfer across related tasks remains a challenge. The difficulty lies in finding a good representation for the current task so that the agent understands how it relates to previously seen tasks. To achieve zero-shot transfer, we introduce the function encoder, a representation… ▽ More

    Submitted 11 May, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

  16. arXiv:2312.13132  [pdf, other

    cs.CC cs.CR

    On the complexity of sabotage games for network security

    Authors: Dhananjay Raju, Georgios Bakirtzis, Ufuk Topcu

    Abstract: Securing dynamic networks against adversarial actions is challenging because of the need to anticipate and counter strategic disruptions by adversarial entities within complex network structures. Traditional game-theoretic models, while insightful, often fail to model the unpredictability and constraints of real-world threat assessment scenarios. We refine sabotage games to reflect the realistic l… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  17. arXiv:2312.01249  [pdf, other

    cs.RO cs.AI eess.SY

    A Multifidelity Sim-to-Real Pipeline for Verifiable and Compositional Reinforcement Learning

    Authors: Cyrus Neary, Christian Ellis, Aryaman Singh Samyal, Craig Lennon, Ufuk Topcu

    Abstract: We propose and demonstrate a compositional framework for training and verifying reinforcement learning (RL) systems within a multifidelity sim-to-real pipeline, in order to deploy reliable and adaptable RL policies on physical hardware. By decomposing complex robotic tasks into component subtasks and defining mathematical interfaces between them, the framework allows for the independent training a… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  18. arXiv:2311.14200  [pdf, other

    cs.SI eess.SY

    Prebunking Design as a Defense Mechanism Against Misinformation Propagation on Social Networks

    Authors: Yigit Ege Bayiz, Ufuk Topcu

    Abstract: The growing reliance on social media for news consumption necessitates effective countermeasures to mitigate the rapid spread of misinformation. Prebunking, a proactive method that arms users with accurate information before they come across false content, has garnered support from journalism and psychology experts. We formalize the problem of optimal prebunking as optimizing the timing of deliver… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: 10 pages, 3 figures, Submitted to PERCOM 2024

  19. arXiv:2311.06275  [pdf

    cs.AI cs.LG

    Algorithmic Robustness

    Authors: David Jensen, Brian LaMacchia, Ufuk Topcu, Pamela Wisniewski

    Abstract: Algorithmic robustness refers to the sustained performance of a computational system in the face of change in the nature of the environment in which that system operates or in the task that the system is meant to perform. Below, we motivate the importance of algorithmic robustness, present a conceptual framework, and highlight the relevant areas of research for which algorithmic robustness is rele… ▽ More

    Submitted 17 October, 2023; originally announced November 2023.

  20. arXiv:2311.06255  [pdf, ps, other

    cs.MA cs.AI cs.LG

    Privacy-Engineered Value Decomposition Networks for Cooperative Multi-Agent Reinforcement Learning

    Authors: Parham Gohari, Matthew Hale, Ufuk Topcu

    Abstract: In cooperative multi-agent reinforcement learning (Co-MARL), a team of agents must jointly optimize the team's long-term rewards to learn a designated task. Optimizing rewards as a team often requires inter-agent communication and data sharing, leading to potential privacy implications. We assume privacy considerations prohibit the agents from sharing their environment interaction data. Accordingl… ▽ More

    Submitted 12 September, 2023; originally announced November 2023.

    Comments: Paper accepted at 62nd IEEE Conference on Decision and Control

  21. arXiv:2311.01258  [pdf, other

    cs.AI cs.LO eess.SY

    Formal Methods for Autonomous Systems

    Authors: Tichakorn Wongpiromsarn, Mahsa Ghasemi, Murat Cubuktepe, Georgios Bakirtzis, Steven Carr, Mustafa O. Karabag, Cyrus Neary, Parham Gohari, Ufuk Topcu

    Abstract: Formal methods refer to rigorous, mathematical approaches to system development and have played a key role in establishing the correctness of safety-critical systems. The main building blocks of formal methods are models and specifications, which are analogous to behaviors and requirements in system design and give us the means to verify and synthesize system behaviors with formal guarantees. Th… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  22. arXiv:2310.18239  [pdf, other

    cs.AI cs.CL cs.FL cs.RO

    Fine-Tuning Language Models Using Formal Methods Feedback

    Authors: Yunhao Yang, Neel P. Bhatt, Tyler Ingebrand, William Ward, Steven Carr, Zhangyang Wang, Ufuk Topcu

    Abstract: Although pre-trained language models encode generic knowledge beneficial for planning and control, they may fail to generate appropriate control policies for domain-specific tasks. Existing fine-tuning methods use human feedback to address this limitation, however, sourcing human feedback is labor intensive and costly. We present a fully automated approach to fine-tune pre-trained language models… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

  23. arXiv:2310.00468  [pdf, ps, other

    cs.GT cs.AI

    Encouraging Inferable Behavior for Autonomy: Repeated Bimatrix Stackelberg Games with Observations

    Authors: Mustafa O. Karabag, Sophia Smith, David Fridovich-Keil, Ufuk Topcu

    Abstract: When interacting with other non-competitive decision-making agents, it is critical for an autonomous agent to have inferable behavior: Their actions must convey their intention and strategy. For example, an autonomous car's strategy must be inferable by the pedestrians interacting with the car. We model the inferability problem using a repeated bimatrix Stackelberg game with observations where a l… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

  24. arXiv:2309.10171  [pdf, other

    cs.CV cs.FL

    Specification-Driven Video Search via Foundation Models and Formal Verification

    Authors: Yunhao Yang, Jean-Raphaël Gaglione, Sandeep Chinchali, Ufuk Topcu

    Abstract: The increasing abundance of video data enables users to search for events of interest, e.g., emergency incidents. Meanwhile, it raises new concerns, such as the need for preserving privacy. Existing approaches to video search require either manual inspection or a deep learning model with massive training. We develop a method that uses recent advances in vision and language models, as well as forma… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 12 pages, 18 figures

  25. arXiv:2309.06420  [pdf, other

    eess.SY cs.AI cs.LG

    Verifiable Reinforcement Learning Systems via Compositionality

    Authors: Cyrus Neary, Aryaman Singh Samyal, Christos Verginis, Murat Cubuktepe, Ufuk Topcu

    Abstract: We propose a framework for verifiable and compositional reinforcement learning (RL) in which a collection of RL subsystems, each of which learns to accomplish a separate subtask, are composed to achieve an overall task. The framework consists of a high-level model, represented as a parametric Markov decision process, which is used to plan and analyze compositions of subsystems, and of the collecti… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2106.05864

  26. arXiv:2308.08017  [pdf, other

    cs.GT cs.LG eess.SY

    Active Inverse Learning in Stackelberg Trajectory Games

    Authors: Yue Yu, Jacob Levy, Negar Mehr, David Fridovich-Keil, Ufuk Topcu

    Abstract: Game-theoretic inverse learning is the problem of inferring the players' objectives from their actions. We formulate an inverse learning problem in a Stackelberg game between a leader and a follower, where each player's action is the trajectory of a dynamical system. We propose an active inverse learning method for the leader to infer which hypothesis among a finite set of candidates describes the… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  27. Multimodal Pretrained Models for Verifiable Sequential Decision-Making: Planning, Grounding, and Perception

    Authors: Yunhao Yang, Cyrus Neary, Ufuk Topcu

    Abstract: Recently developed pretrained models can encode rich world knowledge expressed in multiple modalities, such as text and images. However, the outputs of these models cannot be integrated into algorithms to solve sequential decision-making tasks. We develop an algorithm that utilizes the knowledge from pretrained models to construct and verify controllers for sequential decision-making tasks, and to… ▽ More

    Submitted 17 June, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted as full paper in AAMAS 2024

  28. arXiv:2306.13732  [pdf, other

    cs.AI cs.FL

    Reinforcement Learning with Temporal-Logic-Based Causal Diagrams

    Authors: Yash Paliwal, Rajarshi Roy, Jean-Raphaël Gaglione, Nasim Baharisangari, Daniel Neider, Xiaoming Duan, Ufuk Topcu, Zhe Xu

    Abstract: We study a class of reinforcement learning (RL) tasks where the objective of the agent is to accomplish temporally extended goals. In this setting, a common approach is to represent the tasks as deterministic finite automata (DFA) and integrate them into the state-space for RL algorithms. However, while these machines model the reward function, they often overlook the causal knowledge about the en… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

  29. arXiv:2306.06335  [pdf, other

    cs.LG cs.RO eess.SY

    How to Learn and Generalize From Three Minutes of Data: Physics-Constrained and Uncertainty-Aware Neural Stochastic Differential Equations

    Authors: Franck Djeumou, Cyrus Neary, Ufuk Topcu

    Abstract: We present a framework and algorithms to learn controlled dynamics models using neural stochastic differential equations (SDEs) -- SDEs whose drift and diffusion terms are both parametrized by neural networks. We construct the drift term to leverage a priori physics knowledge as inductive bias, and we design the diffusion term to represent a distance-aware estimate of the uncertainty in the learne… ▽ More

    Submitted 15 October, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Final submission to CoRL 2023

  30. arXiv:2306.06330  [pdf, other

    eess.SY cs.LG

    Autonomous Drifting with 3 Minutes of Data via Learned Tire Models

    Authors: Franck Djeumou, Jonathan Y. M. Goh, Ufuk Topcu, Avinash Balachandran

    Abstract: Near the limits of adhesion, the forces generated by a tire are nonlinear and intricately coupled. Efficient and accurate modelling in this region could improve safety, especially in emergency situations where high forces are required. To this end, we propose a novel family of tire force models based on neural ordinary differential equations and a neural-ExpTanh parameterization. These models are… ▽ More

    Submitted 16 October, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Final Submission at ICRA 2023

  31. arXiv:2305.17372  [pdf, other

    cs.MA cs.AI cs.GT cs.LG

    Reinforcement Learning With Reward Machines in Stochastic Games

    Authors: Jueming Hu, Jean-Raphael Gaglione, Yanze Wang, Zhe Xu, Ufuk Topcu, Yongming Liu

    Abstract: We investigate multi-agent reinforcement learning for stochastic games with complex tasks, where the reward functions are non-Markovian. We utilize reward machines to incorporate high-level knowledge of complex tasks. We develop an algorithm called Q-learning with reward machines for stochastic games (QRM-SG), to learn the best-response strategy at Nash equilibrium for each agent. In QRM-SG, we de… ▽ More

    Submitted 28 August, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

  32. arXiv:2305.16505  [pdf, other

    cs.LG

    Reward-Machine-Guided, Self-Paced Reinforcement Learning

    Authors: Cevahir Koprulu, Ufuk Topcu

    Abstract: Self-paced reinforcement learning (RL) aims to improve the data efficiency of learning by automatically creating sequences, namely curricula, of probability distributions over contexts. However, existing techniques for self-paced RL fail in long-horizon planning tasks that involve temporally extended behaviors. We hypothesize that taking advantage of prior knowledge about the underlying task struc… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: 9 pages, 11 figures. Accepted for UAI 2023

  33. arXiv:2305.15523  [pdf, other

    cs.IT cs.CV

    Task-aware Distributed Source Coding under Dynamic Bandwidth

    Authors: Po-han Li, Sravan Kumar Ankireddy, Ruihan Zhao, Hossein Nourkhiz Mahjoub, Ehsan Moradi-Pari, Ufuk Topcu, Sandeep Chinchali, Hyeji Kim

    Abstract: Efficient compression of correlated data is essential to minimize communication overload in multi-sensor networks. In such networks, each sensor independently compresses the data and transmits them to a central node due to limited communication bandwidth. A decoder at the central node decompresses and passes the data to a pre-trained machine learning-based task to generate the final output. Thus,… ▽ More

    Submitted 13 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Journal ref: NeurIPS 2023

  34. arXiv:2305.01473  [pdf, other

    cs.LG cs.LO math.OC

    Efficient Sensitivity Analysis for Parametric Robust Markov Chains

    Authors: Thom Badings, Sebastian Junges, Ahmadreza Marandi, Ufuk Topcu, Nils Jansen

    Abstract: We provide a novel method for sensitivity analysis of parametric robust Markov chains. These models incorporate parameters and sets of probability distributions to alleviate the often unrealistic assumption that precise probabilities are available. We measure sensitivity in terms of partial derivatives with respect to the uncertain transition probabilities regarding measures such as the expected r… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: To be presented at CAV 2023

  35. arXiv:2304.00163  [pdf, other

    cs.GT cs.LG

    Soft-Bellman Equilibrium in Affine Markov Games: Forward Solutions and Inverse Learning

    Authors: Shenghui Chen, Yue Yu, David Fridovich-Keil, Ufuk Topcu

    Abstract: Markov games model interactions among multiple players in a stochastic, dynamic environment. Each player in a Markov game maximizes its expected total discounted reward, which depends upon the policies of the other players. We formulate a class of Markov games, termed affine Markov games, where an affine reward function couples the players' actions. We introduce a novel solution concept, the soft-… ▽ More

    Submitted 8 September, 2023; v1 submitted 31 March, 2023; originally announced April 2023.

  36. arXiv:2303.04268  [pdf, ps, other

    cs.LG cs.AI

    On the Sample Complexity of Vanilla Model-Based Offline Reinforcement Learning with Dependent Samples

    Authors: Mustafa O. Karabag, Ufuk Topcu

    Abstract: Offline reinforcement learning (offline RL) considers problems where learning is performed using only previously collected samples and is helpful for the settings in which collecting new data is costly or risky. In model-based offline RL, the learner performs estimation (or optimization) using a model constructed according to the empirical transition frequencies. We analyze the sample complexity o… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: Accepted to AAAI-23

  37. arXiv:2302.14242  [pdf, other

    cs.RO

    Learning Sparse Control Tasks from Pixels by Latent Nearest-Neighbor-Guided Explorations

    Authors: Ruihan Zhao, Ufuk Topcu, Sandeep Chinchali, Mariano Phielipp

    Abstract: Recent progress in deep reinforcement learning (RL) and computer vision enables artificial agents to solve complex tasks, including locomotion, manipulation and video games from high-dimensional pixel observations. However, domain specific reward functions are often engineered to provide sufficient learning signals, requiring expert knowledge. While it is possible to train vision-based RL agents u… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  38. arXiv:2301.08811  [pdf, ps, other

    cs.MA cs.AI cs.CR

    Differential Privacy in Cooperative Multiagent Planning

    Authors: Bo Chen, Calvin Hawkins, Mustafa O. Karabag, Cyrus Neary, Matthew Hale, Ufuk Topcu

    Abstract: Privacy-aware multiagent systems must protect agents' sensitive data while simultaneously ensuring that agents accomplish their shared objectives. Towards this goal, we propose a framework to privatize inter-agent communications in cooperative multiagent decision-making problems. We study sequential decision-making problems formulated as cooperative Markov games with reach-avoid objectives. We app… ▽ More

    Submitted 20 January, 2023; originally announced January 2023.

  39. arXiv:2301.03565  [pdf, other

    eess.SY cs.LG math.OC

    Physics-Informed Kernel Embeddings: Integrating Prior System Knowledge with Data-Driven Control

    Authors: Adam J. Thorpe, Cyrus Neary, Franck Djeumou, Meeko M. K. Oishi, Ufuk Topcu

    Abstract: Data-driven control algorithms use observations of system dynamics to construct an implicit model for the purpose of control. However, in practice, data-driven techniques often require excessive sample sizes, which may be infeasible in real-world scenarios where only limited observations of the system are available. Furthermore, purely data-driven methods often neglect useful a priori knowledge, s… ▽ More

    Submitted 9 January, 2023; originally announced January 2023.

  40. arXiv:2301.01219  [pdf, other

    cs.LG cs.AI cs.FL math.OC

    Task-Guided IRL in POMDPs that Scales

    Authors: Franck Djeumou, Christian Ellis, Murat Cubuktepe, Craig Lennon, Ufuk Topcu

    Abstract: In inverse reinforcement learning (IRL), a learning agent infers a reward function encoding the underlying task using demonstrations from experts. However, many existing IRL techniques make the often unrealistic assumption that the agent has access to full information about the environment. We remove this assumption by developing an algorithm for IRL in partially observable Markov decision process… ▽ More

    Submitted 30 December, 2022; originally announced January 2023.

    Comments: Final submission to the Artificial Intelligence journal (Elsevier). arXiv admin note: substantial text overlap with arXiv:2105.14073

  41. arXiv:2212.01944  [pdf, other

    cs.FL cs.CL

    Automaton-Based Representations of Task Knowledge from Generative Language Models

    Authors: Yunhao Yang, Jean-Raphaël Gaglione, Cyrus Neary, Ufuk Topcu

    Abstract: Automaton-based representations of task knowledge play an important role in control and planning for sequential decision-making problems. However, obtaining the high-level task knowledge required to build such automata is often difficult. Meanwhile, large-scale generative language models (GLMs) can automatically generate relevant task knowledge. However, the textual outputs from GLMs cannot be for… ▽ More

    Submitted 9 August, 2023; v1 submitted 4 December, 2022; originally announced December 2022.

    Comments: Submitted to JAIR

  42. arXiv:2212.00916  [pdf, other

    cs.LO cs.AI cs.FL cs.LG

    Learning Temporal Logic Properties: an Overview of Two Recent Methods

    Authors: Jean-Raphaël Gaglione, Rajarshi Roy, Nasim Baharisangari, Daniel Neider, Zhe Xu, Ufuk Topcu

    Abstract: Learning linear temporal logic (LTL) formulas from examples labeled as positive or negative has found applications in inferring descriptions of system behavior. We summarize two methods to learn LTL formulas from examples in two different problem settings. The first method assumes noise in the labeling of the examples. For that, they define the problem of inferring an LTL formula that must be cons… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: Appears in Proceedings of AAAI FSS-22 Symposium "Lessons Learned for Autonomous Assessment of Machine Abilities (LLAAMA)"

    ACM Class: I.2; F.4.3

  43. arXiv:2212.00893  [pdf, other

    cs.LG cs.AI eess.SY

    Compositional Learning of Dynamical System Models Using Port-Hamiltonian Neural Networks

    Authors: Cyrus Neary, Ufuk Topcu

    Abstract: Many dynamical systems -- from robots interacting with their surroundings to large-scale multiphysics systems -- involve a number of interacting subsystems. Toward the objective of learning composite models of such systems from data, we present i) a framework for compositional neural networks, ii) algorithms to train these models, iii) a method to compose the learned models, iv) theoretical result… ▽ More

    Submitted 13 May, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: Paper accepted for publication at L4DC 2023

  44. arXiv:2211.11741  [pdf, other

    eess.SY cs.LO

    Sensor Placement for Online Fault Diagnosis

    Authors: Dhananjay Raju, Georgios Bakirtzis, Ufuk Topcu

    Abstract: Fault diagnosis is the problem of determining a set of faulty system components that explain discrepancies between observed and expected behavior. Due to the intrinsic relation between observations and sensors placed on a system, sensors' fault diagnosis and placement are mutually dependent. Consequently, it is imperative to solve the fault diagnosis and sensor placement problems jointly. One appr… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  45. arXiv:2211.04617  [pdf, other

    cs.SI

    Countering Misinformation on Social Networks Using Graph Alterations

    Authors: Yigit E. Bayiz, Ufuk Topcu

    Abstract: We restrict the propagation of misinformation in a social-media-like environment while preserving the spread of correct information. We model the environment as a random network of users in which each news item propagates in the network in consecutive cascades. Existing studies suggest that the cascade behaviors of misinformation and correct information are affected differently by user polarizatio… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: 10 pages, 6 figures

  46. arXiv:2210.01221  [pdf, other

    cs.GT math.OC

    Cost Design in Atomic Routing Games

    Authors: Yue Yu, Shenghui Chen, David Fridovich-Keil, Ufuk Topcu

    Abstract: An atomic routing game is a multiplayer game on a directed graph. Each player in the game chooses a path -- a sequence of links that connect its origin node to its destination node -- with the lowest cost, where the cost of each link is a function of all players' choices. We develop a novel numerical method to design the link cost function in atomic routing games such that the players' choices at… ▽ More

    Submitted 17 May, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

  47. Alternating Direction Method of Multipliers for Decomposable Saddle-Point Problems

    Authors: Mustafa O. Karabag, David Fridovich-Keil, Ufuk Topcu

    Abstract: Saddle-point problems appear in various settings including machine learning, zero-sum stochastic games, and regression problems. We consider decomposable saddle-point problems and study an extension of the alternating direction method of multipliers to such saddle-point problems. Instead of solving the original saddle-point problem directly, this algorithm solves smaller saddle-point problems by e… ▽ More

    Submitted 27 December, 2022; v1 submitted 9 September, 2022; originally announced September 2022.

    Comments: Accepted to 58th Annual Allerton Conference on Communication, Control, and Computing

  48. arXiv:2209.02650  [pdf, other

    cs.LO cs.AI

    Learning Interpretable Temporal Properties from Positive Examples Only

    Authors: Rajarshi Roy, Jean-Raphaël Gaglione, Nasim Baharisangari, Daniel Neider, Zhe Xu, Ufuk Topcu

    Abstract: We consider the problem of explaining the temporal behavior of black-box systems using human-interpretable models. To this end, based on recent research trends, we rely on the fundamental yet interpretable models of deterministic finite automata (DFAs) and linear temporal logic (LTL) formulas. In contrast to most existing works for learning DFAs and LTL formulas, we rely on only positive examples.… ▽ More

    Submitted 2 March, 2023; v1 submitted 6 September, 2022; originally announced September 2022.

    Comments: Full version of the paper that appeared in AAAI23

    ACM Class: F.4.1; I.2.6

  49. arXiv:2208.13687  [pdf, other

    cs.AI cs.LO eess.SY math.CT

    Categorical semantics of compositional reinforcement learning

    Authors: Georgios Bakirtzis, Michail Savvas, Ufuk Topcu

    Abstract: Reinforcement learning (RL) often requires decomposing a problem into subtasks and composing learned behaviors on these tasks. Compositionality in RL has the potential to create modular subtask units that interface with other system capabilities. However, generating compositional models requires the characterization of minimal assumptions for the robustness of the compositional feature. We develop… ▽ More

    Submitted 29 August, 2022; originally announced August 2022.

  50. arXiv:2207.08275  [pdf, other

    cs.GT

    Inverse Matrix Games with Unique Quantal Response Equilibrium

    Authors: Yue Yu, Jonathan Salfity, David Fridovich-Keil, Ufuk Topcu

    Abstract: In an inverse game problem, one needs to infer the cost function of the players in a game such that a desired joint strategy is a Nash equilibrium. We study the inverse game problem for a class of multiplayer matrix games, where the cost perceived by each player is corrupted by random noise. We provide sufficient conditions for the players' quantal response equilibrium -- a generalization of the N… ▽ More

    Submitted 13 October, 2022; v1 submitted 17 July, 2022; originally announced July 2022.