-
Perspectives on Contractivity in Control, Optimization, and Learning
Authors:
Alexander Davydov,
Francesco Bullo
Abstract:
Contraction theory is a mathematical framework for studying the convergence, robustness, and modularity properties of dynamical systems and algorithms. In this opinion paper, we provide five main opinions on the virtues of contraction theory. These opinions are (i) contraction theory is a unifying framework emerging from classical and modern works, (ii) contractivity is computationally-friendly, r…
▽ More
Contraction theory is a mathematical framework for studying the convergence, robustness, and modularity properties of dynamical systems and algorithms. In this opinion paper, we provide five main opinions on the virtues of contraction theory. These opinions are (i) contraction theory is a unifying framework emerging from classical and modern works, (ii) contractivity is computationally-friendly, robust, and modular stability, (iii) numerous dynamical systems are contracting, (iv) contraction theory is relevant to modern applications, and (v) contraction theory can be vastly extended in numerous directions. We survey recent theoretical and applied research in each of these five directions.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
Exponential Stability of Parametric Optimization-Based Controllers via Lur'e Contractivity
Authors:
Alexander Davydov,
Francesco Bullo
Abstract:
In this letter, we investigate sufficient conditions for the exponential stability of LTI systems driven by controllers derived from parametric optimization problems. Our primary focus is on parametric projection controllers, namely parametric programs whose objective function is the squared distance to a nominal controller. Leveraging the virtual system method of analysis and a novel contractivit…
▽ More
In this letter, we investigate sufficient conditions for the exponential stability of LTI systems driven by controllers derived from parametric optimization problems. Our primary focus is on parametric projection controllers, namely parametric programs whose objective function is the squared distance to a nominal controller. Leveraging the virtual system method of analysis and a novel contractivity result for Lur'e systems, we establish a sufficient LMI condition for the exponential stability of an LTI system with a parametric projection-based controller. Separately, we prove additional results for single-integrator systems. Finally, we apply our results to state-dependent saturated control systems and control barrier function-based control and provide numerical simulations.
△ Less
Submitted 15 April, 2024; v1 submitted 12 March, 2024;
originally announced March 2024.
-
On Weakly Contracting Dynamics for Convex Optimization
Authors:
Veronica Centorrino,
Alexander Davydov,
Anand Gokhale,
Giovanni Russo,
Francesco Bullo
Abstract:
We analyze the convergence behavior of \emph{globally weakly} and \emph{locally strongly contracting} dynamics. Such dynamics naturally arise in the context of convex optimization problems with a unique minimizer. We show that convergence to the equilibrium is \emph{linear-exponential}, in the sense that the distance between each solution and the equilibrium is upper bounded by a function that fir…
▽ More
We analyze the convergence behavior of \emph{globally weakly} and \emph{locally strongly contracting} dynamics. Such dynamics naturally arise in the context of convex optimization problems with a unique minimizer. We show that convergence to the equilibrium is \emph{linear-exponential}, in the sense that the distance between each solution and the equilibrium is upper bounded by a function that first decreases linearly and then exponentially. As we show, the linear-exponential dependency arises naturally in certain dynamics with saturations. Additionally, we provide a sufficient condition for local input-to-state stability. Finally, we illustrate our results on, and propose a conjecture for, continuous-time dynamical systems solving linear programs.
△ Less
Submitted 15 May, 2024; v1 submitted 12 March, 2024;
originally announced March 2024.
-
Learning Neural Contracting Dynamics: Extended Linearization and Global Guarantees
Authors:
Sean Jaffe,
Alexander Davydov,
Deniz Lapsekili,
Ambuj Singh,
Francesco Bullo
Abstract:
Global stability and robustness guarantees in learned dynamical systems are essential to ensure well-behavedness of the systems in the face of uncertainty. We present Extended Linearized Contracting Dynamics (ELCD), the first neural network-based dynamical system with global contractivity guarantees in arbitrary metrics. The key feature of ELCD is a parametrization of the extended linearization of…
▽ More
Global stability and robustness guarantees in learned dynamical systems are essential to ensure well-behavedness of the systems in the face of uncertainty. We present Extended Linearized Contracting Dynamics (ELCD), the first neural network-based dynamical system with global contractivity guarantees in arbitrary metrics. The key feature of ELCD is a parametrization of the extended linearization of the nonlinear vector field. In its most basic form, ELCD is guaranteed to be (i) globally exponentially stable, (ii) equilibrium contracting, and (iii) globally contracting with respect to some metric. To allow for contraction with respect to more general metrics in the data space, we train diffeomorphisms between the data space and a latent space and enforce contractivity in the latent space, which ensures global contractivity in the data space. We demonstrate the performance of ELCD on the high dimensional LASA, multi-link pendulum, and Rosenbrock datasets.
△ Less
Submitted 29 May, 2024; v1 submitted 12 February, 2024;
originally announced February 2024.
-
IDKM: Memory Efficient Neural Network Quantization via Implicit, Differentiable k-Means
Authors:
Sean Jaffe,
Ambuj K. Singh,
Francesco Bullo
Abstract:
Compressing large neural networks with minimal performance loss is crucial to enabling their deployment on edge devices. (Cho et al., 2022) proposed a weight quantization method that uses an attention-based clustering algorithm called differentiable $k$-means (DKM). Despite achieving state-of-the-art results, DKM's performance is constrained by its heavy memory dependency. We propose an implicit,…
▽ More
Compressing large neural networks with minimal performance loss is crucial to enabling their deployment on edge devices. (Cho et al., 2022) proposed a weight quantization method that uses an attention-based clustering algorithm called differentiable $k$-means (DKM). Despite achieving state-of-the-art results, DKM's performance is constrained by its heavy memory dependency. We propose an implicit, differentiable $k$-means algorithm (IDKM), which eliminates the major memory restriction of DKM. Let $t$ be the number of $k$-means iterations, $m$ be the number of weight-vectors, and $b$ be the number of bits per cluster address. IDKM reduces the overall memory complexity of a single $k$-means layer from $\mathcal{O}(t \cdot m \cdot 2^b)$ to $\mathcal{O}( m \cdot 2^b)$. We also introduce a variant, IDKM with Jacobian-Free-Backpropagation (IDKM-JFB), for which the time complexity of the gradient calculation is independent of $t$ as well. We provide a proof of concept of our methods by showing that, under the same settings, IDKM achieves comparable performance to DKM with less compute time and less memory. We also use IDKM and IDKM-JFB to quantize a large neural network, Resnet18, on hardware where DKM cannot train at all.
△ Less
Submitted 15 December, 2023; v1 submitted 12 December, 2023;
originally announced December 2023.
-
Positive Competitive Networks for Sparse Reconstruction
Authors:
Veronica Centorrino,
Anand Gokhale,
Alexander Davydov,
Giovanni Russo,
Francesco Bullo
Abstract:
We propose and analyze a continuous-time firing-rate neural network, the positive firing-rate competitive network (\pfcn), to tackle sparse reconstruction problems with non-negativity constraints. These problems, which involve approximating a given input stimulus from a dictionary using a set of sparse (active) neurons, play a key role in a wide range of domains, including for example neuroscience…
▽ More
We propose and analyze a continuous-time firing-rate neural network, the positive firing-rate competitive network (\pfcn), to tackle sparse reconstruction problems with non-negativity constraints. These problems, which involve approximating a given input stimulus from a dictionary using a set of sparse (active) neurons, play a key role in a wide range of domains, including for example neuroscience, signal processing, and machine learning. First, by leveraging the theory of proximal operators, we relate the equilibria of a family of continuous-time firing-rate neural networks to the optimal solutions of sparse reconstruction problems. Then, we prove that the \pfcn is a positive system and give rigorous conditions for the convergence to the equilibrium. Specifically, we show that the convergence: (i) only depends on a property of the dictionary; (ii) is linear-exponential, in the sense that initially the convergence rate is at worst linear and then, after a transient, it becomes exponential. We also prove a number of technical results to assess the contractivity properties of the neural dynamics of interest. Our analysis leverages contraction theory to characterize the behavior of a family of firing-rate competitive networks for sparse reconstruction with and without non-negativity constraints. Finally, we validate the effectiveness of our approach via a numerical example.
△ Less
Submitted 22 March, 2024; v1 submitted 7 November, 2023;
originally announced November 2023.
-
Multiplayer Homicidal Chauffeur Reach-Avoid Games: A Pursuit Enclosure Function Approach
Authors:
Rui Yan,
Xiaoming Duan,
Rui Zou,
Xin He,
Zongying Shi,
Francesco Bullo
Abstract:
This paper presents a multiplayer Homicidal Chauffeur reach-avoid differential game, which involves Dubins-car pursuers and simple-motion evaders. The goal of the pursuers is to cooperatively protect a planar convex region from the evaders, who strive to reach the region. We propose a cooperative strategy for the pursuers based on subgames for multiple pursuers against one evader and optimal task…
▽ More
This paper presents a multiplayer Homicidal Chauffeur reach-avoid differential game, which involves Dubins-car pursuers and simple-motion evaders. The goal of the pursuers is to cooperatively protect a planar convex region from the evaders, who strive to reach the region. We propose a cooperative strategy for the pursuers based on subgames for multiple pursuers against one evader and optimal task allocation. We introduce pursuit enclosure functions (PEFs) and propose a new enclosure region pursuit (ERP) winning approach that supports forward analysis for the strategy synthesis in the subgames. We show that if a pursuit coalition is able to defend the region against an evader under the ERP winning, then no more than two pursuers in the coalition are necessarily needed. We also propose a steer-to-ERP approach to certify the ERP winning and synthesize the ERP winning strategy. To implement the strategy, we introduce a positional PEF and provide the necessary parameters, states, and strategies that ensure the ERP winning for both one pursuer and two pursuers against one evader. Additionally, we formulate a binary integer program using the subgame outcomes to maximize the captured evaders in the ERP winning for the pursuit task allocation. Finally, we propose a multiplayer receding-horizon strategy where the ERP winnings are checked in each horizon, the task is allocated, and the strategies of the pursuers are determined. Numerical examples are provided to illustrate the results.
△ Less
Submitted 22 December, 2023; v1 submitted 4 November, 2023;
originally announced November 2023.
-
Singular Perturbation via Contraction Theory
Authors:
Liliaokeawawa Cothren,
Francesco Bullo,
Emiliano Dall'Anese
Abstract:
In this paper, we provide a novel contraction-theoretic approach to analyze two-time scale systems. In our proposed framework, systems enjoy several robustness properties, which can lead to a more complete characterization of their behaviors. Key assumptions are the contractivity of the fast sub-system and of the reduced model, combined with an explicit upper bound on the time-scale parameter. For…
▽ More
In this paper, we provide a novel contraction-theoretic approach to analyze two-time scale systems. In our proposed framework, systems enjoy several robustness properties, which can lead to a more complete characterization of their behaviors. Key assumptions are the contractivity of the fast sub-system and of the reduced model, combined with an explicit upper bound on the time-scale parameter. For two-time scale systems subject to disturbances, we show that the distance between solutions of the nominal system and solutions of its reduced model is uniformly upper bounded by a function of contraction rates, Lipschitz constants, the time-scale parameter, and the time variability of the disturbances. We also show local contractivity of the two-time scale system and give sufficient conditions for global contractivity. We then consider two special cases: for autonomous nonlinear systems we obtain sharper bounds than our general results and for linear time-invariant systems we present novel bounds based upon log norms and induced norms. Finally, we apply our theory to two application areas -- online feedback optimization and Stackelberg games -- and obtain new individual tracking error bounds showing that solutions converge to their (time-varying) optimizer and computing overall contraction rates.
△ Less
Submitted 30 January, 2024; v1 submitted 11 October, 2023;
originally announced October 2023.
-
RoSSO: A High-Performance Python Package for Robotic Surveillance Strategy Optimization Using JAX
Authors:
Yohan John,
Connor Hughes,
Gilberto Diaz-Garcia,
Jason R. Marden,
Francesco Bullo
Abstract:
To enable the computation of effective randomized patrol routes for single- or multi-robot teams, we present RoSSO, a Python package designed for solving Markov chain optimization problems. We exploit machine-learning techniques such as reverse-mode automatic differentiation and constraint parametrization to achieve superior efficiency compared to general-purpose nonlinear programming solvers. Add…
▽ More
To enable the computation of effective randomized patrol routes for single- or multi-robot teams, we present RoSSO, a Python package designed for solving Markov chain optimization problems. We exploit machine-learning techniques such as reverse-mode automatic differentiation and constraint parametrization to achieve superior efficiency compared to general-purpose nonlinear programming solvers. Additionally, we supplement a game-theoretic stochastic surveillance formulation in the literature with a novel greedy algorithm and multi-robot extension. We close with numerical results for a police district in downtown San Francisco that demonstrate RoSSO's capabilities on our new formulations and the prior work.
△ Less
Submitted 15 September, 2023;
originally announced September 2023.
-
Contractivity of Distributed Optimization and Nash Seeking Dynamics
Authors:
Anand Gokhale,
Alexander Davydov,
Francesco Bullo
Abstract:
In this letter, we study distributed optimization and Nash equilibrium-seeking dynamics from a contraction theoretic perspective. Our first result is a novel bound on the logarithmic norm of saddle matrices. Second, for distributed gradient flows based upon incidence and Laplacian constraints over arbitrary topologies, we establish strong contractivity over an appropriate invariant vector subspace…
▽ More
In this letter, we study distributed optimization and Nash equilibrium-seeking dynamics from a contraction theoretic perspective. Our first result is a novel bound on the logarithmic norm of saddle matrices. Second, for distributed gradient flows based upon incidence and Laplacian constraints over arbitrary topologies, we establish strong contractivity over an appropriate invariant vector subspace. Third, we give sufficient conditions for strong contractivity in pseudogradient and best response games with complete information, show the equivalence of these conditions, and consider the special case of aggregative games.
△ Less
Submitted 21 September, 2023; v1 submitted 11 September, 2023;
originally announced September 2023.
-
A Stochastic Surveillance Stackelberg Game: Co-Optimizing Defense Placement and Patrol Strategy
Authors:
Yohan John,
Gilberto Diaz-Garcia,
Xiaoming Duan,
Jason R. Marden,
Francesco Bullo
Abstract:
Stochastic patrol routing is known to be advantageous in adversarial settings; however, the optimal choice of stochastic routing strategy is dependent on a model of the adversary. We adopt a worst-case omniscient adversary model from the literature and extend the formulation to accommodate heterogeneous defenses at the various nodes of the graph. Introducing this heterogeneity leads to interesting…
▽ More
Stochastic patrol routing is known to be advantageous in adversarial settings; however, the optimal choice of stochastic routing strategy is dependent on a model of the adversary. We adopt a worst-case omniscient adversary model from the literature and extend the formulation to accommodate heterogeneous defenses at the various nodes of the graph. Introducing this heterogeneity leads to interesting new patrol strategies. We identify efficient methods for computing these strategies in certain classes of graphs. We assess the effectiveness of these strategies via comparison to an upper bound on the value of the game. Finally, we leverage the heterogeneous defense formulation to develop novel defense placement algorithms that complement the patrol strategies.
△ Less
Submitted 20 February, 2024; v1 submitted 28 August, 2023;
originally announced August 2023.
-
Time-Varying Convex Optimization: A Contraction and Equilibrium Tracking Approach
Authors:
Alexander Davydov,
Veronica Centorrino,
Anand Gokhale,
Giovanni Russo,
Francesco Bullo
Abstract:
In this article, we provide a novel and broadly-applicable contraction-theoretic approach to continuous-time time-varying convex optimization. For any parameter-dependent contracting dynamics, we show that the tracking error is asymptotically proportional to the rate of change of the parameter with proportionality constant upper bounded by Lipschitz constant in which the parameter appears divided…
▽ More
In this article, we provide a novel and broadly-applicable contraction-theoretic approach to continuous-time time-varying convex optimization. For any parameter-dependent contracting dynamics, we show that the tracking error is asymptotically proportional to the rate of change of the parameter with proportionality constant upper bounded by Lipschitz constant in which the parameter appears divided by the contraction rate of the dynamics squared. We additionally establish that any parameter-dependent contracting dynamics can be augmented with a feedforward prediction term to ensure that the tracking error converges to zero exponentially quickly. To apply these results to time-varying convex optimization problems, we establish the strong infinitesimal contractivity of dynamics solving three canonical problems, namely monotone inclusions, linear equality-constrained problems, and composite minimization problems. For each of these problems, we prove the sharpest-known rates of contraction and provide explicit tracking error bounds between solution trajectories and minimizing trajectories. We validate our theoretical results on three numerical examples including an application to control-barrier function based controller design.
△ Less
Submitted 29 May, 2024; v1 submitted 24 May, 2023;
originally announced May 2023.
-
Learning Robust Data-based LQG Controllers from Noisy Data
Authors:
Wenjie Liu,
Jian Sun,
Gang Wang,
Francesco Bullo,
Jie Chen
Abstract:
This paper addresses the joint state estimation and control problems for unknown linear time-invariant systems subject to both process and measurement noise. The aim is to redesign the linear quadratic Gaussian (LQG) controller based solely on data. The LQG controller comprises a linear quadratic regulator (LQR) and a steady-state Kalman observer; while the data-based LQR design problem has been p…
▽ More
This paper addresses the joint state estimation and control problems for unknown linear time-invariant systems subject to both process and measurement noise. The aim is to redesign the linear quadratic Gaussian (LQG) controller based solely on data. The LQG controller comprises a linear quadratic regulator (LQR) and a steady-state Kalman observer; while the data-based LQR design problem has been previously studied, constructing the Kalman gain and the LQG controller from noisy data presents a novel challenge.
In this work, a data-based formulation for computing the steady-state Kalman gain is proposed based on semi-definite programming (SDP) using some noise-free input-state-output data. Additionally, a data-based LQG controller is developed, which is shown to be equivalent to the model-based LQG controller. For cases where offline data are corrupted by noise, a robust data-based observer gain is constructed by tackling a relaxed SDP. The proposed controllers are proven to achieve robust global exponential stability (RGES) for state estimation and input-to-state practical stability (ISpS) under standard conditions. Finally, numerical tests are conducted to validate the proposed controllers' correctness and effectiveness.
△ Less
Submitted 2 May, 2023;
originally announced May 2023.
-
Non-Euclidean Monotone Operator Theory and Applications
Authors:
Alexander Davydov,
Saber Jafarpour,
Anton V. Proskurnikov,
Francesco Bullo
Abstract:
While monotone operator theory is traditionally studied on Hilbert spaces, many interesting problems in data science and machine learning arise naturally in finite-dimensional vector spaces endowed with non-Euclidean norms, such as diagonally-weighted $\ell_1$ or $\ell_\infty$ norms. This paper provides a natural generalization of monotone operator theory to finite-dimensional non-Euclidean spaces…
▽ More
While monotone operator theory is traditionally studied on Hilbert spaces, many interesting problems in data science and machine learning arise naturally in finite-dimensional vector spaces endowed with non-Euclidean norms, such as diagonally-weighted $\ell_1$ or $\ell_\infty$ norms. This paper provides a natural generalization of monotone operator theory to finite-dimensional non-Euclidean spaces. The key tools are weak pairings and logarithmic norms. We show that the resolvent and reflected resolvent operators of non-Euclidean monotone mappings exhibit similar properties to their counterparts in Hilbert spaces. Furthermore, classical iterative methods and splitting methods for finding zeros of monotone operators are shown to converge in the non-Euclidean case. We apply our theory to equilibrium computation and Lipschitz constant estimation of recurrent neural networks, obtaining novel iterations and tighter upper bounds via forward-backward splitting.
△ Less
Submitted 20 March, 2023;
originally announced March 2023.
-
Semicontraction and Synchronization of Kuramoto-Sakaguchi Oscillator Networks
Authors:
Robin Delabays,
Francesco Bullo
Abstract:
This paper studies the celebrated Kuramoto-Sakaguchi model of coupled oscillators adopting two recent concepts. First, we consider appropriately-defined subsets of the $n$-torus called winding cells. Second, we analyze the semicontractivity of the model, i.e., the property that the distance between trajectories decreases when measured according to a seminorm.
This paper establishes the local sem…
▽ More
This paper studies the celebrated Kuramoto-Sakaguchi model of coupled oscillators adopting two recent concepts. First, we consider appropriately-defined subsets of the $n$-torus called winding cells. Second, we analyze the semicontractivity of the model, i.e., the property that the distance between trajectories decreases when measured according to a seminorm.
This paper establishes the local semicontractivity of the Kuramoto-Sakaguchi model, which is equivalent to the local contractivity for the reduced model. The reduced model is defined modulo the rotational symmetry. The domains where the system is semicontracting are convex phase-cohesive subsets of winding cells. Our sufficient conditions and estimates of the semicontracting domains are less conservative and more explicit than in previous works. Based on semicontraction on phase-cohesive subsets, we establish the "at most uniqueness" of synchronous states within these domains, thereby characterizing the multistability of this model.
△ Less
Submitted 8 May, 2023; v1 submitted 17 March, 2023;
originally announced March 2023.
-
Euclidean Contractivity of Neural Networks with Symmetric Weights
Authors:
Veronica Centorrino,
Anand Gokhale,
Alexander Davydov,
Giovanni Russo,
Francesco Bullo
Abstract:
This paper investigates stability conditions of continuous-time Hopfield and firing-rate neural networks by leveraging contraction theory. First, we present a number of useful general algebraic results on matrix polytopes and products of symmetric matrices. Then, we give sufficient conditions for strong and weak Euclidean contractivity, i.e., contractivity with respect to the $\ell_2$ norm, of bot…
▽ More
This paper investigates stability conditions of continuous-time Hopfield and firing-rate neural networks by leveraging contraction theory. First, we present a number of useful general algebraic results on matrix polytopes and products of symmetric matrices. Then, we give sufficient conditions for strong and weak Euclidean contractivity, i.e., contractivity with respect to the $\ell_2$ norm, of both models with symmetric weights and (possibly) non-smooth activation functions. Our contraction analysis leads to contraction rates which are log-optimal in almost all symmetric synaptic matrices. Finally, we use our results to propose a firing-rate neural network model to solve a quadratic optimization problem with box constraints.
△ Less
Submitted 12 May, 2023; v1 submitted 26 February, 2023;
originally announced February 2023.
-
Convergence, Consensus and Dissensus in the Weighted-Median Opinion Dynamics
Authors:
Wenjun Mei,
Julien M. Hendrickx,
Ge Chen,
Francesco Bullo,
Florian Dörfler
Abstract:
Mechanistic and tractable mathematical models play a key role in understanding how social influence shapes public opinions. Recently, a weighted-median mechanism has been proposed as a new micro-foundation of opinion dynamics and validated via experimental data. Numerical studies also indicate that this new mechanism recreates some non-trivial real-world features of opinion evolution. In this pape…
▽ More
Mechanistic and tractable mathematical models play a key role in understanding how social influence shapes public opinions. Recently, a weighted-median mechanism has been proposed as a new micro-foundation of opinion dynamics and validated via experimental data. Numerical studies also indicate that this new mechanism recreates some non-trivial real-world features of opinion evolution. In this paper, we conduct a thorough theoretical analysis of the weighted-median opinion dynamics. We fully characterize the set of all equilibria, and we establish the almost-sure finite-time convergence for any initial condition. Moreover, we prove a necessary and sufficient graph-theoretic condition for the almost-sure convergence to consensus, as well as a sufficient graph-theoretic condition for almost-sure persistent dissensus. It turns out that the weighted-median opinion dynamics, despite its simplicity in form, exhibit rich dynamical behavior that depends on some delicate network structures. To complement our sufficient conditions for almost-sure dissensus, we further prove that, given the influence network, determining whether the system almost surely achieves persistent dissensus is NP-hard, which reflects the complexity the network topology contributes to opinion evolution.
△ Less
Submitted 17 December, 2022;
originally announced December 2022.
-
Contractivity of the Method of Successive Approximations for Optimal Control
Authors:
Kevin D. Smith,
Francesco Bullo
Abstract:
Strongly contracting dynamical systems have numerous properties (e.g., incremental ISS), find widespread applications (e.g., in controls and learning), and their study is receiving increasing attention. This work starts with the simple observation that, given a strongly contracting system, its adjoint dynamical system is also strongly contracting, with the same rate, with respect to the dual norm,…
▽ More
Strongly contracting dynamical systems have numerous properties (e.g., incremental ISS), find widespread applications (e.g., in controls and learning), and their study is receiving increasing attention. This work starts with the simple observation that, given a strongly contracting system, its adjoint dynamical system is also strongly contracting, with the same rate, with respect to the dual norm, under time reversal. As main implication of this dual contractivity, we show that the classic Method of Successive Approximations (MSA), an indirect method in optimal control, is a contraction mapping for short optimization intervals or large contraction rates. Consequently, we establish new convergence conditions for the MSA algorithm, which further imply uniqueness of the optimal control and sufficiency of Pontryagin's minimum principle under additional assumptions.
△ Less
Submitted 16 December, 2022; v1 submitted 20 September, 2022;
originally announced September 2022.
-
Verifying $k$-Contraction without Computing $k$-Compounds
Authors:
Omri Dalin,
Ron Ofir,
Eyal Bar Shalom,
Alexander Ovseevich,
Francesco Bullo,
Michael Margaliot
Abstract:
Compound matrices have found applications in many fields of science including systems and control theory. In particular, a sufficient condition for $k$-contraction is that a logarithmic norm (also called matrix measure) of the $k$-additive compound of the Jacobian is uniformly negative. However, this may be difficult to check in practice because the $k$-additive compound of an $n\times n$ matrix h…
▽ More
Compound matrices have found applications in many fields of science including systems and control theory. In particular, a sufficient condition for $k$-contraction is that a logarithmic norm (also called matrix measure) of the $k$-additive compound of the Jacobian is uniformly negative. However, this may be difficult to check in practice because the $k$-additive compound of an $n\times n$ matrix has dimensions $\binom{n}{k}\times \binom{n}{k}$. For an $n\times n$ matrix $A$, we prove a duality relation between the $k$ and $(n-k)$ compounds of $A$. We use this duality relation to derive a sufficient condition for $k$-contraction that does not require the computation of any $k$-compounds.
We demonstrate our results by deriving a sufficient condition for $k$-contraction of an $n$-dimensional Hopfield network that does not require to compute any compounds. In particular, for $k=2$ this sufficient condition implies that the network is $2$-contracting and this implies a strong asymptotic property: every bounded solution of the network converges to an equilibrium point, that may not be unique. This is relevant, for example, when using the Hopfield network as an associative memory that stores patterns as equilibrium points of the dynamics.
△ Less
Submitted 2 September, 2022;
originally announced September 2022.
-
Robust Training and Verification of Implicit Neural Networks: A Non-Euclidean Contractive Approach
Authors:
Saber Jafarpour,
Alexander Davydov,
Matthew Abate,
Francesco Bullo,
Samuel Coogan
Abstract:
This paper proposes a theoretical and computational framework for training and robustness verification of implicit neural networks based upon non-Euclidean contraction theory. The basic idea is to cast the robustness analysis of a neural network as a reachability problem and use (i) the $\ell_{\infty}$-norm input-output Lipschitz constant and (ii) the tight inclusion function of the network to ove…
▽ More
This paper proposes a theoretical and computational framework for training and robustness verification of implicit neural networks based upon non-Euclidean contraction theory. The basic idea is to cast the robustness analysis of a neural network as a reachability problem and use (i) the $\ell_{\infty}$-norm input-output Lipschitz constant and (ii) the tight inclusion function of the network to over-approximate its reachable sets. First, for a given implicit neural network, we use $\ell_{\infty}$-matrix measures to propose sufficient conditions for its well-posedness, design an iterative algorithm to compute its fixed points, and provide upper bounds for its $\ell_\infty$-norm input-output Lipschitz constant. Second, we introduce a related embedded network and show that the embedded network can be used to provide an $\ell_\infty$-norm box over-approximation of the reachable sets of the original network. Moreover, we use the embedded network to design an iterative algorithm for computing the upper bounds of the original system's tight inclusion function. Third, we use the upper bounds of the Lipschitz constants and the upper bounds of the tight inclusion functions to design two algorithms for the training and robustness verification of implicit neural networks. Finally, we apply our algorithms to train implicit neural networks on the MNIST dataset and compare the robustness of our models with the models trained via existing approaches in the literature.
△ Less
Submitted 7 August, 2022;
originally announced August 2022.
-
The Yakubovich S-Lemma Revisited: Stability and Contractivity in Non-Euclidean Norms
Authors:
Anton V. Proskurnikov,
Alexander Davydov,
Francesco Bullo
Abstract:
The celebrated S-Lemma was originally proposed to ensure the existence of a quadratic Lyapunov function in the Lur'e problem of absolute stability. A quadratic Lyapunov function is, however, nothing else than a squared Euclidean norm on the state space (that is, a norm induced by an inner product). A natural question arises as to whether squared non-Euclidean norms $V(x)=\|x\|^2$ may serve as Lyap…
▽ More
The celebrated S-Lemma was originally proposed to ensure the existence of a quadratic Lyapunov function in the Lur'e problem of absolute stability. A quadratic Lyapunov function is, however, nothing else than a squared Euclidean norm on the state space (that is, a norm induced by an inner product). A natural question arises as to whether squared non-Euclidean norms $V(x)=\|x\|^2$ may serve as Lyapunov functions in stability problems. This paper presents a novel non-polynomial S-Lemma that leads to constructive criteria for the existence of such functions defined by weighted $\ell_p$ norms. Our generalized S-Lemma leads to new absolute stability and absolute contractivity criteria for Lur'e-type systems, including, for example, a new simple proof of the Aizerman and Kalman conjectures for positive Lur'e systems.
△ Less
Submitted 2 February, 2023; v1 submitted 29 July, 2022;
originally announced July 2022.
-
Data-driven Self-triggered Control via Trajectory Prediction
Authors:
Wenjie Liu,
Jian Sun,
Gang Wang,
Francesco Bullo,
Jie Chen
Abstract:
Self-triggered control, a well-documented technique for reducing the communication overhead while ensuring desired system performance, is gaining increasing popularity. However, existing methods for self-triggered control require explicit system models that are assumed perfectly known a priori. An end-to-end control paradigm known as data-driven control learns control laws directly from data, and…
▽ More
Self-triggered control, a well-documented technique for reducing the communication overhead while ensuring desired system performance, is gaining increasing popularity. However, existing methods for self-triggered control require explicit system models that are assumed perfectly known a priori. An end-to-end control paradigm known as data-driven control learns control laws directly from data, and offers a competing alternative to the routine system identification-then-control method. In this context, the present paper puts forth data-driven self-triggered control schemes for unknown linear systems using data collected offline. Specifically, for output feedback control systems, a data-driven model predictive control (MPC) scheme is proposed, which computes a sequence of control inputs while generating a predicted system trajectory. A data-driven self-triggering law is designed using the predicted trajectory, to determine the next triggering time once a new measurement becomes available. For state feedback control systems, instead of capitalizing on MPC to predict the trajectory, a data-fitting problem using the pre-collected input-state data is solved, whose solution is employed to construct the self-triggering mechanism. Both feasibility and stability are established for the proposed self-triggered controllers, which are validated using numerical examples.
△ Less
Submitted 18 July, 2022;
originally announced July 2022.
-
How social influence affects the wisdom of crowds in influence networks
Authors:
Ye Tian,
Long Wang,
Francesco Bullo
Abstract:
A long-standing debate is whether social influence improves the collective wisdom of a crowd or undermines it. This paper addresses this question based on a naive learning setting in influence systems theory: in our models individuals evolve their estimates of an unknown truth according to the weighted-average opinion dynamics. A formal mathematization is provided with rigorous theoretical analysi…
▽ More
A long-standing debate is whether social influence improves the collective wisdom of a crowd or undermines it. This paper addresses this question based on a naive learning setting in influence systems theory: in our models individuals evolve their estimates of an unknown truth according to the weighted-average opinion dynamics. A formal mathematization is provided with rigorous theoretical analysis. We obtain various conditions for improving, optimizing and undermining the crowd accuracy, respectively. We prove that if the wisdom of finite-size group is improved, then the collective estimate converges to the truth as group size increases, provided individuals' variances are finite. We show that whether social influence improves or undermines the wisdom is determined by the social power allocation of the influence system: if the influence system allocates relatively larger social power to relatively more accurate individuals, it improves the wisdom; on the contrary, if the influence system assigns less social power to more accurate individuals, it undermines the wisdom. At a population level, individuals' susceptibilities to interpersonal influence and network centralities are both crucial. To improve the wisdom, more accurate individuals should be less susceptible and have larger network centralities. Particularly, in democratic influence networks, if relatively more accurate individuals are relatively less susceptible, the wisdom is improved; if more accurate individuals are more susceptible, the wisdom is undermined, which is consistent with the reported empirical evidence. Our investigation provides a theoretical framework for understanding the role social influence plays in the emergence of collective wisdom.
△ Less
Submitted 4 March, 2023; v1 submitted 28 April, 2022;
originally announced April 2022.
-
Modeling and Contractivity of Neural-Synaptic Networks with Hebbian Learning
Authors:
Veronica Centorrino,
Francesco Bullo,
Giovanni Russo
Abstract:
This paper is concerned with the modeling and analysis of two of the most commonly used recurrent neural network models (i.e., Hopfield neural network and firing-rate neural network) with dynamic recurrent connections undergoing Hebbian learning rules. To capture the synaptic sparsity of neural circuits we propose a low dimensional formulation. We then characterize certain key dynamical properties…
▽ More
This paper is concerned with the modeling and analysis of two of the most commonly used recurrent neural network models (i.e., Hopfield neural network and firing-rate neural network) with dynamic recurrent connections undergoing Hebbian learning rules. To capture the synaptic sparsity of neural circuits we propose a low dimensional formulation. We then characterize certain key dynamical properties. First, we give biologically-inspired forward invariance results. Then, we give sufficient conditions for the non-Euclidean contractivity of the models. Our contraction analysis leads to stability and robustness of time-varying trajectories -- for networks with both excitatory and inhibitory synapses governed by both Hebbian and anti-Hebbian rules. For each model, we propose a contractivity test based upon biologically meaningful quantities, e.g., neural and synaptic decay rate, maximum in-degree, and the maximum synaptic strength. Then, we show that the models satisfy Dale's Principle. Finally, we illustrate the effectiveness of our results via a numerical example.
△ Less
Submitted 22 March, 2024; v1 submitted 11 April, 2022;
originally announced April 2022.
-
Non-Euclidean Monotone Operator Theory with Applications to Recurrent Neural Networks
Authors:
Alexander Davydov,
Saber Jafarpour,
Anton V. Proskurnikov,
Francesco Bullo
Abstract:
We provide a novel transcription of monotone operator theory to the non-Euclidean finite-dimensional spaces $\ell_1$ and $\ell_{\infty}$. We first establish properties of mappings which are monotone with respect to the non-Euclidean norms $\ell_1$ or $\ell_{\infty}$. In analogy with their Euclidean counterparts, mappings which are monotone with respect to a non-Euclidean norm are amenable to numer…
▽ More
We provide a novel transcription of monotone operator theory to the non-Euclidean finite-dimensional spaces $\ell_1$ and $\ell_{\infty}$. We first establish properties of mappings which are monotone with respect to the non-Euclidean norms $\ell_1$ or $\ell_{\infty}$. In analogy with their Euclidean counterparts, mappings which are monotone with respect to a non-Euclidean norm are amenable to numerous algorithms for computing their zeros. We demonstrate that several classic iterative methods for computing zeros of monotone operators are directly applicable in the non-Euclidean framework. We present a case-study in the equilibrium computation of recurrent neural networks and demonstrate that casting the computation as a suitable operator splitting problem improves convergence rates.
△ Less
Submitted 18 July, 2022; v1 submitted 4 April, 2022;
originally announced April 2022.
-
Comparative Analysis of Interval Reachability for Robust Implicit and Feedforward Neural Networks
Authors:
Alexander Davydov,
Saber Jafarpour,
Matthew Abate,
Francesco Bullo,
Samuel Coogan
Abstract:
We use interval reachability analysis to obtain robustness guarantees for implicit neural networks (INNs). INNs are a class of implicit learning models that use implicit equations as layers and have been shown to exhibit several notable benefits over traditional deep neural networks. We first establish that tight inclusion functions of neural networks, which provide the tightest rectangular over-a…
▽ More
We use interval reachability analysis to obtain robustness guarantees for implicit neural networks (INNs). INNs are a class of implicit learning models that use implicit equations as layers and have been shown to exhibit several notable benefits over traditional deep neural networks. We first establish that tight inclusion functions of neural networks, which provide the tightest rectangular over-approximation of an input-output map, lead to sharper robustness guarantees than the well-studied robustness measures of local Lipschitz constants. Like Lipschitz constants, tight inclusions functions are computationally challenging to obtain, and we thus propose using mixed monotonicity and contraction theory to obtain computationally efficient estimates of tight inclusion functions for INNs. We show that our approach performs at least as well as, and generally better than, applying state-of-the-art interval bound propagation methods to INNs. We design a novel optimization problem for training robust INNs and we provide empirical evidence that suitably-trained INNs can be more robust than comparably-trained feedforward networks.
△ Less
Submitted 31 March, 2022;
originally announced April 2022.
-
Minimum effort decentralized control design for contracting network systems
Authors:
Ron Ofir,
Francesco Bullo,
Michael Margaliot
Abstract:
We consider the problem of making a networked system contracting by designing minimal effort local controllers. Our method combines a hierarchical contraction characterization and a matrix-balancing approach to stabilizing a Metzler matrix via minimal diagonal perturbations. We demonstrate our approach by designing local controllers that render contractive a network of FitzHugh-Nagumo neurons with…
▽ More
We consider the problem of making a networked system contracting by designing minimal effort local controllers. Our method combines a hierarchical contraction characterization and a matrix-balancing approach to stabilizing a Metzler matrix via minimal diagonal perturbations. We demonstrate our approach by designing local controllers that render contractive a network of FitzHugh-Nagumo neurons with a general topology of interactions.
△ Less
Submitted 19 March, 2022;
originally announced March 2022.
-
Multistability and anomalies in oscillator models of lossy power grids
Authors:
Robin Delabays,
Saber Jafarpour,
Francesco Bullo
Abstract:
The analysis of dissipatively coupled oscillators is challenging and highly relevant in power grids. Standard mathematical methods are not applicable, due to the lack of network symmetry induced by dissipative couplings. Here we demonstrate a close correspondence between stable synchronous states in dissipatively coupled oscillators, and the {winding partition} of their state space, a geometric no…
▽ More
The analysis of dissipatively coupled oscillators is challenging and highly relevant in power grids. Standard mathematical methods are not applicable, due to the lack of network symmetry induced by dissipative couplings. Here we demonstrate a close correspondence between stable synchronous states in dissipatively coupled oscillators, and the {winding partition} of their state space, a geometric notion induced by the network topology. Leveraging this winding partition, we accompany this article with an algorithms to compute all synchronous solutions of complex networks of dissipatively coupled oscillators. These geometric and computational tools allow us to identify anomalous behaviors of lossy networked systems. Counterintuitively, we show that loop flows and dissipation can increase the system's transfer capacity, and that dissipation can promote multistability. We apply our geometric framework to compute power flows on the IEEE RTS-96 test system, where we identify two high voltage solutions with distinct loop flows.
△ Less
Submitted 6 September, 2022; v1 submitted 4 February, 2022;
originally announced February 2022.
-
Minimax Flow over Acyclic Networks: Distributed Algorithms and Microgrid Application
Authors:
Marco Coraggio,
Saber Jafarpour,
Francesco Bullo,
Mario di Bernardo
Abstract:
Given a flow network with variable suppliers and fixed consumers, the minimax flow problem consists in minimizing the maximum flow between nodes, subject to flow conservation and capacity constraints. We solve this problem over acyclic graphs in a distributed manner by showing that it can be recast as a consensus problem between the maximum downstream flows, which we define here for the first time…
▽ More
Given a flow network with variable suppliers and fixed consumers, the minimax flow problem consists in minimizing the maximum flow between nodes, subject to flow conservation and capacity constraints. We solve this problem over acyclic graphs in a distributed manner by showing that it can be recast as a consensus problem between the maximum downstream flows, which we define here for the first time. Additionally, we present a distributed algorithm to estimate these quantities. Finally, exploiting our theoretical results, we design an online distributed controller to prevent overcurrent in microgrids consisting of loads and droop-controlled inverters. Our results are validated numerically on the CIGRE benchmark microgrid.
△ Less
Submitted 9 July, 2022; v1 submitted 10 January, 2022;
originally announced January 2022.
-
Dual Seminorms, Ergodic Coefficients and Semicontraction Theory
Authors:
Giulia De Pasquale,
Kevin D. Smith,
Francesco Bullo,
Maria Elena Valcher
Abstract:
Dynamical systems that are contracting on a subspace are said to be semicontracting. Semicontraction theory is a useful tool in the study of consensus algorithms and dynamical flow systems such as Markov chains. To develop a comprehensive theory of semicontracting systems, we investigate seminorms on vector spaces and define two canonical notions: projection and distance semi-norms. We show that t…
▽ More
Dynamical systems that are contracting on a subspace are said to be semicontracting. Semicontraction theory is a useful tool in the study of consensus algorithms and dynamical flow systems such as Markov chains. To develop a comprehensive theory of semicontracting systems, we investigate seminorms on vector spaces and define two canonical notions: projection and distance semi-norms. We show that the well-known lp ergodic coefficients are induced matrix seminorms and play a central role in stability problems. In particular, we formulate a duality theorem that explains why the Markov-Dobrushin coefficient is the rate of contraction for both averaging and conservation flows in discrete time. Moreover, we obtain parallel results for induced matrix log seminorms. Finally, we propose comprehensive theorems for strong semicontractivity of linear and non-linear time-varying dynamical systems with invariance and conservation properties both in discrete and continuous time.
△ Less
Submitted 20 December, 2022; v1 submitted 9 January, 2022;
originally announced January 2022.
-
Modeling Human-AI Team Decision Making
Authors:
Wei Ye,
Francesco Bullo,
Noah Friedkin,
Ambuj K Singh
Abstract:
AI and humans bring complementary skills to group deliberations. Modeling this group decision making is especially challenging when the deliberations include an element of risk and an exploration-exploitation process of appraising the capabilities of the human and AI agents. To investigate this question, we presented a sequence of intellective issues to a set of human groups aided by imperfect AI…
▽ More
AI and humans bring complementary skills to group deliberations. Modeling this group decision making is especially challenging when the deliberations include an element of risk and an exploration-exploitation process of appraising the capabilities of the human and AI agents. To investigate this question, we presented a sequence of intellective issues to a set of human groups aided by imperfect AI agents. A group's goal was to appraise the relative expertise of the group's members and its available AI agents, evaluate the risks associated with different actions, and maximize the overall reward by reaching consensus. We propose and empirically validate models of human-AI team decision making under such uncertain circumstances, and show the value of socio-cognitive constructs of prospect theory, influence dynamics, and Bayesian learning in predicting the behavior of human-AI groups.
△ Less
Submitted 7 January, 2022;
originally announced January 2022.
-
Robustness Certificates for Implicit Neural Networks: A Mixed Monotone Contractive Approach
Authors:
Saber Jafarpour,
Matthew Abate,
Alexander Davydov,
Francesco Bullo,
Samuel Coogan
Abstract:
Implicit neural networks are a general class of learning models that replace the layers in traditional feedforward models with implicit algebraic equations. Compared to traditional learning models, implicit networks offer competitive performance and reduced memory consumption. However, they can remain brittle with respect to input adversarial perturbations.
This paper proposes a theoretical and…
▽ More
Implicit neural networks are a general class of learning models that replace the layers in traditional feedforward models with implicit algebraic equations. Compared to traditional learning models, implicit networks offer competitive performance and reduced memory consumption. However, they can remain brittle with respect to input adversarial perturbations.
This paper proposes a theoretical and computational framework for robustness verification of implicit neural networks; our framework blends together mixed monotone systems theory and contraction theory. First, given an implicit neural network, we introduce a related embedded network and show that, given an $\ell_\infty$-norm box constraint on the input, the embedded network provides an $\ell_\infty$-norm box overapproximation for the output of the given network. Second, using $\ell_{\infty}$-matrix measures, we propose sufficient conditions for well-posedness of both the original and embedded system and design an iterative algorithm to compute the $\ell_{\infty}$-norm box robustness margins for reachability and classification problems. Third, of independent value, we propose a novel relative classifier variable that leads to tighter bounds on the certified adversarial robustness in classification problems. Finally, we perform numerical simulations on a Non-Euclidean Monotone Operator Network (NEMON) trained on the MNIST dataset. In these simulations, we compare the accuracy and run time of our mixed monotone contractive approach with the existing robustness verification approaches in the literature for estimating the certified adversarial robustness.
△ Less
Submitted 9 December, 2021;
originally announced December 2021.
-
Data-Driven Resilient Predictive Control under Denial-of-Service
Authors:
Wenjie Liu,
Jian Sun,
Gang Wang,
Francesco Bullo,
Jie Chen
Abstract:
The study of resilient control of linear time-invariant (LTI) systems against denial-of-service (DoS) attacks is gaining popularity in emerging cyber-physical applications. In previous works, explicit system models are required to design a predictor-based resilient controller. These models can be either given a priori or obtained through a prior system identification step. Recent research efforts…
▽ More
The study of resilient control of linear time-invariant (LTI) systems against denial-of-service (DoS) attacks is gaining popularity in emerging cyber-physical applications. In previous works, explicit system models are required to design a predictor-based resilient controller. These models can be either given a priori or obtained through a prior system identification step. Recent research efforts have focused on data-driven control based on pre-collected input-output trajectories (i.e., without explicit system models). In this paper, we take an initial step toward data-driven stabilization of stochastic LTI systems under DoS attacks, and develop a resilient model predictive control (MPC) scheme driven purely by data-dependent conditions. The proposed data-driven control method achieves the same level of resilience as the model-based control method. For example, local input-to-state stability (ISS) is achieved under mild assumptions on the noise and the DoS attacks. To recover global ISS, two modifications are further suggested at the price of reduced resilience against DoS attacks or increased computational complexity. Finally, a numerical example is given to validate the effectiveness of the proposed control method.
△ Less
Submitted 25 October, 2021;
originally announced October 2021.
-
Non-Euclidean Contraction Analysis of Continuous-Time Neural Networks
Authors:
Alexander Davydov,
Anton V. Proskurnikov,
Francesco Bullo
Abstract:
Critical questions in dynamical neuroscience and machine learning are related to the study of continuous-time neural networks and their stability, robustness, and computational efficiency. These properties can be simultaneously established via a contraction analysis. This paper develops a comprehensive non-Euclidean contraction theory for continuous-time neural networks. Specifically, we provide n…
▽ More
Critical questions in dynamical neuroscience and machine learning are related to the study of continuous-time neural networks and their stability, robustness, and computational efficiency. These properties can be simultaneously established via a contraction analysis. This paper develops a comprehensive non-Euclidean contraction theory for continuous-time neural networks. Specifically, we provide novel sufficient conditions for the contractivity of general classes of continuous-time neural networks including Hopfield, firing rate, Persidskii, Lur'e, and other neural networks with respect to the non-Euclidean $\ell_1/\ell_\infty$ norms. These sufficient conditions are based upon linear programming or, in some special cases, establishing the Hurwitzness of a particular Metzler matrix. To prove these sufficient conditions, we develop novel results on non-Euclidean logarithmic norms and a novel necessary and sufficient condition for contractivity of systems with locally Lipschitz dynamics. For each model, we apply our theoretical results to compute the optimal contraction rate and corresponding weighted non-Euclidean norm with respect to which the neural network is contracting.
△ Less
Submitted 29 June, 2024; v1 submitted 15 October, 2021;
originally announced October 2021.
-
From Contraction Theory to Fixed Point Algorithms on Riemannian and Non-Euclidean Spaces
Authors:
Francesco Bullo,
Pedro Cisneros-Velarde,
Alexander Davydov,
Saber Jafarpour
Abstract:
The design of fixed point algorithms is at the heart of monotone operator theory, convex analysis, and of many modern optimization problems arising in machine learning and control. This tutorial reviews recent advances in understanding the relationship between Demidovich conditions, one-sided Lipschitz conditions, and contractivity theorems. We review the standard contraction theory on Euclidean s…
▽ More
The design of fixed point algorithms is at the heart of monotone operator theory, convex analysis, and of many modern optimization problems arising in machine learning and control. This tutorial reviews recent advances in understanding the relationship between Demidovich conditions, one-sided Lipschitz conditions, and contractivity theorems. We review the standard contraction theory on Euclidean spaces as well as little-known results for Riemannian manifolds. Special emphasis is placed on the setting of non-Euclidean norms and the recently introduced weak pairings for the $\ell_1$ and $\ell_\infty$ norms. We highlight recent results on explicit and implicit fixed point schemes for non-Euclidean contracting systems.
△ Less
Submitted 7 October, 2021;
originally announced October 2021.
-
Convex Optimization of the Basic Reproduction Number
Authors:
Kevin D. Smith,
Francesco Bullo
Abstract:
The basic reproduction number $R_0$ is a fundamental quantity in epidemiological modeling, reflecting the typical number of secondary infections that arise from a single infected individual. While $R_0$ is widely known to scientists, policymakers, and the general public, it has received comparatively little attention in the controls community. This note provides two novel characterizations of…
▽ More
The basic reproduction number $R_0$ is a fundamental quantity in epidemiological modeling, reflecting the typical number of secondary infections that arise from a single infected individual. While $R_0$ is widely known to scientists, policymakers, and the general public, it has received comparatively little attention in the controls community. This note provides two novel characterizations of $R_0$: a stability characterization and a geometric program characterization. The geometric program characterization allows us to write $R_0$-constrained and budget-constrained optimal resource allocation problems as geometric programs, which are easily transformed into convex optimization problems. We apply these programs to allocating vaccines and antidotes in numerical examples, finding that targeting $R_0$ instead of the spectral abscissa of the Jacobian matrix (a common target in the controls literature) leads to qualitatively different solutions.
△ Less
Submitted 2 September, 2022; v1 submitted 15 September, 2021;
originally announced September 2021.
-
Robust Implicit Networks via Non-Euclidean Contractions
Authors:
Saber Jafarpour,
Alexander Davydov,
Anton V. Proskurnikov,
Francesco Bullo
Abstract:
Implicit neural networks, a.k.a., deep equilibrium networks, are a class of implicit-depth learning models where function evaluation is performed by solving a fixed point equation. They generalize classic feedforward models and are equivalent to infinite-depth weight-tied feedforward networks. While implicit models show improved accuracy and significant reduction in memory consumption, they can su…
▽ More
Implicit neural networks, a.k.a., deep equilibrium networks, are a class of implicit-depth learning models where function evaluation is performed by solving a fixed point equation. They generalize classic feedforward models and are equivalent to infinite-depth weight-tied feedforward networks. While implicit models show improved accuracy and significant reduction in memory consumption, they can suffer from ill-posedness and convergence instability.
This paper provides a new framework, which we call Non-Euclidean Monotone Operator Network (NEMON), to design well-posed and robust implicit neural networks based upon contraction theory for the non-Euclidean norm $\ell_{\infty}$. Our framework includes (i) a novel condition for well-posedness based on one-sided Lipschitz constants, (ii) an average iteration for computing fixed-points, and (iii) explicit estimates on input-output Lipschitz constants. Additionally, we design a training problem with the well-posedness condition and the average iteration as constraints and, to achieve robust models, with the input-output Lipschitz constant as a regularizer. Our $\ell_{\infty}$ well-posedness condition leads to a larger polytopic training search space than existing conditions and our average iteration enjoys accelerated convergence. Finally, we evaluate our framework in image classification through the MNIST and the CIFAR-10 datasets. Our numerical results demonstrate improved accuracy and robustness of the implicit models with smaller input-output Lipschitz bounds. Code is available at https://github.com/davydovalexander/Non-Euclidean_Mon_Op_Net.
△ Less
Submitted 25 January, 2022; v1 submitted 6 June, 2021;
originally announced June 2021.
-
A Contraction Theory Approach to Optimization Algorithms from Acceleration Flows
Authors:
Pedro Cisneros-Velarde,
Francesco Bullo
Abstract:
Much recent interest has focused on the design of optimization algorithms from the discretization of an associated optimization flow, i.e., a system of differential equations (ODEs) whose trajectories solve an associated optimization problem. Such a design approach poses an important problem: how to find a principled methodology to design and discretize appropriate ODEs. This paper aims to provide…
▽ More
Much recent interest has focused on the design of optimization algorithms from the discretization of an associated optimization flow, i.e., a system of differential equations (ODEs) whose trajectories solve an associated optimization problem. Such a design approach poses an important problem: how to find a principled methodology to design and discretize appropriate ODEs. This paper aims to provide a solution to this problem through the use of contraction theory. We first introduce general mathematical results that explain how contraction theory guarantees the stability of the implicit and explicit Euler integration methods. Then, we propose a novel system of ODEs, namely the Accelerated-Contracting-Nesterov flow, and use contraction theory to establish it is an optimization flow with exponential convergence rate, from which the linear convergence rate of its associated optimization algorithm is immediately established. Remarkably, a simple explicit Euler discretization of this flow corresponds to the Nesterov acceleration method. Finally, we present how our approach leads to performance guarantees in the design of optimization algorithms for time-varying optimization problems.
△ Less
Submitted 31 January, 2022; v1 submitted 18 May, 2021;
originally announced May 2021.
-
Non-Euclidean Contraction Theory for Monotone and Positive Systems
Authors:
Saber Jafarpour,
Alexander Davydov,
Francesco Bullo
Abstract:
In this note we study contractivity of monotone systems and exponential convergence of positive systems using non-Euclidean norms. We first introduce the notion of conic matrix measure as a framework to study stability of monotone and positive systems. We study properties of the conic matrix measures and investigate their connection with weak pairings and standard matrix measures. Using conic matr…
▽ More
In this note we study contractivity of monotone systems and exponential convergence of positive systems using non-Euclidean norms. We first introduce the notion of conic matrix measure as a framework to study stability of monotone and positive systems. We study properties of the conic matrix measures and investigate their connection with weak pairings and standard matrix measures. Using conic matrix measures and weak pairings, we characterize contractivity and incremental stability of monotone systems with respect to non-Euclidean norms. Moreover, we use conic matrix measures to provide sufficient conditions for exponential convergence of positive systems to their equilibria. We show that our framework leads to novel results on (i) the contractivity of excitatory Hopfield neural networks, and (ii) the stability of interconnected systems using non-monotone positive comparison systems.
△ Less
Submitted 21 August, 2022; v1 submitted 3 April, 2021;
originally announced April 2021.
-
Non-Euclidean Contraction Theory for Robust Nonlinear Stability
Authors:
Alexander Davydov,
Saber Jafarpour,
Francesco Bullo
Abstract:
We study necessary and sufficient conditions for contraction and incremental stability of dynamical systems with respect to non-Euclidean norms. First, we introduce weak pairings as a framework to study contractivity with respect to arbitrary norms, and characterize their properties. We introduce and study the sign and max pairings for the $\ell_1$ and $\ell_\infty$ norms, respectively. Using weak…
▽ More
We study necessary and sufficient conditions for contraction and incremental stability of dynamical systems with respect to non-Euclidean norms. First, we introduce weak pairings as a framework to study contractivity with respect to arbitrary norms, and characterize their properties. We introduce and study the sign and max pairings for the $\ell_1$ and $\ell_\infty$ norms, respectively. Using weak pairings, we establish five equivalent characterizations for contraction, including the one-sided Lipschitz condition for the vector field as well as matrix measure and Demidovich conditions for the corresponding Jacobian. Third, we extend our contraction framework in two directions: we prove equivalences for contraction of continuous vector fields and we formalize the weaker notion of equilibrium contraction, which ensures exponential convergence to an equilibrium. Finally, as an application, we provide (i) incremental input-to-state stability and finite input-state gain properties for contracting systems, and (ii) a general theorem about the Lipschitz interconnection of contracting systems, whereby the Hurwitzness of a gain matrix implies the contractivity of the interconnected system.
△ Less
Submitted 1 August, 2022; v1 submitted 22 March, 2021;
originally announced March 2021.
-
Resilient Control under Quantization and Denial-of-Service: Co-designing a Deadbeat Controller and Transmission Protocol
Authors:
Wenjie Liu,
Jian Sun,
Gang Wang,
Francesco Bullo,
Jie Chen
Abstract:
This paper is concerned with the problem of stabilizing continuous-time linear time-invariant systems subject to quantization and Denial-of-Service (DoS) attacks. In this context, two DoS-induced challenges emerge with the design of resilient encoding schemes, namely, the coupling between encoding strategies of different signals, and the synchronization between the encoder and decoder. To address…
▽ More
This paper is concerned with the problem of stabilizing continuous-time linear time-invariant systems subject to quantization and Denial-of-Service (DoS) attacks. In this context, two DoS-induced challenges emerge with the design of resilient encoding schemes, namely, the coupling between encoding strategies of different signals, and the synchronization between the encoder and decoder. To address these challenges, a novel structure that is equipped with a deadbeat controller as well as a delicate transmission protocol for the input and output channels, co-designed leveraging the controllability index, is put forward. When both input and output channels are subject to DoS attacks and quantization, the proposed structure is shown able to decouple the encoding schemes for input, output, and estimated output signals. This property is further corroborated by designing encoding schemes as well as conditions that ensure exponential stability of the closed-loop system. On the other hand, when only the output channel is subject to network phenomenon, the proposed structure can achieve exponential stabilization without acknowledgment (ACK) signals, in contrast to existing ACK-based results. Finally, a numerical example is given to demonstrate the practical merits of the proposed approach as well as the theory.
△ Less
Submitted 22 March, 2021;
originally announced March 2021.
-
Distributed Wasserstein Barycenters via Displacement Interpolation
Authors:
Pedro Cisneros-Velarde,
Francesco Bullo
Abstract:
Consider a multi-agent system whereby each agent has an initial probability measure. In this paper, we propose a distributed algorithm based upon stochastic, asynchronous and pairwise exchange of information and displacement interpolation in the Wasserstein space. We characterize the evolution of this algorithm and prove it computes the Wasserstein barycenter of the initial measures under various…
▽ More
Consider a multi-agent system whereby each agent has an initial probability measure. In this paper, we propose a distributed algorithm based upon stochastic, asynchronous and pairwise exchange of information and displacement interpolation in the Wasserstein space. We characterize the evolution of this algorithm and prove it computes the Wasserstein barycenter of the initial measures under various conditions. One version of the algorithm computes a standard Wasserstein barycenter, i.e., a barycenter based upon equal weights; and the other version computes a randomized Wasserstein barycenter, i.e., a barycenter based upon random weights for the initial measures. Finally, we specialize our algorithm to Gaussian distributions and draw a connection with the modeling of opinion dynamics in mathematical sociology.
△ Less
Submitted 25 February, 2022; v1 submitted 15 December, 2020;
originally announced December 2020.
-
Stochastic Strategies for Robotic Surveillance as Stackelberg Games
Authors:
Xiaoming Duan,
Dario Paccagnan,
Francesco Bullo
Abstract:
This paper studies a stochastic robotic surveillance problem where a mobile robot moves randomly on a graph to capture a potential intruder that strategically attacks a location on the graph. The intruder is assumed to be omniscient: it knows the current location of the mobile agent and can learn the surveillance strategy. The goal for the mobile robot is to design a stochastic strategy so as to m…
▽ More
This paper studies a stochastic robotic surveillance problem where a mobile robot moves randomly on a graph to capture a potential intruder that strategically attacks a location on the graph. The intruder is assumed to be omniscient: it knows the current location of the mobile agent and can learn the surveillance strategy. The goal for the mobile robot is to design a stochastic strategy so as to maximize the probability of capturing the intruder. We model the strategic interactions between the surveillance robot and the intruder as a Stackelberg game, and optimal and suboptimal Markov chain based surveillance strategies in star, complete and line graphs are studied. We first derive a universal upper bound on the capture probability, i.e., the performance limit for the surveillance agent. We show that this upper bound is tight in the complete graph and further provide suboptimality guarantees for a natural design. For the star and line graphs, we first characterize dominant strategies for the surveillance agent and the intruder. Then, we rigorously prove the optimal strategy for the surveillance agent.
△ Less
Submitted 15 November, 2020;
originally announced November 2020.
-
Expertise and confidence explain how social influence evolves along intellective tasks
Authors:
Omid Askarisichani,
Elizabeth Y. Huang,
Kekoa S. Sato,
Noah E. Friedkin,
Francesco Bullo,
Ambuj K. Singh
Abstract:
Discovering the antecedents of individuals' influence in collaborative environments is an important, practical, and challenging problem. In this paper, we study interpersonal influence in small groups of individuals who collectively execute a sequence of intellective tasks. We observe that along an issue sequence with feedback, individuals with higher expertise and social confidence are accorded h…
▽ More
Discovering the antecedents of individuals' influence in collaborative environments is an important, practical, and challenging problem. In this paper, we study interpersonal influence in small groups of individuals who collectively execute a sequence of intellective tasks. We observe that along an issue sequence with feedback, individuals with higher expertise and social confidence are accorded higher interpersonal influence. We also observe that low-performing individuals tend to underestimate their high-performing teammate's expertise. Based on these observations, we introduce three hypotheses and present empirical and theoretical support for their validity. We report empirical evidence on longstanding theories of transactive memory systems, social comparison, and confidence heuristics on the origins of social influence. We propose a cognitive dynamical model inspired by these theories to describe the process by which individuals adjust interpersonal influences over time. We demonstrate the model's accuracy in predicting individuals' influence and provide analytical results on its asymptotic behavior for the case with identically performing individuals. Lastly, we propose a novel approach using deep neural networks on a pre-trained text embedding model for predicting the influence of individuals. Using message contents, message times, and individual correctness collected during tasks, we are able to accurately predict individuals' self-reported influence over time. Extensive experiments verify the accuracy of the proposed models compared to baselines such as structural balance and reflected appraisal model. While the neural networks model is the most accurate, the dynamical model is the most interpretable for influence prediction.
△ Less
Submitted 13 November, 2020;
originally announced November 2020.
-
Contraction Theory for Dynamical Systems on Hilbert Spaces
Authors:
Pedro Cisneros-Velarde,
Saber Jafarpour,
Francesco Bullo
Abstract:
Contraction theory for dynamical systems on Euclidean spaces is well-established. For contractive (resp. semi-contractive) systems, the distance (resp. semi-distance) between any two trajectories decreases exponentially fast. For partially contractive systems, each trajectory converges exponentially fast to an invariant subspace.
In this note, we develop contraction theory on Hilbert spaces. Fir…
▽ More
Contraction theory for dynamical systems on Euclidean spaces is well-established. For contractive (resp. semi-contractive) systems, the distance (resp. semi-distance) between any two trajectories decreases exponentially fast. For partially contractive systems, each trajectory converges exponentially fast to an invariant subspace.
In this note, we develop contraction theory on Hilbert spaces. First, we provide a novel integral condition for contractivity, and for time-invariant systems, we establish the existence of a unique globally exponentially stable equilibrium. Second, we introduce the notions of partial and semi-contraction and we provide various sufficient conditions for time-varying and time-invariant systems. Finally, we apply the theory on a classic reaction-diffusion system.
△ Less
Submitted 4 June, 2021; v1 submitted 2 October, 2020;
originally announced October 2020.
-
Assign and Appraise: Achieving Optimal Performance in Collaborative Teams
Authors:
Elizabeth Y. Huang,
Dario Paccagnan,
Wenjun Mei,
Francesco Bullo
Abstract:
Tackling complex team problems requires understanding each team member's skills in order to devise a task assignment maximizing the team performance. This paper proposes a novel quantitative model describing the decentralized process by which individuals in a team learn who has what abilities, while concurrently assigning tasks to each of the team members. In the model, the appraisal network repre…
▽ More
Tackling complex team problems requires understanding each team member's skills in order to devise a task assignment maximizing the team performance. This paper proposes a novel quantitative model describing the decentralized process by which individuals in a team learn who has what abilities, while concurrently assigning tasks to each of the team members. In the model, the appraisal network represents team member's evaluations of one another and each team member chooses their own workload. The appraisals and workload assignment change simultaneously: each member builds their own local appraisal of neighboring members based on the performance exhibited on previous tasks, while the workload is redistributed based on the current appraisal estimates. We show that the appraisal states can be reduced to a lower dimension due to the presence of conserved quantities associated to the cycles of the appraisal network. Building on this, we provide rigorous results characterizing the ability, or inability, of the team to learn each other's skill and thus converge to an allocation maximizing the team performance. We complement our analysis with extensive numerical experiments.
△ Less
Submitted 22 August, 2020;
originally announced August 2020.
-
Markov Chain-Based Stochastic Strategies for Robotic Surveillance
Authors:
Xiaoming Duan,
Francesco Bullo
Abstract:
This article surveys recent advancements of strategy designs for persistent robotic surveillance tasks with the focus on stochastic approaches. The problem describes how mobile robots stochastically patrol a graph in an efficient way where the efficiency is defined with respect to relevant underlying performance metrics. We first start by reviewing the basics of Markov chains, which is the primary…
▽ More
This article surveys recent advancements of strategy designs for persistent robotic surveillance tasks with the focus on stochastic approaches. The problem describes how mobile robots stochastically patrol a graph in an efficient way where the efficiency is defined with respect to relevant underlying performance metrics. We first start by reviewing the basics of Markov chains, which is the primary motion model for stochastic robotic surveillance. Then two main criteria regarding the speed and unpredictability of surveillance strategies are discussed. The central objects that appear throughout the treatment is the hitting times of Markov chains, their distributions and expectations. We formulate various optimization problems based on the concerned metrics in different scenarios and establish their respective properties.
△ Less
Submitted 20 August, 2020;
originally announced August 2020.
-
Policy Evaluation and Seeking for Multi-Agent Reinforcement Learning via Best Response
Authors:
Rui Yan,
Xiaoming Duan,
Zongying Shi,
Yisheng Zhong,
Jason R. Marden,
Francesco Bullo
Abstract:
This paper introduces two metrics (cycle-based and memory-based metrics), grounded on a dynamical game-theoretic solution concept called sink equilibrium, for the evaluation, ranking, and computation of policies in multi-agent learning. We adopt strict best response dynamics (SBRD) to model selfish behaviors at a meta-level for multi-agent reinforcement learning. Our approach can deal with dynamic…
▽ More
This paper introduces two metrics (cycle-based and memory-based metrics), grounded on a dynamical game-theoretic solution concept called sink equilibrium, for the evaluation, ranking, and computation of policies in multi-agent learning. We adopt strict best response dynamics (SBRD) to model selfish behaviors at a meta-level for multi-agent reinforcement learning. Our approach can deal with dynamical cyclical behaviors (unlike approaches based on Nash equilibria and Elo ratings), and is more compatible with single-agent reinforcement learning than alpha-rank which relies on weakly better responses. We first consider settings where the difference between largest and second largest underlying metric has a known lower bound. With this knowledge we propose a class of perturbed SBRD with the following property: only policies with maximum metric are observed with nonzero probability for a broad class of stochastic games with finite memory. We then consider settings where the lower bound for the difference is unknown. For this setting, we propose a class of perturbed SBRD such that the metrics of the policies observed with nonzero probability differ from the optimal by any given tolerance. The proposed perturbed SBRD addresses the opponent-induced non-stationarity by fixing the strategies of others for the learning agent, and uses empirical game-theoretic analysis to estimate payoffs for each strategy profile obtained due to the perturbation.
△ Less
Submitted 20 June, 2020; v1 submitted 16 June, 2020;
originally announced June 2020.
-
Multi-group SIS Epidemics with Simplicial and Higher-Order Interactions
Authors:
Pedro Cisneros-Velarde,
Francesco Bullo
Abstract:
This paper analyzes a Susceptible-Infected-Susceptible (SIS) model of epidemic propagation over hypergraphs and, motivated by an important special case, we refer to the model as to the simplicial SIS model. Classically, the multi-group SIS model has assumed pairwise interactions of contagion across groups and thus has been vastly studied in the literature. It is only recently that a renewed specia…
▽ More
This paper analyzes a Susceptible-Infected-Susceptible (SIS) model of epidemic propagation over hypergraphs and, motivated by an important special case, we refer to the model as to the simplicial SIS model. Classically, the multi-group SIS model has assumed pairwise interactions of contagion across groups and thus has been vastly studied in the literature. It is only recently that a renewed special attention has been drawn to the study of contagion dynamics over higher-order interactions and over more general graph structures, like simplexes. Previous work on mean-field approximation scalar models of the simplicial SIS model has indicated that a new dynamical behavior domain, compared to the classical SIS model, appears due to the newly introduced higher order interaction terms: both a disease-free equilibrium and an endemic equilibrium co-exist and are both locally asymptotically stable. This paper formally establishes that bistability (as a new epidemiological behavior) also appears in the multi-group simplicial SIS model. We give sufficient conditions over the model's parameters for the appearance of this and the other behavioral domains present in the classical multi-group SIS model. We additionally provide an algorithm to compute the value of the endemic equilibrium and report numerical analysis of the transition from the disease-free domain to the bistable domain.
△ Less
Submitted 3 October, 2021; v1 submitted 22 May, 2020;
originally announced May 2020.
-
Weak and Semi-Contraction for Network Systems and Diffusively-Coupled Oscillators
Authors:
Saber Jafarpour,
Pedro Cisneros-Velarde,
Francesco Bullo
Abstract:
We develop two generalizations of contraction theory, namely, semi-contraction and weak-contraction theory. First, using the notion of semi-norm, we propose a geometric framework for semi-contraction theory. We introduce matrix semi-measures and characterize their properties. We show that the spectral abscissa of a matrix is the infimum over weighted semi-measures. For dynamical systems, we use th…
▽ More
We develop two generalizations of contraction theory, namely, semi-contraction and weak-contraction theory. First, using the notion of semi-norm, we propose a geometric framework for semi-contraction theory. We introduce matrix semi-measures and characterize their properties. We show that the spectral abscissa of a matrix is the infimum over weighted semi-measures. For dynamical systems, we use the semi-measure of their Jacobian to characterize the contractivity properties of their trajectories. Second, for weakly contracting systems, we prove a dichotomy for the asymptotic behavior of their trajectories and novel sufficient conditions for convergence to an equilibrium. Third, we show that every trajectory of a doubly-contracting system, i.e., a system that is both weakly and semi-contracting, converges to an equilibrium point. Finally, we apply our results to various important network systems including affine averaging and affine flow systems, continuous-time distributed primal-dual algorithms, and networks of diffusively-coupled dynamical systems. For diffusively-coupled systems, the semi-contraction theory leads to a sufficient condition for synchronization that is sharper, in general, than previously-known tests.
△ Less
Submitted 4 October, 2020; v1 submitted 19 May, 2020;
originally announced May 2020.