Search | arXiv e-print repository

Perspectives on Contractivity in Control, Optimization, and Learning

Authors: Alexander Davydov, Francesco Bullo

Abstract: Contraction theory is a mathematical framework for studying the convergence, robustness, and modularity properties of dynamical systems and algorithms. In this opinion paper, we provide five main opinions on the virtues of contraction theory. These opinions are (i) contraction theory is a unifying framework emerging from classical and modern works, (ii) contractivity is computationally-friendly, r… ▽ More Contraction theory is a mathematical framework for studying the convergence, robustness, and modularity properties of dynamical systems and algorithms. In this opinion paper, we provide five main opinions on the virtues of contraction theory. These opinions are (i) contraction theory is a unifying framework emerging from classical and modern works, (ii) contractivity is computationally-friendly, robust, and modular stability, (iii) numerous dynamical systems are contracting, (iv) contraction theory is relevant to modern applications, and (v) contraction theory can be vastly extended in numerous directions. We survey recent theoretical and applied research in each of these five directions. △ Less

Submitted 17 April, 2024; originally announced April 2024.

arXiv:2403.08159 [pdf, other]

Exponential Stability of Parametric Optimization-Based Controllers via Lur'e Contractivity

Authors: Alexander Davydov, Francesco Bullo

Abstract: In this letter, we investigate sufficient conditions for the exponential stability of LTI systems driven by controllers derived from parametric optimization problems. Our primary focus is on parametric projection controllers, namely parametric programs whose objective function is the squared distance to a nominal controller. Leveraging the virtual system method of analysis and a novel contractivit… ▽ More In this letter, we investigate sufficient conditions for the exponential stability of LTI systems driven by controllers derived from parametric optimization problems. Our primary focus is on parametric projection controllers, namely parametric programs whose objective function is the squared distance to a nominal controller. Leveraging the virtual system method of analysis and a novel contractivity result for Lur'e systems, we establish a sufficient LMI condition for the exponential stability of an LTI system with a parametric projection-based controller. Separately, we prove additional results for single-integrator systems. Finally, we apply our results to state-dependent saturated control systems and control barrier function-based control and provide numerical simulations. △ Less

Submitted 15 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

arXiv:2403.07572 [pdf, other]

On Weakly Contracting Dynamics for Convex Optimization

Authors: Veronica Centorrino, Alexander Davydov, Anand Gokhale, Giovanni Russo, Francesco Bullo

Abstract: We analyze the convergence behavior of \emph{globally weakly} and \emph{locally strongly contracting} dynamics. Such dynamics naturally arise in the context of convex optimization problems with a unique minimizer. We show that convergence to the equilibrium is \emph{linear-exponential}, in the sense that the distance between each solution and the equilibrium is upper bounded by a function that fir… ▽ More We analyze the convergence behavior of \emph{globally weakly} and \emph{locally strongly contracting} dynamics. Such dynamics naturally arise in the context of convex optimization problems with a unique minimizer. We show that convergence to the equilibrium is \emph{linear-exponential}, in the sense that the distance between each solution and the equilibrium is upper bounded by a function that first decreases linearly and then exponentially. As we show, the linear-exponential dependency arises naturally in certain dynamics with saturations. Additionally, we provide a sufficient condition for local input-to-state stability. Finally, we illustrate our results on, and propose a conjecture for, continuous-time dynamical systems solving linear programs. △ Less

Submitted 15 May, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

Comments: 16 pages, 4 Figures

arXiv:2402.08090 [pdf, other]

Learning Neural Contracting Dynamics: Extended Linearization and Global Guarantees

Authors: Sean Jaffe, Alexander Davydov, Deniz Lapsekili, Ambuj Singh, Francesco Bullo

Abstract: Global stability and robustness guarantees in learned dynamical systems are essential to ensure well-behavedness of the systems in the face of uncertainty. We present Extended Linearized Contracting Dynamics (ELCD), the first neural network-based dynamical system with global contractivity guarantees in arbitrary metrics. The key feature of ELCD is a parametrization of the extended linearization of… ▽ More Global stability and robustness guarantees in learned dynamical systems are essential to ensure well-behavedness of the systems in the face of uncertainty. We present Extended Linearized Contracting Dynamics (ELCD), the first neural network-based dynamical system with global contractivity guarantees in arbitrary metrics. The key feature of ELCD is a parametrization of the extended linearization of the nonlinear vector field. In its most basic form, ELCD is guaranteed to be (i) globally exponentially stable, (ii) equilibrium contracting, and (iii) globally contracting with respect to some metric. To allow for contraction with respect to more general metrics in the data space, we train diffeomorphisms between the data space and a latent space and enforce contractivity in the latent space, which ensures global contractivity in the data space. We demonstrate the performance of ELCD on the high dimensional LASA, multi-link pendulum, and Rosenbrock datasets. △ Less

Submitted 29 May, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

Comments: 9 pages, 3 figures. Under Review

arXiv:2312.07759 [pdf, ps, other]

IDKM: Memory Efficient Neural Network Quantization via Implicit, Differentiable k-Means

Authors: Sean Jaffe, Ambuj K. Singh, Francesco Bullo

Abstract: Compressing large neural networks with minimal performance loss is crucial to enabling their deployment on edge devices. (Cho et al., 2022) proposed a weight quantization method that uses an attention-based clustering algorithm called differentiable $k$-means (DKM). Despite achieving state-of-the-art results, DKM's performance is constrained by its heavy memory dependency. We propose an implicit,… ▽ More Compressing large neural networks with minimal performance loss is crucial to enabling their deployment on edge devices. (Cho et al., 2022) proposed a weight quantization method that uses an attention-based clustering algorithm called differentiable $k$-means (DKM). Despite achieving state-of-the-art results, DKM's performance is constrained by its heavy memory dependency. We propose an implicit, differentiable $k$-means algorithm (IDKM), which eliminates the major memory restriction of DKM. Let $t$ be the number of $k$-means iterations, $m$ be the number of weight-vectors, and $b$ be the number of bits per cluster address. IDKM reduces the overall memory complexity of a single $k$-means layer from $\mathcal{O}(t \cdot m \cdot 2^b)$ to $\mathcal{O}( m \cdot 2^b)$. We also introduce a variant, IDKM with Jacobian-Free-Backpropagation (IDKM-JFB), for which the time complexity of the gradient calculation is independent of $t$ as well. We provide a proof of concept of our methods by showing that, under the same settings, IDKM achieves comparable performance to DKM with less compute time and less memory. We also use IDKM and IDKM-JFB to quantize a large neural network, Resnet18, on hardware where DKM cannot train at all. △ Less

Submitted 15 December, 2023; v1 submitted 12 December, 2023; originally announced December 2023.

arXiv:2311.03821 [pdf, other]

Positive Competitive Networks for Sparse Reconstruction

Authors: Veronica Centorrino, Anand Gokhale, Alexander Davydov, Giovanni Russo, Francesco Bullo

Abstract: We propose and analyze a continuous-time firing-rate neural network, the positive firing-rate competitive network (\pfcn), to tackle sparse reconstruction problems with non-negativity constraints. These problems, which involve approximating a given input stimulus from a dictionary using a set of sparse (active) neurons, play a key role in a wide range of domains, including for example neuroscience… ▽ More We propose and analyze a continuous-time firing-rate neural network, the positive firing-rate competitive network (\pfcn), to tackle sparse reconstruction problems with non-negativity constraints. These problems, which involve approximating a given input stimulus from a dictionary using a set of sparse (active) neurons, play a key role in a wide range of domains, including for example neuroscience, signal processing, and machine learning. First, by leveraging the theory of proximal operators, we relate the equilibria of a family of continuous-time firing-rate neural networks to the optimal solutions of sparse reconstruction problems. Then, we prove that the \pfcn is a positive system and give rigorous conditions for the convergence to the equilibrium. Specifically, we show that the convergence: (i) only depends on a property of the dictionary; (ii) is linear-exponential, in the sense that initially the convergence rate is at worst linear and then, after a transient, it becomes exponential. We also prove a number of technical results to assess the contractivity properties of the neural dynamics of interest. Our analysis leverages contraction theory to characterize the behavior of a family of firing-rate competitive networks for sparse reconstruction with and without non-negativity constraints. Finally, we validate the effectiveness of our approach via a numerical example. △ Less

Submitted 22 March, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

Comments: 26 pages, 9 Figure, 1 Table

arXiv:2311.02389 [pdf, other]

Multiplayer Homicidal Chauffeur Reach-Avoid Games: A Pursuit Enclosure Function Approach

Authors: Rui Yan, Xiaoming Duan, Rui Zou, Xin He, Zongying Shi, Francesco Bullo

Abstract: This paper presents a multiplayer Homicidal Chauffeur reach-avoid differential game, which involves Dubins-car pursuers and simple-motion evaders. The goal of the pursuers is to cooperatively protect a planar convex region from the evaders, who strive to reach the region. We propose a cooperative strategy for the pursuers based on subgames for multiple pursuers against one evader and optimal task… ▽ More This paper presents a multiplayer Homicidal Chauffeur reach-avoid differential game, which involves Dubins-car pursuers and simple-motion evaders. The goal of the pursuers is to cooperatively protect a planar convex region from the evaders, who strive to reach the region. We propose a cooperative strategy for the pursuers based on subgames for multiple pursuers against one evader and optimal task allocation. We introduce pursuit enclosure functions (PEFs) and propose a new enclosure region pursuit (ERP) winning approach that supports forward analysis for the strategy synthesis in the subgames. We show that if a pursuit coalition is able to defend the region against an evader under the ERP winning, then no more than two pursuers in the coalition are necessarily needed. We also propose a steer-to-ERP approach to certify the ERP winning and synthesize the ERP winning strategy. To implement the strategy, we introduce a positional PEF and provide the necessary parameters, states, and strategies that ensure the ERP winning for both one pursuer and two pursuers against one evader. Additionally, we formulate a binary integer program using the subgame outcomes to maximize the captured evaders in the ERP winning for the pursuit task allocation. Finally, we propose a multiplayer receding-horizon strategy where the ERP winnings are checked in each horizon, the task is allocated, and the strategies of the pursuers are determined. Numerical examples are provided to illustrate the results. △ Less

Submitted 22 December, 2023; v1 submitted 4 November, 2023; originally announced November 2023.

Comments: 17 pages, 5 figures

arXiv:2310.07966 [pdf, ps, other]

Singular Perturbation via Contraction Theory

Authors: Liliaokeawawa Cothren, Francesco Bullo, Emiliano Dall'Anese

Abstract: In this paper, we provide a novel contraction-theoretic approach to analyze two-time scale systems. In our proposed framework, systems enjoy several robustness properties, which can lead to a more complete characterization of their behaviors. Key assumptions are the contractivity of the fast sub-system and of the reduced model, combined with an explicit upper bound on the time-scale parameter. For… ▽ More In this paper, we provide a novel contraction-theoretic approach to analyze two-time scale systems. In our proposed framework, systems enjoy several robustness properties, which can lead to a more complete characterization of their behaviors. Key assumptions are the contractivity of the fast sub-system and of the reduced model, combined with an explicit upper bound on the time-scale parameter. For two-time scale systems subject to disturbances, we show that the distance between solutions of the nominal system and solutions of its reduced model is uniformly upper bounded by a function of contraction rates, Lipschitz constants, the time-scale parameter, and the time variability of the disturbances. We also show local contractivity of the two-time scale system and give sufficient conditions for global contractivity. We then consider two special cases: for autonomous nonlinear systems we obtain sharper bounds than our general results and for linear time-invariant systems we present novel bounds based upon log norms and induced norms. Finally, we apply our theory to two application areas -- online feedback optimization and Stackelberg games -- and obtain new individual tracking error bounds showing that solutions converge to their (time-varying) optimizer and computing overall contraction rates. △ Less

Submitted 30 January, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

Comments: This paper has been submitted to IEEE Transactions on Automatic Control

arXiv:2309.08742 [pdf, other]

RoSSO: A High-Performance Python Package for Robotic Surveillance Strategy Optimization Using JAX

Authors: Yohan John, Connor Hughes, Gilberto Diaz-Garcia, Jason R. Marden, Francesco Bullo

Abstract: To enable the computation of effective randomized patrol routes for single- or multi-robot teams, we present RoSSO, a Python package designed for solving Markov chain optimization problems. We exploit machine-learning techniques such as reverse-mode automatic differentiation and constraint parametrization to achieve superior efficiency compared to general-purpose nonlinear programming solvers. Add… ▽ More To enable the computation of effective randomized patrol routes for single- or multi-robot teams, we present RoSSO, a Python package designed for solving Markov chain optimization problems. We exploit machine-learning techniques such as reverse-mode automatic differentiation and constraint parametrization to achieve superior efficiency compared to general-purpose nonlinear programming solvers. Additionally, we supplement a game-theoretic stochastic surveillance formulation in the literature with a novel greedy algorithm and multi-robot extension. We close with numerical results for a police district in downtown San Francisco that demonstrate RoSSO's capabilities on our new formulations and the prior work. △ Less

Submitted 15 September, 2023; originally announced September 2023.

Comments: 7 pages, 4 figures, 3 tables, submitted to the 2024 IEEE International Conference on Robotics and Automation. See https://github.com/conhugh/RoSSO for associated codebase

arXiv:2309.05873 [pdf, other]

Contractivity of Distributed Optimization and Nash Seeking Dynamics

Authors: Anand Gokhale, Alexander Davydov, Francesco Bullo

Abstract: In this letter, we study distributed optimization and Nash equilibrium-seeking dynamics from a contraction theoretic perspective. Our first result is a novel bound on the logarithmic norm of saddle matrices. Second, for distributed gradient flows based upon incidence and Laplacian constraints over arbitrary topologies, we establish strong contractivity over an appropriate invariant vector subspace… ▽ More In this letter, we study distributed optimization and Nash equilibrium-seeking dynamics from a contraction theoretic perspective. Our first result is a novel bound on the logarithmic norm of saddle matrices. Second, for distributed gradient flows based upon incidence and Laplacian constraints over arbitrary topologies, we establish strong contractivity over an appropriate invariant vector subspace. Third, we give sufficient conditions for strong contractivity in pseudogradient and best response games with complete information, show the equivalence of these conditions, and consider the special case of aggregative games. △ Less

Submitted 21 September, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

Comments: 7 pages, 1 figure, jointly submitted to the IEEE Control Systems Letters and the 2024 American Control Conference

arXiv:2308.14714 [pdf, other]

A Stochastic Surveillance Stackelberg Game: Co-Optimizing Defense Placement and Patrol Strategy

Authors: Yohan John, Gilberto Diaz-Garcia, Xiaoming Duan, Jason R. Marden, Francesco Bullo

Abstract: Stochastic patrol routing is known to be advantageous in adversarial settings; however, the optimal choice of stochastic routing strategy is dependent on a model of the adversary. We adopt a worst-case omniscient adversary model from the literature and extend the formulation to accommodate heterogeneous defenses at the various nodes of the graph. Introducing this heterogeneity leads to interesting… ▽ More Stochastic patrol routing is known to be advantageous in adversarial settings; however, the optimal choice of stochastic routing strategy is dependent on a model of the adversary. We adopt a worst-case omniscient adversary model from the literature and extend the formulation to accommodate heterogeneous defenses at the various nodes of the graph. Introducing this heterogeneity leads to interesting new patrol strategies. We identify efficient methods for computing these strategies in certain classes of graphs. We assess the effectiveness of these strategies via comparison to an upper bound on the value of the game. Finally, we leverage the heterogeneous defense formulation to develop novel defense placement algorithms that complement the patrol strategies. △ Less

Submitted 20 February, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

Comments: 9 pages, 1 figure, submitted as a technical note to the IEEE Transactions on Automatic Control. Replaced to fix inaccuracies

arXiv:2305.15595 [pdf, other]

Time-Varying Convex Optimization: A Contraction and Equilibrium Tracking Approach

Authors: Alexander Davydov, Veronica Centorrino, Anand Gokhale, Giovanni Russo, Francesco Bullo

Abstract: In this article, we provide a novel and broadly-applicable contraction-theoretic approach to continuous-time time-varying convex optimization. For any parameter-dependent contracting dynamics, we show that the tracking error is asymptotically proportional to the rate of change of the parameter with proportionality constant upper bounded by Lipschitz constant in which the parameter appears divided… ▽ More In this article, we provide a novel and broadly-applicable contraction-theoretic approach to continuous-time time-varying convex optimization. For any parameter-dependent contracting dynamics, we show that the tracking error is asymptotically proportional to the rate of change of the parameter with proportionality constant upper bounded by Lipschitz constant in which the parameter appears divided by the contraction rate of the dynamics squared. We additionally establish that any parameter-dependent contracting dynamics can be augmented with a feedforward prediction term to ensure that the tracking error converges to zero exponentially quickly. To apply these results to time-varying convex optimization problems, we establish the strong infinitesimal contractivity of dynamics solving three canonical problems, namely monotone inclusions, linear equality-constrained problems, and composite minimization problems. For each of these problems, we prove the sharpest-known rates of contraction and provide explicit tracking error bounds between solution trajectories and minimizing trajectories. We validate our theoretical results on three numerical examples including an application to control-barrier function based controller design. △ Less

Submitted 29 May, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

arXiv:2305.01417 [pdf, other]

Learning Robust Data-based LQG Controllers from Noisy Data

Authors: Wenjie Liu, Jian Sun, Gang Wang, Francesco Bullo, Jie Chen

Abstract: This paper addresses the joint state estimation and control problems for unknown linear time-invariant systems subject to both process and measurement noise. The aim is to redesign the linear quadratic Gaussian (LQG) controller based solely on data. The LQG controller comprises a linear quadratic regulator (LQR) and a steady-state Kalman observer; while the data-based LQR design problem has been p… ▽ More This paper addresses the joint state estimation and control problems for unknown linear time-invariant systems subject to both process and measurement noise. The aim is to redesign the linear quadratic Gaussian (LQG) controller based solely on data. The LQG controller comprises a linear quadratic regulator (LQR) and a steady-state Kalman observer; while the data-based LQR design problem has been previously studied, constructing the Kalman gain and the LQG controller from noisy data presents a novel challenge. In this work, a data-based formulation for computing the steady-state Kalman gain is proposed based on semi-definite programming (SDP) using some noise-free input-state-output data. Additionally, a data-based LQG controller is developed, which is shown to be equivalent to the model-based LQG controller. For cases where offline data are corrupted by noise, a robust data-based observer gain is constructed by tackling a relaxed SDP. The proposed controllers are proven to achieve robust global exponential stability (RGES) for state estimation and input-to-state practical stability (ISpS) under standard conditions. Finally, numerical tests are conducted to validate the proposed controllers' correctness and effectiveness. △ Less

Submitted 2 May, 2023; originally announced May 2023.

arXiv:2303.11273 [pdf, other]

Non-Euclidean Monotone Operator Theory and Applications

Authors: Alexander Davydov, Saber Jafarpour, Anton V. Proskurnikov, Francesco Bullo

Abstract: While monotone operator theory is traditionally studied on Hilbert spaces, many interesting problems in data science and machine learning arise naturally in finite-dimensional vector spaces endowed with non-Euclidean norms, such as diagonally-weighted $\ell_1$ or $\ell_\infty$ norms. This paper provides a natural generalization of monotone operator theory to finite-dimensional non-Euclidean spaces… ▽ More While monotone operator theory is traditionally studied on Hilbert spaces, many interesting problems in data science and machine learning arise naturally in finite-dimensional vector spaces endowed with non-Euclidean norms, such as diagonally-weighted $\ell_1$ or $\ell_\infty$ norms. This paper provides a natural generalization of monotone operator theory to finite-dimensional non-Euclidean spaces. The key tools are weak pairings and logarithmic norms. We show that the resolvent and reflected resolvent operators of non-Euclidean monotone mappings exhibit similar properties to their counterparts in Hilbert spaces. Furthermore, classical iterative methods and splitting methods for finding zeros of monotone operators are shown to converge in the non-Euclidean case. We apply our theory to equilibrium computation and Lipschitz constant estimation of recurrent neural networks, obtaining novel iterations and tighter upper bounds via forward-backward splitting. △ Less

Submitted 20 March, 2023; originally announced March 2023.

Comments: arXiv admin note: text overlap with arXiv:2204.01877

arXiv:2303.10127 [pdf, other]

doi 10.1109/LCSYS.2023.3275169

Semicontraction and Synchronization of Kuramoto-Sakaguchi Oscillator Networks

Authors: Robin Delabays, Francesco Bullo

Abstract: This paper studies the celebrated Kuramoto-Sakaguchi model of coupled oscillators adopting two recent concepts. First, we consider appropriately-defined subsets of the $n$-torus called winding cells. Second, we analyze the semicontractivity of the model, i.e., the property that the distance between trajectories decreases when measured according to a seminorm. This paper establishes the local sem… ▽ More This paper studies the celebrated Kuramoto-Sakaguchi model of coupled oscillators adopting two recent concepts. First, we consider appropriately-defined subsets of the $n$-torus called winding cells. Second, we analyze the semicontractivity of the model, i.e., the property that the distance between trajectories decreases when measured according to a seminorm. This paper establishes the local semicontractivity of the Kuramoto-Sakaguchi model, which is equivalent to the local contractivity for the reduced model. The reduced model is defined modulo the rotational symmetry. The domains where the system is semicontracting are convex phase-cohesive subsets of winding cells. Our sufficient conditions and estimates of the semicontracting domains are less conservative and more explicit than in previous works. Based on semicontraction on phase-cohesive subsets, we establish the "at most uniqueness" of synchronous states within these domains, thereby characterizing the multistability of this model. △ Less

Submitted 8 May, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

Comments: 6 pages, 3 figures

arXiv:2302.13452 [pdf, ps, other]

Euclidean Contractivity of Neural Networks with Symmetric Weights

Authors: Veronica Centorrino, Anand Gokhale, Alexander Davydov, Giovanni Russo, Francesco Bullo

Abstract: This paper investigates stability conditions of continuous-time Hopfield and firing-rate neural networks by leveraging contraction theory. First, we present a number of useful general algebraic results on matrix polytopes and products of symmetric matrices. Then, we give sufficient conditions for strong and weak Euclidean contractivity, i.e., contractivity with respect to the $\ell_2$ norm, of bot… ▽ More This paper investigates stability conditions of continuous-time Hopfield and firing-rate neural networks by leveraging contraction theory. First, we present a number of useful general algebraic results on matrix polytopes and products of symmetric matrices. Then, we give sufficient conditions for strong and weak Euclidean contractivity, i.e., contractivity with respect to the $\ell_2$ norm, of both models with symmetric weights and (possibly) non-smooth activation functions. Our contraction analysis leads to contraction rates which are log-optimal in almost all symmetric synaptic matrices. Finally, we use our results to propose a firing-rate neural network model to solve a quadratic optimization problem with box constraints. △ Less

Submitted 12 May, 2023; v1 submitted 26 February, 2023; originally announced February 2023.

Comments: 17 pages, 2 figures

arXiv:2212.08808 [pdf, other]

Convergence, Consensus and Dissensus in the Weighted-Median Opinion Dynamics

Authors: Wenjun Mei, Julien M. Hendrickx, Ge Chen, Francesco Bullo, Florian Dörfler

Abstract: Mechanistic and tractable mathematical models play a key role in understanding how social influence shapes public opinions. Recently, a weighted-median mechanism has been proposed as a new micro-foundation of opinion dynamics and validated via experimental data. Numerical studies also indicate that this new mechanism recreates some non-trivial real-world features of opinion evolution. In this pape… ▽ More Mechanistic and tractable mathematical models play a key role in understanding how social influence shapes public opinions. Recently, a weighted-median mechanism has been proposed as a new micro-foundation of opinion dynamics and validated via experimental data. Numerical studies also indicate that this new mechanism recreates some non-trivial real-world features of opinion evolution. In this paper, we conduct a thorough theoretical analysis of the weighted-median opinion dynamics. We fully characterize the set of all equilibria, and we establish the almost-sure finite-time convergence for any initial condition. Moreover, we prove a necessary and sufficient graph-theoretic condition for the almost-sure convergence to consensus, as well as a sufficient graph-theoretic condition for almost-sure persistent dissensus. It turns out that the weighted-median opinion dynamics, despite its simplicity in form, exhibit rich dynamical behavior that depends on some delicate network structures. To complement our sufficient conditions for almost-sure dissensus, we further prove that, given the influence network, determining whether the system almost surely achieves persistent dissensus is NP-hard, which reflects the complexity the network topology contributes to opinion evolution. △ Less

Submitted 17 December, 2022; originally announced December 2022.

MSC Class: 93E03; 93A30; 91D30

arXiv:2209.10046 [pdf, other]

doi 10.1109/LCSYS.2022.3228723

Contractivity of the Method of Successive Approximations for Optimal Control

Authors: Kevin D. Smith, Francesco Bullo

Abstract: Strongly contracting dynamical systems have numerous properties (e.g., incremental ISS), find widespread applications (e.g., in controls and learning), and their study is receiving increasing attention. This work starts with the simple observation that, given a strongly contracting system, its adjoint dynamical system is also strongly contracting, with the same rate, with respect to the dual norm,… ▽ More Strongly contracting dynamical systems have numerous properties (e.g., incremental ISS), find widespread applications (e.g., in controls and learning), and their study is receiving increasing attention. This work starts with the simple observation that, given a strongly contracting system, its adjoint dynamical system is also strongly contracting, with the same rate, with respect to the dual norm, under time reversal. As main implication of this dual contractivity, we show that the classic Method of Successive Approximations (MSA), an indirect method in optimal control, is a contraction mapping for short optimization intervals or large contraction rates. Consequently, we establish new convergence conditions for the MSA algorithm, which further imply uniqueness of the optimal control and sufficiency of Pontryagin's minimum principle under additional assumptions. △ Less

Submitted 16 December, 2022; v1 submitted 20 September, 2022; originally announced September 2022.

arXiv:2209.01046 [pdf, other]

Verifying $k$-Contraction without Computing $k$-Compounds

Authors: Omri Dalin, Ron Ofir, Eyal Bar Shalom, Alexander Ovseevich, Francesco Bullo, Michael Margaliot

Abstract: Compound matrices have found applications in many fields of science including systems and control theory. In particular, a sufficient condition for $k$-contraction is that a logarithmic norm (also called matrix measure) of the $k$-additive compound of the Jacobian is uniformly negative. However, this may be difficult to check in practice because the $k$-additive compound of an $n\times n$ matrix h… ▽ More Compound matrices have found applications in many fields of science including systems and control theory. In particular, a sufficient condition for $k$-contraction is that a logarithmic norm (also called matrix measure) of the $k$-additive compound of the Jacobian is uniformly negative. However, this may be difficult to check in practice because the $k$-additive compound of an $n\times n$ matrix has dimensions $\binom{n}{k}\times \binom{n}{k}$. For an $n\times n$ matrix $A$, we prove a duality relation between the $k$ and $(n-k)$ compounds of $A$. We use this duality relation to derive a sufficient condition for $k$-contraction that does not require the computation of any $k$-compounds. We demonstrate our results by deriving a sufficient condition for $k$-contraction of an $n$-dimensional Hopfield network that does not require to compute any compounds. In particular, for $k=2$ this sufficient condition implies that the network is $2$-contracting and this implies a strong asymptotic property: every bounded solution of the network converges to an equilibrium point, that may not be unique. This is relevant, for example, when using the Hopfield network as an associative memory that stores patterns as equilibrium points of the dynamics. △ Less

Submitted 2 September, 2022; originally announced September 2022.

arXiv:2208.03889 [pdf, other]

Robust Training and Verification of Implicit Neural Networks: A Non-Euclidean Contractive Approach

Authors: Saber Jafarpour, Alexander Davydov, Matthew Abate, Francesco Bullo, Samuel Coogan

Abstract: This paper proposes a theoretical and computational framework for training and robustness verification of implicit neural networks based upon non-Euclidean contraction theory. The basic idea is to cast the robustness analysis of a neural network as a reachability problem and use (i) the $\ell_{\infty}$-norm input-output Lipschitz constant and (ii) the tight inclusion function of the network to ove… ▽ More This paper proposes a theoretical and computational framework for training and robustness verification of implicit neural networks based upon non-Euclidean contraction theory. The basic idea is to cast the robustness analysis of a neural network as a reachability problem and use (i) the $\ell_{\infty}$-norm input-output Lipschitz constant and (ii) the tight inclusion function of the network to over-approximate its reachable sets. First, for a given implicit neural network, we use $\ell_{\infty}$-matrix measures to propose sufficient conditions for its well-posedness, design an iterative algorithm to compute its fixed points, and provide upper bounds for its $\ell_\infty$-norm input-output Lipschitz constant. Second, we introduce a related embedded network and show that the embedded network can be used to provide an $\ell_\infty$-norm box over-approximation of the reachable sets of the original network. Moreover, we use the embedded network to design an iterative algorithm for computing the upper bounds of the original system's tight inclusion function. Third, we use the upper bounds of the Lipschitz constants and the upper bounds of the tight inclusion functions to design two algorithms for the training and robustness verification of implicit neural networks. Finally, we apply our algorithms to train implicit neural networks on the MNIST dataset and compare the robustness of our models with the models trained via existing approaches in the literature. △ Less

Submitted 7 August, 2022; originally announced August 2022.

Comments: arXiv admin note: text overlap with arXiv:2112.05310

arXiv:2207.14579 [pdf, other]

doi 10.1137/22M1512600

The Yakubovich S-Lemma Revisited: Stability and Contractivity in Non-Euclidean Norms

Authors: Anton V. Proskurnikov, Alexander Davydov, Francesco Bullo

Abstract: The celebrated S-Lemma was originally proposed to ensure the existence of a quadratic Lyapunov function in the Lur'e problem of absolute stability. A quadratic Lyapunov function is, however, nothing else than a squared Euclidean norm on the state space (that is, a norm induced by an inner product). A natural question arises as to whether squared non-Euclidean norms $V(x)=\|x\|^2$ may serve as Lyap… ▽ More The celebrated S-Lemma was originally proposed to ensure the existence of a quadratic Lyapunov function in the Lur'e problem of absolute stability. A quadratic Lyapunov function is, however, nothing else than a squared Euclidean norm on the state space (that is, a norm induced by an inner product). A natural question arises as to whether squared non-Euclidean norms $V(x)=\|x\|^2$ may serve as Lyapunov functions in stability problems. This paper presents a novel non-polynomial S-Lemma that leads to constructive criteria for the existence of such functions defined by weighted $\ell_p$ norms. Our generalized S-Lemma leads to new absolute stability and absolute contractivity criteria for Lur'e-type systems, including, for example, a new simple proof of the Aizerman and Kalman conjectures for positive Lur'e systems. △ Less

Submitted 2 February, 2023; v1 submitted 29 July, 2022; originally announced July 2022.

MSC Class: 34H05; 93C15

arXiv:2207.08596 [pdf, other]

Data-driven Self-triggered Control via Trajectory Prediction

Authors: Wenjie Liu, Jian Sun, Gang Wang, Francesco Bullo, Jie Chen

Abstract: Self-triggered control, a well-documented technique for reducing the communication overhead while ensuring desired system performance, is gaining increasing popularity. However, existing methods for self-triggered control require explicit system models that are assumed perfectly known a priori. An end-to-end control paradigm known as data-driven control learns control laws directly from data, and… ▽ More Self-triggered control, a well-documented technique for reducing the communication overhead while ensuring desired system performance, is gaining increasing popularity. However, existing methods for self-triggered control require explicit system models that are assumed perfectly known a priori. An end-to-end control paradigm known as data-driven control learns control laws directly from data, and offers a competing alternative to the routine system identification-then-control method. In this context, the present paper puts forth data-driven self-triggered control schemes for unknown linear systems using data collected offline. Specifically, for output feedback control systems, a data-driven model predictive control (MPC) scheme is proposed, which computes a sequence of control inputs while generating a predicted system trajectory. A data-driven self-triggering law is designed using the predicted trajectory, to determine the next triggering time once a new measurement becomes available. For state feedback control systems, instead of capitalizing on MPC to predict the trajectory, a data-fitting problem using the pre-collected input-state data is solved, whose solution is employed to construct the self-triggering mechanism. Both feasibility and stability are established for the proposed self-triggered controllers, which are validated using numerical examples. △ Less

Submitted 18 July, 2022; originally announced July 2022.

arXiv:2204.13610 [pdf, other]

How social influence affects the wisdom of crowds in influence networks

Authors: Ye Tian, Long Wang, Francesco Bullo

Abstract: A long-standing debate is whether social influence improves the collective wisdom of a crowd or undermines it. This paper addresses this question based on a naive learning setting in influence systems theory: in our models individuals evolve their estimates of an unknown truth according to the weighted-average opinion dynamics. A formal mathematization is provided with rigorous theoretical analysi… ▽ More A long-standing debate is whether social influence improves the collective wisdom of a crowd or undermines it. This paper addresses this question based on a naive learning setting in influence systems theory: in our models individuals evolve their estimates of an unknown truth according to the weighted-average opinion dynamics. A formal mathematization is provided with rigorous theoretical analysis. We obtain various conditions for improving, optimizing and undermining the crowd accuracy, respectively. We prove that if the wisdom of finite-size group is improved, then the collective estimate converges to the truth as group size increases, provided individuals' variances are finite. We show that whether social influence improves or undermines the wisdom is determined by the social power allocation of the influence system: if the influence system allocates relatively larger social power to relatively more accurate individuals, it improves the wisdom; on the contrary, if the influence system assigns less social power to more accurate individuals, it undermines the wisdom. At a population level, individuals' susceptibilities to interpersonal influence and network centralities are both crucial. To improve the wisdom, more accurate individuals should be less susceptible and have larger network centralities. Particularly, in democratic influence networks, if relatively more accurate individuals are relatively less susceptible, the wisdom is improved; if more accurate individuals are more susceptible, the wisdom is undermined, which is consistent with the reported empirical evidence. Our investigation provides a theoretical framework for understanding the role social influence plays in the emergence of collective wisdom. △ Less

Submitted 4 March, 2023; v1 submitted 28 April, 2022; originally announced April 2022.

arXiv:2204.05382 [pdf, other]

Modeling and Contractivity of Neural-Synaptic Networks with Hebbian Learning

Authors: Veronica Centorrino, Francesco Bullo, Giovanni Russo

Abstract: This paper is concerned with the modeling and analysis of two of the most commonly used recurrent neural network models (i.e., Hopfield neural network and firing-rate neural network) with dynamic recurrent connections undergoing Hebbian learning rules. To capture the synaptic sparsity of neural circuits we propose a low dimensional formulation. We then characterize certain key dynamical properties… ▽ More This paper is concerned with the modeling and analysis of two of the most commonly used recurrent neural network models (i.e., Hopfield neural network and firing-rate neural network) with dynamic recurrent connections undergoing Hebbian learning rules. To capture the synaptic sparsity of neural circuits we propose a low dimensional formulation. We then characterize certain key dynamical properties. First, we give biologically-inspired forward invariance results. Then, we give sufficient conditions for the non-Euclidean contractivity of the models. Our contraction analysis leads to stability and robustness of time-varying trajectories -- for networks with both excitatory and inhibitory synapses governed by both Hebbian and anti-Hebbian rules. For each model, we propose a contractivity test based upon biologically meaningful quantities, e.g., neural and synaptic decay rate, maximum in-degree, and the maximum synaptic strength. Then, we show that the models satisfy Dale's Principle. Finally, we illustrate the effectiveness of our results via a numerical example. △ Less

Submitted 22 March, 2024; v1 submitted 11 April, 2022; originally announced April 2022.

Comments: 24 pages, 4 figures

arXiv:2204.01877 [pdf, other]

doi 10.1109/CDC51059.2022.9993197

Non-Euclidean Monotone Operator Theory with Applications to Recurrent Neural Networks

Authors: Alexander Davydov, Saber Jafarpour, Anton V. Proskurnikov, Francesco Bullo

Abstract: We provide a novel transcription of monotone operator theory to the non-Euclidean finite-dimensional spaces $\ell_1$ and $\ell_{\infty}$. We first establish properties of mappings which are monotone with respect to the non-Euclidean norms $\ell_1$ or $\ell_{\infty}$. In analogy with their Euclidean counterparts, mappings which are monotone with respect to a non-Euclidean norm are amenable to numer… ▽ More We provide a novel transcription of monotone operator theory to the non-Euclidean finite-dimensional spaces $\ell_1$ and $\ell_{\infty}$. We first establish properties of mappings which are monotone with respect to the non-Euclidean norms $\ell_1$ or $\ell_{\infty}$. In analogy with their Euclidean counterparts, mappings which are monotone with respect to a non-Euclidean norm are amenable to numerous algorithms for computing their zeros. We demonstrate that several classic iterative methods for computing zeros of monotone operators are directly applicable in the non-Euclidean framework. We present a case-study in the equilibrium computation of recurrent neural networks and demonstrate that casting the computation as a suitable operator splitting problem improves convergence rates. △ Less

Submitted 18 July, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

arXiv:2204.00187 [pdf, other]

Comparative Analysis of Interval Reachability for Robust Implicit and Feedforward Neural Networks

Authors: Alexander Davydov, Saber Jafarpour, Matthew Abate, Francesco Bullo, Samuel Coogan

Abstract: We use interval reachability analysis to obtain robustness guarantees for implicit neural networks (INNs). INNs are a class of implicit learning models that use implicit equations as layers and have been shown to exhibit several notable benefits over traditional deep neural networks. We first establish that tight inclusion functions of neural networks, which provide the tightest rectangular over-a… ▽ More We use interval reachability analysis to obtain robustness guarantees for implicit neural networks (INNs). INNs are a class of implicit learning models that use implicit equations as layers and have been shown to exhibit several notable benefits over traditional deep neural networks. We first establish that tight inclusion functions of neural networks, which provide the tightest rectangular over-approximation of an input-output map, lead to sharper robustness guarantees than the well-studied robustness measures of local Lipschitz constants. Like Lipschitz constants, tight inclusions functions are computationally challenging to obtain, and we thus propose using mixed monotonicity and contraction theory to obtain computationally efficient estimates of tight inclusion functions for INNs. We show that our approach performs at least as well as, and generally better than, applying state-of-the-art interval bound propagation methods to INNs. We design a novel optimization problem for training robust INNs and we provide empirical evidence that suitably-trained INNs can be more robust than comparably-trained feedforward networks. △ Less

Submitted 31 March, 2022; originally announced April 2022.

arXiv:2203.10392 [pdf, other]

Minimum effort decentralized control design for contracting network systems

Authors: Ron Ofir, Francesco Bullo, Michael Margaliot

Abstract: We consider the problem of making a networked system contracting by designing minimal effort local controllers. Our method combines a hierarchical contraction characterization and a matrix-balancing approach to stabilizing a Metzler matrix via minimal diagonal perturbations. We demonstrate our approach by designing local controllers that render contractive a network of FitzHugh-Nagumo neurons with… ▽ More We consider the problem of making a networked system contracting by designing minimal effort local controllers. Our method combines a hierarchical contraction characterization and a matrix-balancing approach to stabilizing a Metzler matrix via minimal diagonal perturbations. We demonstrate our approach by designing local controllers that render contractive a network of FitzHugh-Nagumo neurons with a general topology of interactions. △ Less

Submitted 19 March, 2022; originally announced March 2022.

arXiv:2202.02439 [pdf, other]

doi 10.1038/s41467-022-32931-8

Multistability and anomalies in oscillator models of lossy power grids

Authors: Robin Delabays, Saber Jafarpour, Francesco Bullo

Abstract: The analysis of dissipatively coupled oscillators is challenging and highly relevant in power grids. Standard mathematical methods are not applicable, due to the lack of network symmetry induced by dissipative couplings. Here we demonstrate a close correspondence between stable synchronous states in dissipatively coupled oscillators, and the {winding partition} of their state space, a geometric no… ▽ More The analysis of dissipatively coupled oscillators is challenging and highly relevant in power grids. Standard mathematical methods are not applicable, due to the lack of network symmetry induced by dissipative couplings. Here we demonstrate a close correspondence between stable synchronous states in dissipatively coupled oscillators, and the {winding partition} of their state space, a geometric notion induced by the network topology. Leveraging this winding partition, we accompany this article with an algorithms to compute all synchronous solutions of complex networks of dissipatively coupled oscillators. These geometric and computational tools allow us to identify anomalous behaviors of lossy networked systems. Counterintuitively, we show that loop flows and dissipation can increase the system's transfer capacity, and that dissipation can promote multistability. We apply our geometric framework to compute power flows on the IEEE RTS-96 test system, where we identify two high voltage solutions with distinct loop flows. △ Less

Submitted 6 September, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

Comments: Main text: 14 pages, 5 figures. Supplementary information: 6 pages, 2 figures

Journal ref: Nat. Commun. 13, 5238 (2022)

arXiv:2201.03310 [pdf, other]

Minimax Flow over Acyclic Networks: Distributed Algorithms and Microgrid Application

Authors: Marco Coraggio, Saber Jafarpour, Francesco Bullo, Mario di Bernardo

Abstract: Given a flow network with variable suppliers and fixed consumers, the minimax flow problem consists in minimizing the maximum flow between nodes, subject to flow conservation and capacity constraints. We solve this problem over acyclic graphs in a distributed manner by showing that it can be recast as a consensus problem between the maximum downstream flows, which we define here for the first time… ▽ More Given a flow network with variable suppliers and fixed consumers, the minimax flow problem consists in minimizing the maximum flow between nodes, subject to flow conservation and capacity constraints. We solve this problem over acyclic graphs in a distributed manner by showing that it can be recast as a consensus problem between the maximum downstream flows, which we define here for the first time. Additionally, we present a distributed algorithm to estimate these quantities. Finally, exploiting our theoretical results, we design an online distributed controller to prevent overcurrent in microgrids consisting of loads and droop-controlled inverters. Our results are validated numerically on the CIGRE benchmark microgrid. △ Less

Submitted 9 July, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

arXiv:2201.03103 [pdf, other]

Dual Seminorms, Ergodic Coefficients and Semicontraction Theory

Authors: Giulia De Pasquale, Kevin D. Smith, Francesco Bullo, Maria Elena Valcher

Abstract: Dynamical systems that are contracting on a subspace are said to be semicontracting. Semicontraction theory is a useful tool in the study of consensus algorithms and dynamical flow systems such as Markov chains. To develop a comprehensive theory of semicontracting systems, we investigate seminorms on vector spaces and define two canonical notions: projection and distance semi-norms. We show that t… ▽ More Dynamical systems that are contracting on a subspace are said to be semicontracting. Semicontraction theory is a useful tool in the study of consensus algorithms and dynamical flow systems such as Markov chains. To develop a comprehensive theory of semicontracting systems, we investigate seminorms on vector spaces and define two canonical notions: projection and distance semi-norms. We show that the well-known lp ergodic coefficients are induced matrix seminorms and play a central role in stability problems. In particular, we formulate a duality theorem that explains why the Markov-Dobrushin coefficient is the rate of contraction for both averaging and conservation flows in discrete time. Moreover, we obtain parallel results for induced matrix log seminorms. Finally, we propose comprehensive theorems for strong semicontractivity of linear and non-linear time-varying dynamical systems with invariance and conservation properties both in discrete and continuous time. △ Less

Submitted 20 December, 2022; v1 submitted 9 January, 2022; originally announced January 2022.

arXiv:2201.02759 [pdf, other]

Modeling Human-AI Team Decision Making

Authors: Wei Ye, Francesco Bullo, Noah Friedkin, Ambuj K Singh

Abstract: AI and humans bring complementary skills to group deliberations. Modeling this group decision making is especially challenging when the deliberations include an element of risk and an exploration-exploitation process of appraising the capabilities of the human and AI agents. To investigate this question, we presented a sequence of intellective issues to a set of human groups aided by imperfect AI… ▽ More AI and humans bring complementary skills to group deliberations. Modeling this group decision making is especially challenging when the deliberations include an element of risk and an exploration-exploitation process of appraising the capabilities of the human and AI agents. To investigate this question, we presented a sequence of intellective issues to a set of human groups aided by imperfect AI agents. A group's goal was to appraise the relative expertise of the group's members and its available AI agents, evaluate the risks associated with different actions, and maximize the overall reward by reaching consensus. We propose and empirically validate models of human-AI team decision making under such uncertain circumstances, and show the value of socio-cognitive constructs of prospect theory, influence dynamics, and Bayesian learning in predicting the behavior of human-AI groups. △ Less

Submitted 7 January, 2022; originally announced January 2022.

Comments: 12 pages, preprint

arXiv:2112.05310 [pdf, other]

Robustness Certificates for Implicit Neural Networks: A Mixed Monotone Contractive Approach

Authors: Saber Jafarpour, Matthew Abate, Alexander Davydov, Francesco Bullo, Samuel Coogan

Abstract: Implicit neural networks are a general class of learning models that replace the layers in traditional feedforward models with implicit algebraic equations. Compared to traditional learning models, implicit networks offer competitive performance and reduced memory consumption. However, they can remain brittle with respect to input adversarial perturbations. This paper proposes a theoretical and… ▽ More Implicit neural networks are a general class of learning models that replace the layers in traditional feedforward models with implicit algebraic equations. Compared to traditional learning models, implicit networks offer competitive performance and reduced memory consumption. However, they can remain brittle with respect to input adversarial perturbations. This paper proposes a theoretical and computational framework for robustness verification of implicit neural networks; our framework blends together mixed monotone systems theory and contraction theory. First, given an implicit neural network, we introduce a related embedded network and show that, given an $\ell_\infty$-norm box constraint on the input, the embedded network provides an $\ell_\infty$-norm box overapproximation for the output of the given network. Second, using $\ell_{\infty}$-matrix measures, we propose sufficient conditions for well-posedness of both the original and embedded system and design an iterative algorithm to compute the $\ell_{\infty}$-norm box robustness margins for reachability and classification problems. Third, of independent value, we propose a novel relative classifier variable that leads to tighter bounds on the certified adversarial robustness in classification problems. Finally, we perform numerical simulations on a Non-Euclidean Monotone Operator Network (NEMON) trained on the MNIST dataset. In these simulations, we compare the accuracy and run time of our mixed monotone contractive approach with the existing robustness verification approaches in the literature for estimating the certified adversarial robustness. △ Less

Submitted 9 December, 2021; originally announced December 2021.

arXiv:2110.12766 [pdf, other]

Data-Driven Resilient Predictive Control under Denial-of-Service

Authors: Wenjie Liu, Jian Sun, Gang Wang, Francesco Bullo, Jie Chen

Abstract: The study of resilient control of linear time-invariant (LTI) systems against denial-of-service (DoS) attacks is gaining popularity in emerging cyber-physical applications. In previous works, explicit system models are required to design a predictor-based resilient controller. These models can be either given a priori or obtained through a prior system identification step. Recent research efforts… ▽ More The study of resilient control of linear time-invariant (LTI) systems against denial-of-service (DoS) attacks is gaining popularity in emerging cyber-physical applications. In previous works, explicit system models are required to design a predictor-based resilient controller. These models can be either given a priori or obtained through a prior system identification step. Recent research efforts have focused on data-driven control based on pre-collected input-output trajectories (i.e., without explicit system models). In this paper, we take an initial step toward data-driven stabilization of stochastic LTI systems under DoS attacks, and develop a resilient model predictive control (MPC) scheme driven purely by data-dependent conditions. The proposed data-driven control method achieves the same level of resilience as the model-based control method. For example, local input-to-state stability (ISS) is achieved under mild assumptions on the noise and the DoS attacks. To recover global ISS, two modifications are further suggested at the price of reduced resilience against DoS attacks or increased computational complexity. Finally, a numerical example is given to validate the effectiveness of the proposed control method. △ Less

Submitted 25 October, 2021; originally announced October 2021.

arXiv:2110.08298 [pdf, other]

Non-Euclidean Contraction Analysis of Continuous-Time Neural Networks

Authors: Alexander Davydov, Anton V. Proskurnikov, Francesco Bullo

Abstract: Critical questions in dynamical neuroscience and machine learning are related to the study of continuous-time neural networks and their stability, robustness, and computational efficiency. These properties can be simultaneously established via a contraction analysis. This paper develops a comprehensive non-Euclidean contraction theory for continuous-time neural networks. Specifically, we provide n… ▽ More Critical questions in dynamical neuroscience and machine learning are related to the study of continuous-time neural networks and their stability, robustness, and computational efficiency. These properties can be simultaneously established via a contraction analysis. This paper develops a comprehensive non-Euclidean contraction theory for continuous-time neural networks. Specifically, we provide novel sufficient conditions for the contractivity of general classes of continuous-time neural networks including Hopfield, firing rate, Persidskii, Lur'e, and other neural networks with respect to the non-Euclidean $\ell_1/\ell_\infty$ norms. These sufficient conditions are based upon linear programming or, in some special cases, establishing the Hurwitzness of a particular Metzler matrix. To prove these sufficient conditions, we develop novel results on non-Euclidean logarithmic norms and a novel necessary and sufficient condition for contractivity of systems with locally Lipschitz dynamics. For each model, we apply our theoretical results to compute the optimal contraction rate and corresponding weighted non-Euclidean norm with respect to which the neural network is contracting. △ Less

Submitted 29 June, 2024; v1 submitted 15 October, 2021; originally announced October 2021.

arXiv:2110.03623 [pdf, other]

doi 10.1109/CDC45484.2021.9682883

From Contraction Theory to Fixed Point Algorithms on Riemannian and Non-Euclidean Spaces

Authors: Francesco Bullo, Pedro Cisneros-Velarde, Alexander Davydov, Saber Jafarpour

Abstract: The design of fixed point algorithms is at the heart of monotone operator theory, convex analysis, and of many modern optimization problems arising in machine learning and control. This tutorial reviews recent advances in understanding the relationship between Demidovich conditions, one-sided Lipschitz conditions, and contractivity theorems. We review the standard contraction theory on Euclidean s… ▽ More The design of fixed point algorithms is at the heart of monotone operator theory, convex analysis, and of many modern optimization problems arising in machine learning and control. This tutorial reviews recent advances in understanding the relationship between Demidovich conditions, one-sided Lipschitz conditions, and contractivity theorems. We review the standard contraction theory on Euclidean spaces as well as little-known results for Riemannian manifolds. Special emphasis is placed on the setting of non-Euclidean norms and the recently introduced weak pairings for the $\ell_1$ and $\ell_\infty$ norms. We highlight recent results on explicit and implicit fixed point schemes for non-Euclidean contracting systems. △ Less

Submitted 7 October, 2021; originally announced October 2021.

Comments: Paper in the invited tutorial session "Contraction Theory for Machine Learning" at 60th IEEE Conference on Decision and Control, 2021

arXiv:2109.07643 [pdf, other]

Convex Optimization of the Basic Reproduction Number

Authors: Kevin D. Smith, Francesco Bullo

Abstract: The basic reproduction number $R_0$ is a fundamental quantity in epidemiological modeling, reflecting the typical number of secondary infections that arise from a single infected individual. While $R_0$ is widely known to scientists, policymakers, and the general public, it has received comparatively little attention in the controls community. This note provides two novel characterizations of… ▽ More The basic reproduction number $R_0$ is a fundamental quantity in epidemiological modeling, reflecting the typical number of secondary infections that arise from a single infected individual. While $R_0$ is widely known to scientists, policymakers, and the general public, it has received comparatively little attention in the controls community. This note provides two novel characterizations of $R_0$: a stability characterization and a geometric program characterization. The geometric program characterization allows us to write $R_0$-constrained and budget-constrained optimal resource allocation problems as geometric programs, which are easily transformed into convex optimization problems. We apply these programs to allocating vaccines and antidotes in numerical examples, finding that targeting $R_0$ instead of the spectral abscissa of the Jacobian matrix (a common target in the controls literature) leads to qualitatively different solutions. △ Less

Submitted 2 September, 2022; v1 submitted 15 September, 2021; originally announced September 2021.

arXiv:2106.03194 [pdf, other]

Robust Implicit Networks via Non-Euclidean Contractions

Authors: Saber Jafarpour, Alexander Davydov, Anton V. Proskurnikov, Francesco Bullo

Abstract: Implicit neural networks, a.k.a., deep equilibrium networks, are a class of implicit-depth learning models where function evaluation is performed by solving a fixed point equation. They generalize classic feedforward models and are equivalent to infinite-depth weight-tied feedforward networks. While implicit models show improved accuracy and significant reduction in memory consumption, they can su… ▽ More Implicit neural networks, a.k.a., deep equilibrium networks, are a class of implicit-depth learning models where function evaluation is performed by solving a fixed point equation. They generalize classic feedforward models and are equivalent to infinite-depth weight-tied feedforward networks. While implicit models show improved accuracy and significant reduction in memory consumption, they can suffer from ill-posedness and convergence instability. This paper provides a new framework, which we call Non-Euclidean Monotone Operator Network (NEMON), to design well-posed and robust implicit neural networks based upon contraction theory for the non-Euclidean norm $\ell_{\infty}$. Our framework includes (i) a novel condition for well-posedness based on one-sided Lipschitz constants, (ii) an average iteration for computing fixed-points, and (iii) explicit estimates on input-output Lipschitz constants. Additionally, we design a training problem with the well-posedness condition and the average iteration as constraints and, to achieve robust models, with the input-output Lipschitz constant as a regularizer. Our $\ell_{\infty}$ well-posedness condition leads to a larger polytopic training search space than existing conditions and our average iteration enjoys accelerated convergence. Finally, we evaluate our framework in image classification through the MNIST and the CIFAR-10 datasets. Our numerical results demonstrate improved accuracy and robustness of the implicit models with smaller input-output Lipschitz bounds. Code is available at https://github.com/davydovalexander/Non-Euclidean_Mon_Op_Net. △ Less

Submitted 25 January, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

arXiv:2105.08832 [pdf, ps, other]

A Contraction Theory Approach to Optimization Algorithms from Acceleration Flows

Authors: Pedro Cisneros-Velarde, Francesco Bullo

Abstract: Much recent interest has focused on the design of optimization algorithms from the discretization of an associated optimization flow, i.e., a system of differential equations (ODEs) whose trajectories solve an associated optimization problem. Such a design approach poses an important problem: how to find a principled methodology to design and discretize appropriate ODEs. This paper aims to provide… ▽ More Much recent interest has focused on the design of optimization algorithms from the discretization of an associated optimization flow, i.e., a system of differential equations (ODEs) whose trajectories solve an associated optimization problem. Such a design approach poses an important problem: how to find a principled methodology to design and discretize appropriate ODEs. This paper aims to provide a solution to this problem through the use of contraction theory. We first introduce general mathematical results that explain how contraction theory guarantees the stability of the implicit and explicit Euler integration methods. Then, we propose a novel system of ODEs, namely the Accelerated-Contracting-Nesterov flow, and use contraction theory to establish it is an optimization flow with exponential convergence rate, from which the linear convergence rate of its associated optimization algorithm is immediately established. Remarkably, a simple explicit Euler discretization of this flow corresponds to the Nesterov acceleration method. Finally, we present how our approach leads to performance guarantees in the design of optimization algorithms for time-varying optimization problems. △ Less

Submitted 31 January, 2022; v1 submitted 18 May, 2021; originally announced May 2021.

MSC Class: 34D05; 34D23; 34H15; 65L20; 65K05

arXiv:2104.01321 [pdf, other]

Non-Euclidean Contraction Theory for Monotone and Positive Systems

Authors: Saber Jafarpour, Alexander Davydov, Francesco Bullo

Abstract: In this note we study contractivity of monotone systems and exponential convergence of positive systems using non-Euclidean norms. We first introduce the notion of conic matrix measure as a framework to study stability of monotone and positive systems. We study properties of the conic matrix measures and investigate their connection with weak pairings and standard matrix measures. Using conic matr… ▽ More In this note we study contractivity of monotone systems and exponential convergence of positive systems using non-Euclidean norms. We first introduce the notion of conic matrix measure as a framework to study stability of monotone and positive systems. We study properties of the conic matrix measures and investigate their connection with weak pairings and standard matrix measures. Using conic matrix measures and weak pairings, we characterize contractivity and incremental stability of monotone systems with respect to non-Euclidean norms. Moreover, we use conic matrix measures to provide sufficient conditions for exponential convergence of positive systems to their equilibria. We show that our framework leads to novel results on (i) the contractivity of excitatory Hopfield neural networks, and (ii) the stability of interconnected systems using non-monotone positive comparison systems. △ Less

Submitted 21 August, 2022; v1 submitted 3 April, 2021; originally announced April 2021.

arXiv:2103.12263 [pdf, other]

doi 10.1109/TAC.2022.3183966

Non-Euclidean Contraction Theory for Robust Nonlinear Stability

Authors: Alexander Davydov, Saber Jafarpour, Francesco Bullo

Abstract: We study necessary and sufficient conditions for contraction and incremental stability of dynamical systems with respect to non-Euclidean norms. First, we introduce weak pairings as a framework to study contractivity with respect to arbitrary norms, and characterize their properties. We introduce and study the sign and max pairings for the $\ell_1$ and $\ell_\infty$ norms, respectively. Using weak… ▽ More We study necessary and sufficient conditions for contraction and incremental stability of dynamical systems with respect to non-Euclidean norms. First, we introduce weak pairings as a framework to study contractivity with respect to arbitrary norms, and characterize their properties. We introduce and study the sign and max pairings for the $\ell_1$ and $\ell_\infty$ norms, respectively. Using weak pairings, we establish five equivalent characterizations for contraction, including the one-sided Lipschitz condition for the vector field as well as matrix measure and Demidovich conditions for the corresponding Jacobian. Third, we extend our contraction framework in two directions: we prove equivalences for contraction of continuous vector fields and we formalize the weaker notion of equilibrium contraction, which ensures exponential convergence to an equilibrium. Finally, as an application, we provide (i) incremental input-to-state stability and finite input-state gain properties for contracting systems, and (ii) a general theorem about the Lipschitz interconnection of contracting systems, whereby the Hurwitzness of a gain matrix implies the contractivity of the interconnected system. △ Less

Submitted 1 August, 2022; v1 submitted 22 March, 2021; originally announced March 2021.

arXiv:2103.11862 [pdf, other]

doi 10.1109/TAC.2021.3107145

Resilient Control under Quantization and Denial-of-Service: Co-designing a Deadbeat Controller and Transmission Protocol

Authors: Wenjie Liu, Jian Sun, Gang Wang, Francesco Bullo, Jie Chen

Abstract: This paper is concerned with the problem of stabilizing continuous-time linear time-invariant systems subject to quantization and Denial-of-Service (DoS) attacks. In this context, two DoS-induced challenges emerge with the design of resilient encoding schemes, namely, the coupling between encoding strategies of different signals, and the synchronization between the encoder and decoder. To address… ▽ More This paper is concerned with the problem of stabilizing continuous-time linear time-invariant systems subject to quantization and Denial-of-Service (DoS) attacks. In this context, two DoS-induced challenges emerge with the design of resilient encoding schemes, namely, the coupling between encoding strategies of different signals, and the synchronization between the encoder and decoder. To address these challenges, a novel structure that is equipped with a deadbeat controller as well as a delicate transmission protocol for the input and output channels, co-designed leveraging the controllability index, is put forward. When both input and output channels are subject to DoS attacks and quantization, the proposed structure is shown able to decouple the encoding schemes for input, output, and estimated output signals. This property is further corroborated by designing encoding schemes as well as conditions that ensure exponential stability of the closed-loop system. On the other hand, when only the output channel is subject to network phenomenon, the proposed structure can achieve exponential stabilization without acknowledgment (ACK) signals, in contrast to existing ACK-based results. Finally, a numerical example is given to demonstrate the practical merits of the proposed approach as well as the theory. △ Less

Submitted 22 March, 2021; originally announced March 2021.

Journal ref: IEEE Transactions on Automatic Control, 2021

arXiv:2012.08610 [pdf, other]

Distributed Wasserstein Barycenters via Displacement Interpolation

Authors: Pedro Cisneros-Velarde, Francesco Bullo

Abstract: Consider a multi-agent system whereby each agent has an initial probability measure. In this paper, we propose a distributed algorithm based upon stochastic, asynchronous and pairwise exchange of information and displacement interpolation in the Wasserstein space. We characterize the evolution of this algorithm and prove it computes the Wasserstein barycenter of the initial measures under various… ▽ More Consider a multi-agent system whereby each agent has an initial probability measure. In this paper, we propose a distributed algorithm based upon stochastic, asynchronous and pairwise exchange of information and displacement interpolation in the Wasserstein space. We characterize the evolution of this algorithm and prove it computes the Wasserstein barycenter of the initial measures under various conditions. One version of the algorithm computes a standard Wasserstein barycenter, i.e., a barycenter based upon equal weights; and the other version computes a randomized Wasserstein barycenter, i.e., a barycenter based upon random weights for the initial measures. Finally, we specialize our algorithm to Gaussian distributions and draw a connection with the modeling of opinion dynamics in mathematical sociology. △ Less

Submitted 25 February, 2022; v1 submitted 15 December, 2020; originally announced December 2020.

Comments: 25 pages, 4 figures

MSC Class: 60J20 (Primary); 49N99; 46N10 (Secondary)

arXiv:2011.07604 [pdf, other]

Stochastic Strategies for Robotic Surveillance as Stackelberg Games

Authors: Xiaoming Duan, Dario Paccagnan, Francesco Bullo

Abstract: This paper studies a stochastic robotic surveillance problem where a mobile robot moves randomly on a graph to capture a potential intruder that strategically attacks a location on the graph. The intruder is assumed to be omniscient: it knows the current location of the mobile agent and can learn the surveillance strategy. The goal for the mobile robot is to design a stochastic strategy so as to m… ▽ More This paper studies a stochastic robotic surveillance problem where a mobile robot moves randomly on a graph to capture a potential intruder that strategically attacks a location on the graph. The intruder is assumed to be omniscient: it knows the current location of the mobile agent and can learn the surveillance strategy. The goal for the mobile robot is to design a stochastic strategy so as to maximize the probability of capturing the intruder. We model the strategic interactions between the surveillance robot and the intruder as a Stackelberg game, and optimal and suboptimal Markov chain based surveillance strategies in star, complete and line graphs are studied. We first derive a universal upper bound on the capture probability, i.e., the performance limit for the surveillance agent. We show that this upper bound is tight in the complete graph and further provide suboptimality guarantees for a natural design. For the star and line graphs, we first characterize dominant strategies for the surveillance agent and the intruder. Then, we rigorously prove the optimal strategy for the surveillance agent. △ Less

Submitted 15 November, 2020; originally announced November 2020.

arXiv:2011.07168 [pdf, other]

Expertise and confidence explain how social influence evolves along intellective tasks

Authors: Omid Askarisichani, Elizabeth Y. Huang, Kekoa S. Sato, Noah E. Friedkin, Francesco Bullo, Ambuj K. Singh

Abstract: Discovering the antecedents of individuals' influence in collaborative environments is an important, practical, and challenging problem. In this paper, we study interpersonal influence in small groups of individuals who collectively execute a sequence of intellective tasks. We observe that along an issue sequence with feedback, individuals with higher expertise and social confidence are accorded h… ▽ More Discovering the antecedents of individuals' influence in collaborative environments is an important, practical, and challenging problem. In this paper, we study interpersonal influence in small groups of individuals who collectively execute a sequence of intellective tasks. We observe that along an issue sequence with feedback, individuals with higher expertise and social confidence are accorded higher interpersonal influence. We also observe that low-performing individuals tend to underestimate their high-performing teammate's expertise. Based on these observations, we introduce three hypotheses and present empirical and theoretical support for their validity. We report empirical evidence on longstanding theories of transactive memory systems, social comparison, and confidence heuristics on the origins of social influence. We propose a cognitive dynamical model inspired by these theories to describe the process by which individuals adjust interpersonal influences over time. We demonstrate the model's accuracy in predicting individuals' influence and provide analytical results on its asymptotic behavior for the case with identically performing individuals. Lastly, we propose a novel approach using deep neural networks on a pre-trained text embedding model for predicting the influence of individuals. Using message contents, message times, and individual correctness collected during tasks, we are able to accurately predict individuals' self-reported influence over time. Extensive experiments verify the accuracy of the proposed models compared to baselines such as structural balance and reflected appraisal model. While the neural networks model is the most accurate, the dynamical model is the most interpretable for influence prediction. △ Less

Submitted 13 November, 2020; originally announced November 2020.

arXiv:2010.01219 [pdf, other]

Contraction Theory for Dynamical Systems on Hilbert Spaces

Authors: Pedro Cisneros-Velarde, Saber Jafarpour, Francesco Bullo

Abstract: Contraction theory for dynamical systems on Euclidean spaces is well-established. For contractive (resp. semi-contractive) systems, the distance (resp. semi-distance) between any two trajectories decreases exponentially fast. For partially contractive systems, each trajectory converges exponentially fast to an invariant subspace. In this note, we develop contraction theory on Hilbert spaces. Fir… ▽ More Contraction theory for dynamical systems on Euclidean spaces is well-established. For contractive (resp. semi-contractive) systems, the distance (resp. semi-distance) between any two trajectories decreases exponentially fast. For partially contractive systems, each trajectory converges exponentially fast to an invariant subspace. In this note, we develop contraction theory on Hilbert spaces. First, we provide a novel integral condition for contractivity, and for time-invariant systems, we establish the existence of a unique globally exponentially stable equilibrium. Second, we introduce the notions of partial and semi-contraction and we provide various sufficient conditions for time-varying and time-invariant systems. Finally, we apply the theory on a classic reaction-diffusion system. △ Less

Submitted 4 June, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

MSC Class: 34G20; 93D23 (Primary) 93D23; 37N99; 93D20 (Secondary)

arXiv:2008.09817 [pdf, other]

Assign and Appraise: Achieving Optimal Performance in Collaborative Teams

Authors: Elizabeth Y. Huang, Dario Paccagnan, Wenjun Mei, Francesco Bullo

Abstract: Tackling complex team problems requires understanding each team member's skills in order to devise a task assignment maximizing the team performance. This paper proposes a novel quantitative model describing the decentralized process by which individuals in a team learn who has what abilities, while concurrently assigning tasks to each of the team members. In the model, the appraisal network repre… ▽ More Tackling complex team problems requires understanding each team member's skills in order to devise a task assignment maximizing the team performance. This paper proposes a novel quantitative model describing the decentralized process by which individuals in a team learn who has what abilities, while concurrently assigning tasks to each of the team members. In the model, the appraisal network represents team member's evaluations of one another and each team member chooses their own workload. The appraisals and workload assignment change simultaneously: each member builds their own local appraisal of neighboring members based on the performance exhibited on previous tasks, while the workload is redistributed based on the current appraisal estimates. We show that the appraisal states can be reduced to a lower dimension due to the presence of conserved quantities associated to the cycles of the appraisal network. Building on this, we provide rigorous results characterizing the ability, or inability, of the team to learn each other's skill and thus converge to an allocation maximizing the team performance. We complement our analysis with extensive numerical experiments. △ Less

Submitted 22 August, 2020; originally announced August 2020.

arXiv:2008.09050 [pdf, other]

Markov Chain-Based Stochastic Strategies for Robotic Surveillance

Authors: Xiaoming Duan, Francesco Bullo

Abstract: This article surveys recent advancements of strategy designs for persistent robotic surveillance tasks with the focus on stochastic approaches. The problem describes how mobile robots stochastically patrol a graph in an efficient way where the efficiency is defined with respect to relevant underlying performance metrics. We first start by reviewing the basics of Markov chains, which is the primary… ▽ More This article surveys recent advancements of strategy designs for persistent robotic surveillance tasks with the focus on stochastic approaches. The problem describes how mobile robots stochastically patrol a graph in an efficient way where the efficiency is defined with respect to relevant underlying performance metrics. We first start by reviewing the basics of Markov chains, which is the primary motion model for stochastic robotic surveillance. Then two main criteria regarding the speed and unpredictability of surveillance strategies are discussed. The central objects that appear throughout the treatment is the hitting times of Markov chains, their distributions and expectations. We formulate various optimization problems based on the concerned metrics in different scenarios and establish their respective properties. △ Less

Submitted 20 August, 2020; originally announced August 2020.

arXiv:2006.09585 [pdf, other]

Policy Evaluation and Seeking for Multi-Agent Reinforcement Learning via Best Response

Authors: Rui Yan, Xiaoming Duan, Zongying Shi, Yisheng Zhong, Jason R. Marden, Francesco Bullo

Abstract: This paper introduces two metrics (cycle-based and memory-based metrics), grounded on a dynamical game-theoretic solution concept called sink equilibrium, for the evaluation, ranking, and computation of policies in multi-agent learning. We adopt strict best response dynamics (SBRD) to model selfish behaviors at a meta-level for multi-agent reinforcement learning. Our approach can deal with dynamic… ▽ More This paper introduces two metrics (cycle-based and memory-based metrics), grounded on a dynamical game-theoretic solution concept called sink equilibrium, for the evaluation, ranking, and computation of policies in multi-agent learning. We adopt strict best response dynamics (SBRD) to model selfish behaviors at a meta-level for multi-agent reinforcement learning. Our approach can deal with dynamical cyclical behaviors (unlike approaches based on Nash equilibria and Elo ratings), and is more compatible with single-agent reinforcement learning than alpha-rank which relies on weakly better responses. We first consider settings where the difference between largest and second largest underlying metric has a known lower bound. With this knowledge we propose a class of perturbed SBRD with the following property: only policies with maximum metric are observed with nonzero probability for a broad class of stochastic games with finite memory. We then consider settings where the lower bound for the difference is unknown. For this setting, we propose a class of perturbed SBRD such that the metrics of the policies observed with nonzero probability differ from the optimal by any given tolerance. The proposed perturbed SBRD addresses the opponent-induced non-stationarity by fixing the strategies of others for the learning agent, and uses empirical game-theoretic analysis to estimate payoffs for each strategy profile obtained due to the perturbation. △ Less

Submitted 20 June, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

Comments: 14 pages, 6 figures

arXiv:2005.11404 [pdf, other]

Multi-group SIS Epidemics with Simplicial and Higher-Order Interactions

Authors: Pedro Cisneros-Velarde, Francesco Bullo

Abstract: This paper analyzes a Susceptible-Infected-Susceptible (SIS) model of epidemic propagation over hypergraphs and, motivated by an important special case, we refer to the model as to the simplicial SIS model. Classically, the multi-group SIS model has assumed pairwise interactions of contagion across groups and thus has been vastly studied in the literature. It is only recently that a renewed specia… ▽ More This paper analyzes a Susceptible-Infected-Susceptible (SIS) model of epidemic propagation over hypergraphs and, motivated by an important special case, we refer to the model as to the simplicial SIS model. Classically, the multi-group SIS model has assumed pairwise interactions of contagion across groups and thus has been vastly studied in the literature. It is only recently that a renewed special attention has been drawn to the study of contagion dynamics over higher-order interactions and over more general graph structures, like simplexes. Previous work on mean-field approximation scalar models of the simplicial SIS model has indicated that a new dynamical behavior domain, compared to the classical SIS model, appears due to the newly introduced higher order interaction terms: both a disease-free equilibrium and an endemic equilibrium co-exist and are both locally asymptotically stable. This paper formally establishes that bistability (as a new epidemiological behavior) also appears in the multi-group simplicial SIS model. We give sufficient conditions over the model's parameters for the appearance of this and the other behavioral domains present in the classical multi-group SIS model. We additionally provide an algorithm to compute the value of the endemic equilibrium and report numerical analysis of the transition from the disease-free domain to the bistable domain. △ Less

Submitted 3 October, 2021; v1 submitted 22 May, 2020; originally announced May 2020.

MSC Class: 92B05; 37N25; 37N99

arXiv:2005.09774 [pdf, other]

Weak and Semi-Contraction for Network Systems and Diffusively-Coupled Oscillators

Authors: Saber Jafarpour, Pedro Cisneros-Velarde, Francesco Bullo

Abstract: We develop two generalizations of contraction theory, namely, semi-contraction and weak-contraction theory. First, using the notion of semi-norm, we propose a geometric framework for semi-contraction theory. We introduce matrix semi-measures and characterize their properties. We show that the spectral abscissa of a matrix is the infimum over weighted semi-measures. For dynamical systems, we use th… ▽ More We develop two generalizations of contraction theory, namely, semi-contraction and weak-contraction theory. First, using the notion of semi-norm, we propose a geometric framework for semi-contraction theory. We introduce matrix semi-measures and characterize their properties. We show that the spectral abscissa of a matrix is the infimum over weighted semi-measures. For dynamical systems, we use the semi-measure of their Jacobian to characterize the contractivity properties of their trajectories. Second, for weakly contracting systems, we prove a dichotomy for the asymptotic behavior of their trajectories and novel sufficient conditions for convergence to an equilibrium. Third, we show that every trajectory of a doubly-contracting system, i.e., a system that is both weakly and semi-contracting, converges to an equilibrium point. Finally, we apply our results to various important network systems including affine averaging and affine flow systems, continuous-time distributed primal-dual algorithms, and networks of diffusively-coupled dynamical systems. For diffusively-coupled systems, the semi-contraction theory leads to a sufficient condition for synchronization that is sharper, in general, than previously-known tests. △ Less

Submitted 4 October, 2020; v1 submitted 19 May, 2020; originally announced May 2020.

Showing 1–50 of 141 results for author: Bullo, F