Optimal transport with constraints: from mirror descent to classical mechanics

Abdullahi Adinoyi Ibrahim (abdullahi.ibrahim@tuebingen.mpg.de), Michael Muehlebach (michaelm@tuebingen.mpg.de), Caterina De Bacco (caterina.debacco@tuebingen.mpg.de)
Max Planck Institute for Intelligent Systems, Cyber Valley, Tübingen 72076, Germany

Abstract

Finding optimal trajectories for multiple traffic demands in a congested network is a challenging task. Optimal transport theory is a principled approach that has been used successfully to study various transportation problems. However, its usage is limited by the lack of principled and flexible ways to incorporate realistic constraints. We propose a physics-based approach to impose constraints flexibly in optimal transport problems. Constraints are included in mirror descent dynamics using the principle of D’Alembert-Lagrange from classical mechanics. This results in a sparse, local and linear approximation of the feasible set, leading in many cases to closed-form updates.

Introduction.

Optimal transport in networks has important applications in different disciplines, in particular in urban transportation networks Arnott and Small (1994). Congestion not only increases travel time for users and decreases productivity, but it also drives air pollution. Reducing congestion and making transportation more efficient are also core objectives of EU policies, as highlighted throughout the EU Transport White Paper and the Strategic Plan 2020-2024 Commission (2011, 2020).

The design of efficient transportation networks is a complex task that requires a multifaceted solution. One of these facets is the problem of finding optimal routes for passengers. This is a well-studied problem in operations research Ahuja et al. (1988), where minimum-cost optimization is often considered to model discrete flows and can be solved using classical techniques from linear programming. In our work, we consider the continuous case, where flows are real-valued quantities. A variety of approaches have been suggested to model transport in networks using techniques from the physics of complex systems Morris and Barthelemy (2012); Gao et al. (2019). Path optimality and congestion control have been studied in discrete settings Noh and Rieger (2002); Dobrin and Duxbury (2001); Bayati et al. (2008) or using the cavity method Yeung and Saad (2012); Yeung et al. (2013). These usually rely on ad-hoc algorithmic updates that depend on the specific type of constraints, and the computational complexity of these updates is greatly influenced by the constraints. Other approaches have been proposed to investigate navigation in complex systems Solé-Ribalta et al. (2016); Gómez-Gardenes and Latora (2008); Lacasa et al. (2009); Sneppen et al. (2005); Rosvall et al. (2005); Zhao et al. (2005); Estrada et al. (2023), where the focus lies on investigating the properties of flows rather than their optimization, as we consider here. In addition, these models often assume that passengers follow their shortest paths, an assumption that may not be satisfied in practice. Adaptation dynamics Tero et al. (2010); Hu and Cai (2013); Ronellenfitsch and Katifori (2016) have been proposed to model biological distribution networks. However, these methods fall short of describing realistic scenarios where transport flows are limited by constraints.

In the following, we cast the problem of designing efficient transportation networks under the broader framework of optimal transport theory (OT) Santambrogio (2015). This has been used to model and optimize various aspects of transport networks such as network design Tero et al. (2010); Ronellenfitsch and Katifori (2016); Baptista et al. (2020); Leite and De Bacco (2022) and traffic flows Bonifaci et al. (2012); Lonardi et al. (2021); Bohn and Magnasco (2007); Ibrahim et al. (2021); Lonardi et al. (2023a). These approaches guarantee a principled and computationally efficient way of solving transportation problems on networks. In addition, they model traffic congestion with a single tuning parameter that enables a transition between opposite traffic regimes, where traffic congestion can either be consolidated or discouraged. In standard OT methods, beyond a few obvious constraints (e.g. conservation of mass), the amount of flow passing through an edge of the transportation network is unconstrained. As a result, traffic tends to concentrate on path trajectories that may be structurally infeasible, which severely limits the applicability of OT models in real-world situations where, for example, roads have a limited capacity of vehicles traveling at the same time. This letter proposes an approach to avoid this crucial flaw of OT models by imposing constraints. Applying this approach significantly impacts the overall network topology induced by the optimal flows, as the resulting path trajectories have different path lengths and traffic distributions than those obtained from unconstrained scenarios.

Our approach has not only a solid foundation via the principle of D’Alembert-Lagrange from classical mechanics Lanczos (1949), but also leads to algorithms that are computationally efficient and have a low implementation complexity. The key idea is to consider mirror descent dynamics of an OT problem, where constraints are included on a velocity level. This leads to a sparse, local and linear approximation of the feasible set which, in many cases, allows for a closed-form update rule, even in situations where the feasible set is nonconvex.

The model.

In analogy with electrical grids or hydraulic networks, we model mass flow on a transportation network using conductivities and flows on network edges. We consider a multi-commodity scenario Lonardi et al. (2021); Bonifaci et al. (2022), where mass of different types $i=1,\dots,M$ can move along different trajectories. The flow $F_{e}^{i}$ of mass of type $i$ along an edge $e=(u,v)$ can be described by $F_{e}^{i}=\mu_{e}(p_{u}^{i}-p_{v}^{i})/\ell_{e}$, where $p_{u}^{i}$ is a pressure potential at node $u$ for passengers of type $i$, $\ell_{e}$ is the length of the edge $e$ and $\mu_{e}$ its conductivity. The latter can be seen as proportional to the size of an edge, and is the main variable of interest in determining optimal trajectories. Once the conductivities are known, the pressure differences can be calculated from Kirchhoff’s law, which in turn determine the flows $F_{e}^{i}$, see Supporting Material (SM) sup. In the absence of constraints, the optimal conductivities are the stationary solutions of the dynamics $\dot{\mu}=f$, where

$$f_{e}=\mu_{e}^{\beta}\frac{\sum_{i}(p_{u}^{i}-p_{v}^{i})^{2}}{\ell_{e}^{2}}-\mu_{e}\equiv\mu_{e}^{\beta-2}|F_{e}|^{2}-\mu_{e}\,, \tag{1}$$

with $F_{e}=(F_{e}^{1},\dots,F_{e}^{M})$ and $|\cdot|$ denoting the Euclidean norm. Intuitively, this equation describes a positive feedback mechanism where conductivities increase for larger fluxes and decrease for negligible ones Tero et al. (2010). It can be shown that the dynamics in Eq. 1 admits a Lyapunov function $\mathfrak{L}_{\beta}$, which can be interpreted as a combination of the cost to operate the network and that of building the infrastructure Lonardi et al. (2021), see SM sup. Moreover, we have that $f=-S\,\nabla\mathfrak{L}_{\beta}$, where $S$ is a diagonal matrix with diagonal entries $S_{e}=2\mu_{e}^{\beta}/\ell_{e}$, and Eq. 1 can therefore be seen as a mirror descent for the cost function $\mathfrak{L}_{\beta}$ Bonifaci (2021). This scaling in $S$ has the advantage of ensuring good behavior of the resulting numerical methods. One can also reinterpret Eq. 1 as a classical gradient descent by applying a suitable transformation Facca and Benzi (2021); we do not explore this here.

Variants of these dynamics have been proposed to model distributions over networks Hu and Cai (2013); Bohn and Magnasco (2007); Katifori et al. (2010); Banavar et al. (2000); Ronellenfitsch and Katifori (2016). The constant $\beta\in(0,2)$ regulates the desired transportation regime. The setting $\beta<1$ penalizes traffic congestion by distributing paths on more edges, $\beta>1$ encourages path consolidation into fewer highways, and $\beta=1$ is shortest path-like.
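The unconstrained right-hand side $f$ of Eq. 1 can be sketched numerically as follows. This is a minimal illustration, not the authors' implementation: it assumes Kirchhoff's law takes the standard form $L\,p^{i}=d^{i}$ with weighted Laplacian $L=B\,\mathrm{diag}(\mu/\ell)\,B^{T}$, a connected network, and demand columns that sum to zero. The names `ot_velocity`, `B` (signed incidence matrix) and `demand` are hypothetical.

```python
import numpy as np

def ot_velocity(mu, length, B, demand, beta):
    """Sketch of f in Eq. 1.

    mu, length : (E,) conductivities and edge lengths
    B          : (N, E) signed incidence matrix, +1 at the tail u and -1 at the head v
    demand     : (N, M) net injected mass per node and commodity (columns sum to zero)
    """
    # Assumed form of Kirchhoff's law: weighted Laplacian times potentials equals demand.
    lap = B @ np.diag(mu / length) @ B.T
    # The Laplacian is singular (potentials are defined up to a constant),
    # so take the minimum-norm least-squares solution.
    p = np.linalg.lstsq(lap, demand, rcond=None)[0]      # (N, M)
    dp = B.T @ p                                         # (E, M): p_u - p_v per edge
    F = (mu / length)[:, None] * dp                      # flows F_e^i
    f = mu**beta * (dp**2).sum(axis=1) / length**2 - mu  # Eq. 1
    return f, F
```

On a small connected graph, `B @ F` should reproduce `demand`, which is a convenient check that mass is conserved at every node.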

In addition to imposing Kirchhoff’s law on nodes to ensure mass conservation, solving these dynamics outputs otherwise unconstrained optimal $\mu_{e}$ and $F_{e}$ (see SM sup). While this may be enough in ideal cases, in more realistic scenarios it is important to further constrain the solution. For instance, structural constraints may limit the maximum amount of flow that an edge can carry, or a budget constraint may be used to limit the infrastructure cost for building the network. Hence, the dynamics $\dot{\mu}=f$ must be altered to account for these additional constraints.
There are many ways in which constraints can be added. A popular approach is to add constraints on a so-called position level, which leads to gradient inclusions in continuous time (Aubin and Cellina, 1984, Ch 3.4), and projected gradient descent in discrete time. Unfortunately, the scope of projected gradients is limited, because projections can only be efficiently evaluated for constraints that have a particular structure (such as a low-dimensional hyperplane, the probability simplex, or a Euclidean norm ball). When the feasible set is nonconvex and/or fails to have a simple structure, evaluating projections is a computationally daunting task. This motivates our formulation (see also Muehlebach and Jordan (2022)), which includes constraints on a velocity level and yields a sparse, local and linear approximation of the feasible set. As a consequence, the updates for $\mu$ can often still be evaluated in closed form (or there is an efficient way of computing them numerically) even though the underlying feasible set is nonconvex or fails to have a simple structure. We will highlight explicit examples of such situations in the remainder of this letter.

For $\mu(t)\not\in C$ and an active constraint $i$, the constraint $\dot{\mu}(t)\in V_{\alpha}(\mu(t))$ is equivalent to $\text{d}g_{i}(\mu(t))/\text{d}t\geq-\alpha g_{i}(\mu(t))$, which ensures that potential constraint violations decay at the rate $\alpha>0$ (the feasible set $C$, the active set $I_{\mu}$ and the set of feasible velocities $V_{\alpha}(\mu)$ are defined in Appendix A). The situation is visualized in Fig. 1(A).
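The decay claim follows from a standard Grönwall-type step, sketched here under the assumption that constraint $i$ remains active on the time interval considered. Multiplying the inequality by $e^{\alpha t}$ gives

```latex
\frac{\mathrm{d}}{\mathrm{d}t}\Big(e^{\alpha t}\,g_{i}(\mu(t))\Big)
  = e^{\alpha t}\Big(\frac{\mathrm{d}g_{i}(\mu(t))}{\mathrm{d}t}+\alpha\,g_{i}(\mu(t))\Big)\;\geq\;0
\quad\Longrightarrow\quad
g_{i}(\mu(t))\;\geq\;e^{-\alpha t}\,g_{i}(\mu(0)) .
```

Since $g_{i}(\mu(0))<0$ for a violated constraint, the violation $-g_{i}(\mu(t))$ is bounded by $e^{-\alpha t}$ times its initial value.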

Figure 1: (A) Visualization of the set $C$ and the set of feasible velocities $V_{\alpha}(\mu_{1})$ and $V_{\alpha}(\mu_{2})$ at points $\mu_{1}$ and $\mu_{2}$, respectively. Point $\mu_{1}$ lies on the boundary of $C$, while $\mu_{2}$ is infeasible; $\alpha$ is a restitution parameter. (B) When the vector field $f$ is pushing away from $C$, a force $-R\in N_{V_{\alpha}}(\dot{\mu})$ is added to the dynamics to ensure $\dot{\mu}\in V_{\alpha}(\mu)$.

In order to account for the velocity constraint $\dot{\mu}\in V_{\alpha}(\mu)$, we augment the dynamics $\dot{\mu}=f$ with a reaction force $R$ that forces the solution to remain within the desired constraints:

$$\dot{\mu}=f+R,\quad\text{with}\ -R\in N_{V_{\alpha}(\mu)}(\dot{\mu}), \tag{2}$$

where $N_{V_{\alpha}(\mu)}(\dot{\mu})$ denotes the normal cone of the set $V_{\alpha}(\mu)$ at $\dot{\mu}$. Due to the scaling of the gradient with $S$, the normal cone is defined with respect to the inner product $\langle a,b\rangle=a^{T}S^{-1}b$, where $a,b\in\mathbb{R}^{E}$ are arbitrary vectors. This has the important effect of guaranteeing that $\mathfrak{L}_{\beta}$ (the Lyapunov function of the unconstrained dynamics) remains a Lyapunov function in the constrained setting and that $\mathfrak{L}_{\beta}(\mu(t))$ is monotonically decreasing along the trajectories of Eq. 2. A detailed derivation is included in SM sup.
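For feasible $\mu\in C$, the monotone decrease can be sketched in a few steps. This is a simplified argument, not the derivation of the SM: it assumes that $S$ is positive definite (all $\mu_{e}>0$), that $\dot{\mu}$ is the projection of $f$ onto $V_{\alpha}(\mu)$ in the inner product $\langle\cdot,\cdot\rangle$ (Eq. 3), and it uses that $V_{\alpha}(\mu)$ is then a closed convex cone containing $0$.

```latex
\frac{\mathrm{d}}{\mathrm{d}t}\mathfrak{L}_{\beta}(\mu(t))
  = \nabla\mathfrak{L}_{\beta}^{T}\,\dot{\mu}
  = -f^{T}S^{-1}\dot{\mu}
  = -\langle f,\dot{\mu}\rangle ,
\qquad
\langle f-\dot{\mu},\,w-\dot{\mu}\rangle\leq 0\quad\forall\,w\in V_{\alpha}(\mu)
\;\overset{w=0}{\Longrightarrow}\;
\langle f,\dot{\mu}\rangle\;\geq\;\langle\dot{\mu},\dot{\mu}\rangle\;\geq\;0 .
```

Combining the two relations gives $\mathrm{d}\mathfrak{L}_{\beta}/\mathrm{d}t\leq-\langle\dot{\mu},\dot{\mu}\rangle\leq 0$. The infeasible case requires the more careful treatment referenced above.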

The addition of $R$ ensures that even if $f$ pushes $\mu$ away from $C$, as shown in Fig. 1(B), the force $R$, which is orthogonal to the set $V_{\alpha}(\mu)$, annihilates the component of $f$ that would lead to a constraint violation and ensures that $\dot{\mu}\in V_{\alpha}(\mu)$. As discussed above, we can therefore conclude that $\mu(0)\in C\Rightarrow\mu(t)\in C$ for all $t\geq 0$ and $\mu(0)\not\in C\Rightarrow\mu(t)\rightarrow C$ for $t\rightarrow\infty$.

In addition, we infer from Fig. 1 that the resulting $\dot{\mu}$ in Eq. 2 is nothing but the projection of $f$ onto the set $V_{\alpha}(\mu)$ and, as a result, we can rewrite $\dot{\mu}$ in the following way:

$$\dot{\mu}:=\operatorname*{arg\,min}_{v\in V_{\alpha}(\mu)}\frac{1}{2}\langle v-f,v-f\rangle\,, \tag{3}$$

which can also be equivalently reformulated as the quadratic program (QP)

$$\dot{\mu}:=\operatorname*{arg\,min}_{v\in V_{\alpha}(\mu)}\frac{1}{2}(v-f)^{T}S^{-1}(v-f)\,. \tag{4}$$

This reformulation is not only useful for numerical computations, but also highlights that the velocity $\dot{\mu}$ is chosen, at each point in time, to match the unconstrained $f$ as closely as possible. Fig. 1(A) visualizes the set $C$ and the set of feasible velocities $V_{\alpha}(\mu_{1})$ and $V_{\alpha}(\mu_{2})$ at points $\mu_{1}$ and $\mu_{2}$, respectively. Point $\mu_{1}$ lies on the boundary of $C$, while $\mu_{2}$ is infeasible. We note that the cone $V_{\alpha}(\mu_{2})$ includes an offset, which is controlled by the restitution parameter $\alpha$; this ensures that any $v\in V_{\alpha}(\mu_{2})$ leads to a decrease in constraint violation. Fig. 1(B) shows that when the vector field $f$ is pushing away from $C$, a force $-R\in N_{V_{\alpha}}(\dot{\mu})$ is added to the dynamics. The force $R$ annihilates the component of $f$ that would lead to a constraint violation and ensures $\dot{\mu}\in V_{\alpha}(\mu)$, where $\dot{\mu}$ is chosen as close as possible to $f$. This can also be interpreted as Gauss’s principle of least constraint. It is important to note that $V_{\alpha}(\mu)$ is a polyhedral set that only includes the constraints $I_{\mu}$, a subset of the original constraints $g(\mu)\geq 0$.
The set $V_{\alpha}(\mu)$ therefore represents a sparse, local and linear approximation of the feasible set. The solution $\dot{\mu}$ of Eq. 3 can then be used to update the conductivity with a discrete-time algorithm:

$$\mu^{t+1}=\mu^{t}+\tau\dot{\mu}\,, \tag{5}$$

where $\tau>0$ is the step size.

This general formalism can be applied to a variety of scenarios, provided one can compute $\nabla g$, which determines the set $V_{\alpha}(\mu)$. We can then solve Eq. 4 by using numerical solvers tailored to QP, which then yields the update Eq. 5. Additional details about the computational complexity of solving Eq. 4 are described in SM sup. However, in important special cases, the optimization in Eq. 4 can be solved in closed form, as we illustrate below with three relevant examples.
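As a simple illustration of when Eq. 4 admits a closed form, consider a single active constraint with gradient $a=\nabla g(\mu)$. The KKT conditions of the QP then give $v=f+\lambda\,S\,a$ with $\lambda=\max\{0,(-\alpha g-a^{T}f)/(a^{T}Sa)\}$. The sketch below implements this formula; it is an illustrative special case rather than a general QP solver, the function name is hypothetical, and it assumes $a^{T}Sa>0$.

```python
import numpy as np

def project_single_constraint(f, S, grad_g, g_val, alpha):
    """Solve Eq. 4 when exactly one constraint g is active.

    f, S, grad_g : (E,) arrays; S holds the diagonal entries S_e
    g_val        : value g(mu) <= 0 of the active constraint
    """
    a = np.asarray(grad_g, dtype=float)
    # Amount by which f fails the velocity constraint a^T v >= -alpha * g.
    shortfall = -alpha * g_val - a @ f
    # Multiplier of the constraint; zero when f is already feasible.
    lam = max(0.0, shortfall / (a @ (S * a)))   # assumes a^T S a > 0
    return f + lam * S * a
```

For a capacity constraint on one edge, `grad_g` is minus a unit vector and the result coincides with taking the minimum of $f_{e}$ and $\alpha\,(c_{e}-\mu_{e})$ on that edge.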

Capacity constraints.

In cases of structural constraints that strictly limit the amount of mass that can travel along any given edge, one can consider capacities $c_{e}\geq 0$ on edges and set constraints as $g_{e}(\mu)=c_{e}-\mu_{e}$. The velocity constraint $v\in V_{\alpha}(\mu)$ in Eq. 3 reads as $v_{e}\leq\alpha g_{e}(\mu_{e})$, for $e\in I_{\mu}$, whose right-hand side is strictly negative whenever the capacity is exceeded, since $\alpha>0$ (SM sup). As previously discussed, $\alpha>0$ is a restitution parameter that dictates the rate at which constraint violations decay. In discrete time, one should choose $\alpha>0$ such that $\alpha\,\tau\leq 1$ to guarantee convergence (see Muehlebach and Jordan (2022)). We can then solve Eq. 3 in closed form for edges violating the constraint, obtaining $v_{e}=\min\left\{\alpha\,(c_{e}-\mu_{e}),{f}_{e}\right\}$. In summary, for each edge $e$, we have:

$$\dot{\mu}_{e}=\begin{cases}\alpha\,(c_{e}-\mu_{e}), & \text{if}\ f_{e}\geq\alpha\,(c_{e}-\mu_{e})\ \text{and}\ \mu_{e}\geq c_{e},\\ f_{e}, & \text{otherwise}.\end{cases} \tag{6}$$
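Eq. 6 combined with the update in Eq. 5 can be sketched in a few lines. This is a minimal illustration with hypothetical function names; the vector `f` is assumed to be given (it would be recomputed from the current flows at every iteration).

```python
import numpy as np

def capacity_velocity(mu, f, c, alpha):
    """Closed-form velocity of Eq. 6 for capacity constraints g_e = c_e - mu_e."""
    bound = alpha * (c - mu)                 # largest admissible velocity on active edges
    active = (mu >= c) & (f >= bound)        # edge at/over capacity and f would violate
    return np.where(active, bound, f)

def capacity_step(mu, f, c, alpha, tau):
    """One explicit update of Eq. 5; choose alpha * tau <= 1 as stated in the text."""
    return mu + tau * capacity_velocity(mu, f, c, alpha)
```

With $\alpha\,\tau=1/2$, an edge that exceeds its capacity by $0.5$ moves halfway back towards $c_{e}$ in one step, while edges below capacity simply follow $f_{e}$.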

Fig. 2 shows the path topologies with capacity constraints on synthetic data, compared against the unconstrained case. We generate random planar networks as the Delaunay triangulation (Guibas and Stolfi, 1985) of $N=300$ points in the plane. We measure the Gini coefficient $Gini(T)$ calculated on the traffic on edges, defined as the $E$-dimensional vector $T$ with entries $T_{e}=\sum_{i}|F_{e}^{i}|/n$, where $n$ is the number of passengers. The coefficient takes values in $[0,1]$ and it determines how traffic is distributed along network edges, with $Gini(T)=0$ and $Gini(T)=1$ meaning equally balanced traffic and highly unbalanced traffic concentrated on few edges, respectively. The choice of the edge capacity $c_{e}$ influences this value, with lower $c_{e}$ imposing a stricter constraint and thus encouraging traffic to distribute more equally across edges, i.e. lower Gini, as shown in Fig. 2(A). In turn, this implies longer routes for passengers, as measured by an increasing average total path length $\langle l\rangle=\sum_{e,i}\,\ell_{e}\,|F_{e}^{i}|/n$ compared to the unconstrained solution, as shown in Fig. 2(B).
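The two metrics can be computed as sketched below. The text does not spell out which Gini estimator is used, so the code assumes the standard sorted-values formula, whose maximum for $E$ edges is $(E-1)/E$ rather than exactly $1$; the function names are hypothetical.

```python
import numpy as np

def edge_traffic(F, n):
    """T_e = sum_i |F_e^i| / n for flows F of shape (E, M) and n passengers."""
    return np.abs(F).sum(axis=1) / n

def gini(x):
    """Standard Gini coefficient of a non-negative vector (assumed estimator)."""
    x = np.sort(np.asarray(x, dtype=float))
    k = x.size
    if k == 0 or x.sum() == 0:
        return 0.0
    ranks = np.arange(1, k + 1)
    return float(((2 * ranks - k - 1) * x).sum() / (k * x.sum()))

def mean_path_length(F, length, n):
    """<l> = sum_{e,i} ell_e |F_e^i| / n."""
    return float((length[:, None] * np.abs(F)).sum() / n)
```

Perfectly balanced traffic gives a coefficient of zero, while traffic concentrated on a single edge out of four gives $0.75$ under this estimator.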

Budget constraint.

As a second example, we consider a global constraint that involves all the edges at once, a budget constraint $g_{b}(\mu)=b\,-\,\sum_{e}\mu_{e}$. This is relevant when a network manager has a fixed limited amount of resources $b>0$ to invest. We note that, while the Lyapunov function $\mathfrak{L}_{\beta}$ contains a similar budget term (the cost to build the infrastructure), this cost is not regarded as a constraint in standard approaches Hu and Cai (2013); Lonardi et al. (2021) but as part of the energy consumption, and the budget $b$ is not a Lagrange multiplier but a measurable constant. Furthermore, unlike the previous case, where including a positivity constraint $\mu_{e}\geq 0$ is optional (but can in principle be imposed as well, see SM sup), here we need to include it explicitly. In the standard OT formalism positivity is ensured, provided $\mu_{e}$ is initialized as a positive quantity. Adding constraints may no longer preserve positivity during the updates; this is the case for the budget constraint, as we observed empirically. Positivity is enforced by adding $g_{p}(\mu)=\mu\geq 0$, i.e. $\mu_{e}\geq 0\,\forall e$.

In this budget constraint setting, the conductivities violate the constraint whenever $\sum_{e}\mu_{e}>b$. We derive a closed-form solution as: $\dot{\mu}_{e}=f_{e}-S_{e}\lambda_{b}$, if $f_{e}-S_{e}\lambda_{b}\geq-\alpha\,\mu_{e}$, and $\dot{\mu}_{e}=-\alpha\,\mu_{e}$ otherwise, where $\lambda_{b}\in\mathbb{R}$, a Lagrange multiplier for the budget constraint, can be numerically determined via fixed-point iteration (SM sup).
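The closed form above can be sketched as follows. Note the swapped-in technique: the fixed-point iteration of the SM is not reproduced here, and $\lambda_{b}$ is instead found by plain bisection on the scalar condition $\sum_{e}\max\{f_{e}-S_{e}\lambda_{b},-\alpha\mu_{e}\}=\alpha\,g_{b}(\mu)$, which is monotone in $\lambda_{b}$. The sketch further assumes $S_{e}>0$, applies the positivity bound on every edge as the closed form suggests, and treats the budget as active only when $g_{b}(\mu)\leq 0$; the function name is hypothetical.

```python
import numpy as np

def budget_velocity(mu, f, S, b, alpha, tol=1e-12, max_iter=200):
    """Velocity under budget and positivity constraints; returns (v, lambda_b)."""
    target = alpha * (b - mu.sum())          # admissible total velocity, alpha * g_b
    lower = -alpha * mu                      # per-edge positivity bound

    def total(lam):
        return np.maximum(f - S * lam, lower).sum()

    if b - mu.sum() > 0 or total(0.0) <= target:
        lam = 0.0                            # budget inactive or already respected
    else:
        lo, hi = 0.0, 1.0
        for _ in range(max_iter):            # grow the bracket until it contains the root
            if total(hi) <= target:
                break
            hi *= 2.0
        for _ in range(max_iter):            # bisection (replaces the SM's fixed point)
            mid = 0.5 * (lo + hi)
            if total(mid) > target:
                lo = mid
            else:
                hi = mid
            if hi - lo < tol:
                break
        lam = hi
    return np.maximum(f - S * lam, lower), lam
```

When the budget is exceeded, the returned velocities sum to $\alpha\,(b-\sum_{e}\mu_{e})$ up to the bisection tolerance, so the excess budget decays at rate $\alpha$.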

Combining linear and non-linear constraints.

All the previous examples considered linear constraints, where it is simple to derive analytical solutions. In general, constraints can be more complicated and thus require numerical methods to solve the constrained QP in Eq. 3. In this scenario, we consider a non-linear budget constraint of the form $g_{\delta}(\mu)=b-\sum_{e}\mu_{e}^{\delta}\geq 0$, where $\delta>0$ is a nonlinearity parameter. Setting $\delta=1$ gives a linear budget constraint as the one discussed earlier. A non-linear example is a volume-preserving constraint where $\delta=1/2$; this is relevant for biological processes such as leaf venation and vascular systems Takamatsu et al. (2017); Ronellenfitsch and Katifori (2016). This non-linear budget induces the velocity constraint $\sum_{e}\delta\mu_{e}^{\delta-1}v_{e}\leq\alpha\,g_{\delta}(\mu)$. In addition, we also consider a capacity constraint as in the first scenario studied above. Overall, three functions are required: i) $g_{\delta}(\mu)$ to impose the non-linear budget constraint; ii) $g_{e}(\mu)$ to impose edge capacity and iii) $g_{p}(\mu)$ to ensure positivity. We derive the closed-form solution as

$$\dot{\mu}_{e}=\begin{cases}\alpha\,(c_{e}-\mu_{e}) & \text{if}\ f_{e}-S_{e}\lambda_{\delta}\,h_{e}\geq\alpha\,(c_{e}-\mu_{e}),\ \mu_{e}\geq c_{e},\\ -\alpha\,\mu_{e} & \text{if}\ f_{e}-S_{e}\lambda_{\delta}\,h_{e}\leq-\alpha\,\mu_{e},\ \mu_{e}\leq 0,\\ f_{e}-S_{e}\lambda_{\delta}\,h_{e} & \text{otherwise},\end{cases} \tag{7}$$

where $h_{e}=\delta\,{\mu}_{e}^{\delta-1}$ and $\lambda_{\delta}>0$. The value of $\lambda_{\delta}$ can be determined numerically using fixed-point iteration (SM sup). The value $\alpha\,(c_{e}-\mu_{e})$ ensures there is no violation of the edge capacity, $-\alpha\,\mu_{e}$ imposes the positivity constraint and ${f}_{e}-S_{e}\lambda_{\delta}h_{e}$ captures the budget violation. Overall, this scenario ensures that the velocity $\dot{\mu}_{e}$ has an upper bound of $\alpha\,(c_{e}-\mu_{e})$ and a lower bound of $-\alpha\,\mu_{e}$. The choice of $\delta$ impacts the topological properties of the resulting network, e.g., the total path length. In the numerical experiments, we set the nonlinearity parameter as $\delta\in(0,1)$.
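The case distinction of Eq. 7 can be evaluated edge by edge once $\lambda_{\delta}$ is known. The sketch below takes $\lambda_{\delta}$ as an input rather than computing it, and the function name is hypothetical. One extra assumption is made for numerical safety: $h_{e}$ is set to zero on edges with $\mu_{e}\leq 0$, where $\mu_{e}^{\delta-1}$ is undefined for $\delta<1$.

```python
import numpy as np

def combined_velocity(mu, f, S, c, lam, delta, alpha):
    """Evaluate Eq. 7 for a given multiplier lam (lambda_delta)."""
    # h_e = delta * mu_e^(delta - 1); set to 0 where mu_e <= 0 (assumption, see lead-in).
    safe_mu = np.where(mu > 0, mu, 1.0)
    h = np.where(mu > 0, delta * safe_mu ** (delta - 1.0), 0.0)
    raw = f - S * lam * h                    # budget-corrected velocity
    upper = alpha * (c - mu)                 # capacity bound
    lower = -alpha * mu                      # positivity bound
    cap_active = (mu >= c) & (raw >= upper)  # first case of Eq. 7
    pos_active = (mu <= 0) & (raw <= lower)  # second case of Eq. 7
    return np.select([cap_active, pos_active], [upper, lower], default=raw)
```

Edges over capacity are pulled back at rate $\alpha$, edges at zero conductivity are prevented from becoming negative, and all remaining edges follow the budget-corrected velocity.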

network.

We examine the topology of various constrained solutions on the road network of the city of Kujala et al. (2018), see Fig. 3(A). This has 640 nodes and 740 edges. As a relevant example, we set the central bus station as the destination node and select the remaining $639$ nodes as origins, but our method still applies to other choices of origin-destination pairs, e.g. peripheral nodes connecting to other peripheral nodes or to various hubs. This can be specified inside Kirchhoff’s law, see SM sup .

Routes generated from the non-linear constraint scenario balance traffic more than the unconstrained case and result in longer routes, see Fig. 3(B-C). Adding a budget constraint for $\beta>1$ results in more distributed traffic (lower Gini) without much increase in the total path length, compared to the unconstrained case. This could be used, for instance, to allocate maintenance or upgrade works to roads under a restricted budget.

Discussion.

Distributing flows in a transportation network is challenging. Approaches based on optimal transport theory are promising, but they are limited by the lack of a mechanism to incorporate realistic constraints. We show how to impose arbitrary constraints on OT problems in a principled and flexible way. The constraints are lifted from a position to a velocity level and are included in the corresponding mirror descent dynamics. This results in a scalable algorithm that solves constrained OT problems in a computationally efficient manner. The algorithm relies on a sparse local approximation of the feasible set at each iteration. Thus, closed-form updates can often be derived, even if the underlying feasible set is nonconvex or nonlinear. Otherwise, one can resort to efficient numerical methods to solve at most a quadratic program. Our physics-based approach is a change of paradigm with regard to how OT problems are modelled and solved numerically. This calls for a generalization of transportation problems in wider scenarios, e.g. in networks with multiple transport modes Ibrahim et al. (2021), with real-time traffic demands Lonardi et al. (2023b) or with noise-induced resonances Folz et al. (2023).

We provide an open source implementation McOpt (2023).

Acknowledgements.

The authors thank the International Max Planck Research School for Intelligent Systems (IMPRS-IS) for supporting AAI. MM thanks the German Research Foundation and the Branco Weiss Fellowship, administered by ETH Zurich, for the support.

Appendix A

Details about setting the constraints.

We define $C:=\{\mu\in\mathbb{R}_{\geq 0}^{E}\ |\ g(\mu)\geq 0\}$ as the set of feasible conductivities $\mu=(\mu_{1},\dots,\mu_{E})$, where $g$ is a constraint function that we assume to be continuously differentiable and $E$ is the number of network edges. We focus on those edges where constraints are not satisfied, and denote the set of active constraints for a given $\mu$ as $I_{\mu}:=\{i\in\mathbb{Z}\ |\ g_{i}(\mu)\leq 0\}$. Interpreting $\mu$ as a “position” variable, a constraint to ensure $\mu(t)\in C,\forall t\geq 0$, can be equivalently formulated as a constraint on its velocity $\dot{\mu}(t)\in T_{C}(\mu(t)),\forall t\geq 0$, with $\mu(0)\in C$, where $T_{C}(\mu)$ denotes the tangent cone of the feasible set at $\mu$, see Rockafellar and Wets (1998).
However, it will be convenient to slightly extend the notion of tangent cone to also account for infeasible initial conditions (this is particularly important for the discretization), which is achieved by imposing $\dot{\mu}(t)\in V_{\alpha}(\mu(t))$, where $V_{\alpha}(\mu):=\{v\in\mathbb{R}^{E}\ |\ \nabla g_{i}(\mu)^{T}v\geq-\alpha\,g_{i}(\mu),i\in I_{\mu}\}$, and $\alpha\geq 0$ is a constant typically referred to as a “restitution” parameter or “slackness”. We note that $V_{\alpha}(\mu)$ generalizes the notion of the tangent cone, since for $\mu\in C$, $V_{\alpha}(\mu)=T_{C}(\mu)$. We assume mild regularity conditions (constraint qualification). A sufficient condition is, for example, the existence of $v\in\mathbb{R}^{E}$ such that $\nabla g_{i}(\mu)^{T}v>0$ for all $i\in I_{\mu}$.
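The definitions of the active set and of the velocity set can be sketched in a few lines of code. The example below is only an illustration under stated assumptions: the per-edge capacity constraints $g_{i}(\mu)=c_{i}-\mu_{i}$, the capacities `c`, the value of `alpha`, and the function names are hypothetical and are not taken from the paper or its implementation.

```python
def active_set(g_values):
    """Indices i with g_i(mu) <= 0, i.e. the active set I_mu."""
    return [i for i, g in enumerate(g_values) if g <= 0.0]


def in_velocity_set(v, g_values, g_grads, alpha, tol=1e-12):
    """Check grad g_i(mu)^T v >= -alpha * g_i(mu) for every active i."""
    for i in active_set(g_values):
        slope = sum(gi * vi for gi, vi in zip(g_grads[i], v))
        if slope < -alpha * g_values[i] - tol:
            return False
    return True


# Hypothetical setup: E = 3 edges with capacities c_i, g_i(mu) = c_i - mu_i.
c = [1.0, 1.0, 1.0]
mu = [0.5, 1.0, 1.2]  # edge 1 sits on the boundary, edge 2 is infeasible
g_values = [ci - mi for ci, mi in zip(c, mu)]
g_grads = [[-1.0, 0.0, 0.0], [0.0, -1.0, 0.0], [0.0, 0.0, -1.0]]
alpha = 2.0

idx = active_set(g_values)   # edges 1 and 2
v_ok = [1.0, 0.0, -0.5]      # shrinks edge 2 fast enough, leaves edge 1 fixed
v_bad = [1.0, 0.1, -0.5]     # pushes edge 1 out of the feasible set
print(idx, in_velocity_set(v_ok, g_values, g_grads, alpha),
      in_velocity_set(v_bad, g_values, g_grads, alpha))
```

Only the active constraints enter the check, so the inactive edge 0 can move freely. This is what keeps the local approximation of the feasible set sparse.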

For $\mu(t)\not\in C$, the constraint $\dot{\mu}(t)\in V_{\alpha}(\mu(t))$ is equivalent to $\text{d}g_{i}(\mu(t))/\text{d}t\geq-\alpha g_{i}(\mu(t))$, $i\in I_{\mu(t)}$, which ensures that potential constraint violations decay at the rate $\alpha>0$.
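The decay claim follows from a standard integrating-factor argument, sketched here for a single constraint $i$ that stays active on the interval $[0,t]$. The shorthand $h(t):=g_{i}(\mu(t))$ is introduced only for this derivation.

```latex
\begin{align*}
\dot{h}(t) \geq -\alpha\, h(t)
  \;&\Longrightarrow\;
  \frac{\text{d}}{\text{d}t}\Big(e^{\alpha t}\, h(t)\Big)
  = e^{\alpha t}\big(\dot{h}(t) + \alpha\, h(t)\big) \geq 0 \\
  \;&\Longrightarrow\;
  e^{\alpha t}\, h(t) \geq h(0)
  \;\Longrightarrow\;
  h(t) \geq h(0)\, e^{-\alpha t}.
\end{align*}
```

For an initial violation $h(0)<0$ this gives $|h(t)|\leq|h(0)|\,e^{-\alpha t}$, so the violation shrinks at least exponentially with rate $\alpha$.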

Details about our method.

From a variational optimization perspective, our approach is related to successive linear and sequential quadratic programming Nocedal and Wright (2006); Bertsekas (1999); Luenberger and Ye (2016). The underlying idea of these methods is to linearize the objective function and the constraints about the current iterate and to solve a local linear and/or quadratic program. Our work improves upon these ideas and tailors these to optimal transport problems in the following way: i) we linearize a subset of constraints at every iteration, which means that the subproblem Eq. 3 typically includes very few constraints and can be solved efficiently; ii) we introduce a non-Euclidean inner product that is adapted to optimal transport problems and is used to show that $\mathfrak{L}_{\beta}$ is a Lyapunov function; iii) we provide closed-form updates in various problem instances that are practically relevant.
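As a generic illustration of why a single linearized constraint admits a closed-form update, the sketch below projects an unconstrained velocity `v0` onto a half-space $a^{T}v\geq b$ in a diagonally weighted inner product. This is not the subproblem Eq. 3 itself: the diagonal weights `w`, the sum-type budget constraint $g(\mu)=B-\sum_{e}\mu_{e}$, and all numerical values are assumptions chosen for illustration.

```python
def project_velocity(v0, a, b, w):
    """Minimize 0.5 * sum_e w[e] * (v[e] - v0[e])**2 subject to a^T v >= b.

    Assumes a is nonzero and all weights w[e] are positive.
    """
    slack = b - sum(ai * vi for ai, vi in zip(a, v0))
    if slack <= 0.0:
        return list(v0)  # v0 already satisfies the linearized constraint
    denom = sum(ai * ai / wi for ai, wi in zip(a, w))
    lam = slack / denom  # Lagrange multiplier of the single constraint
    return [vi + lam * ai / wi for vi, ai, wi in zip(v0, a, w)]


# Hypothetical budget constraint g(mu) = B - sum(mu) >= 0, currently violated.
B, alpha = 5.0, 1.0
mu = [1.0, 2.0, 3.0]
g = B - sum(mu)              # -1.0, so the constraint is active
a = [-1.0, -1.0, -1.0]       # gradient of g with respect to mu
b = -alpha * g               # right-hand side of the velocity constraint
v0 = [0.3, 0.0, -0.3]        # unconstrained velocity (made-up values)
w = [1.0, 2.0, 4.0]          # made-up diagonal metric weights

v = project_velocity(v0, a, b, w)
print(v, sum(ai * vi for ai, vi in zip(a, v)))
```

With several active constraints the same problem becomes a small quadratic program, which matches the remark in the discussion that one solves at most a quadratic program when no closed form is available.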

References

Arnott and Small (1994) R. Arnott and K. Small, American scientist 82, 446 (1994).
Commission (2011) E. Commission, “White paper of 28 march 2011: “roadmap to a single european transport area—towards a competitive and resource efficient transport system”,” (2011).
Commission (2020) E. Commission, “Strategic plan 2020-2024,” (2020).
Ahuja et al. (1988) R. K. Ahuja, T. L. Magnanti, and J. B. Orlin, “Network flows. alfred p. sloan school of management,” (1988).
Morris and Barthelemy (2012) R. G. Morris and M. Barthelemy, Physical review letters 109, 128703 (2012).
Gao et al. (2019) L. Gao, P. Shu, M. Tang, W. Wang, and H. Gao, Physical Review E 100, 012310 (2019).
Noh and Rieger (2002) J. D. Noh and H. Rieger, Physical Review E 66, 066127 (2002).
Dobrin and Duxbury (2001) R. Dobrin and P. Duxbury, Physical Review Letters 86, 5076 (2001).
Bayati et al. (2008) M. Bayati, C. Borgs, A. Braunstein, J. Chayes, A. Ramezanpour, and R. Zecchina, Physical review letters 101, 037208 (2008).
Yeung and Saad (2012) C. H. Yeung and D. Saad, Physical review letters 108, 208701 (2012).
Yeung et al. (2013) C. H. Yeung, D. Saad, and K. M. Wong, Proceedings of the National Academy of Sciences 110, 13717 (2013).
Solé-Ribalta et al. (2016) A. Solé-Ribalta, S. Gómez, and A. Arenas, Physical review letters 116, 108701 (2016).
Gómez-Gardenes and Latora (2008) J. Gómez-Gardenes and V. Latora, Physical Review E 78, 065102 (2008).
Lacasa et al. (2009) L. Lacasa, M. Cea, and M. Zanin, Physica A: Statistical Mechanics and its Applications 388, 3948 (2009).
Sneppen et al. (2005) K. Sneppen, A. Trusina, and M. Rosvall, Europhysics Letters 69, 853 (2005).
Rosvall et al. (2005) M. Rosvall, A. Grönlund, P. Minnhagen, and K. Sneppen, Physical Review E 72, 046117 (2005).
Zhao et al. (2005) L. Zhao, Y.-C. Lai, K. Park, and N. Ye, Physical Review E 71, 026125 (2005).
Estrada et al. (2023) E. Estrada, J. Gómez-Gardeñes, and L. Lacasa, Proceedings of the National Academy of Sciences 120, e2305001120 (2023).
Tero et al. (2010) A. Tero, S. Takagi, T. Saigusa, K. Ito, D. P. Bebber, M. D. Fricker, K. Yumiki, R. Kobayashi, and T. Nakagaki, Science 327, 439 (2010).
Hu and Cai (2013) D. Hu and D. Cai, Physical review letters 111, 138701 (2013).
Ronellenfitsch and Katifori (2016) H. Ronellenfitsch and E. Katifori, Physical review letters 117, 138301 (2016).
Santambrogio (2015) F. Santambrogio, Birkhäuser, NY 55, 94 (2015).
Baptista et al. (2020) D. Baptista, D. Leite, E. Facca, M. Putti, and C. De Bacco, Scientific reports 10, 20806 (2020).
Leite and De Bacco (2022) D. Leite and C. De Bacco, arXiv preprint arXiv:2209.06751 (2022).
Bonifaci et al. (2012) V. Bonifaci, K. Mehlhorn, and G. Varma, Journal of theoretical biology 309, 121 (2012).
Lonardi et al. (2021) A. Lonardi, E. Facca, M. Putti, and C. De Bacco, Physical Review Research 3, 043010 (2021).
Bohn and Magnasco (2007) S. Bohn and M. O. Magnasco, Physical review letters 98, 088702 (2007).
Ibrahim et al. (2021) A. A. Ibrahim, A. Lonardi, and C. D. Bacco, Algorithms 14, 189 (2021).
Lonardi et al. (2023a) A. Lonardi, D. Baptista, and C. De Bacco, Frontiers in Physics 11, 65 (2023a).
Lanczos (1949) C. Lanczos, The Variational Principles of Mechanics (University of Toronto Press, 1949).
Bonifaci et al. (2022) V. Bonifaci, E. Facca, F. Folz, A. Karrenbauer, P. Kolev, K. Mehlhorn, G. Morigi, G. Shahkarami, and Q. Vermande, Theoretical Computer Science 920, 1 (2022).
(32) See Supporting Material, which includes Refs. [27-31], for additional information about the derivations and a detailed discussion of the numerical simulations.
Ibrahim et al. (2022) A. A. Ibrahim, D. Leite, and C. De Bacco, Physical Review E 105, 064302 (2022).
Davis (2004) T. A. Davis, ACM Trans. Math. Softw. 30, 196–199 (2004).
Briggs et al. (2000) W. L. Briggs, V. E. Henson, and S. F. McCormick, A multigrid tutorial (SIAM, 2000).
Nesterov (2004) Y. Nesterov, Introductory lectures on convex optimization: A basic course (Springer Science & Business Media LLC, 2004).
Bonifaci (2021) V. Bonifaci, Computational Optimization and Applications 79, 441 (2021).
Facca and Benzi (2021) E. Facca and M. Benzi, SIAM Journal on Scientific Computing 43, A2295 (2021).
Katifori et al. (2010) E. Katifori, G. J. Szöllősi, and M. O. Magnasco, Physical review letters 104, 048704 (2010).
Banavar et al. (2000) J. R. Banavar, F. Colaiori, A. Flammini, A. Maritan, and A. Rinaldo, Physical Review Letters 84, 4745 (2000).
Aubin and Cellina (1984) J. P. Aubin and A. Cellina, Differential Inclusions (Springer, 1984).
Muehlebach and Jordan (2022) M. Muehlebach and M. I. Jordan, Journal of Machine Learning Research 23, 1 (2022).
Guibas and Stolfi (1985) L. Guibas and J. Stolfi, ACM transactions on graphics (TOG) 4, 74 (1985).
Takamatsu et al. (2017) A. Takamatsu, T. Gomi, T. Endo, T. Hirai, and T. Sasaki, Journal of Physics D: Applied Physics 50, 154003 (2017).
Kujala et al. (2018) R. Kujala, C. Weckström, R. K. Darst, M. N. Mladenović, and J. Saramäki, Scientific data 5, 1 (2018).
Lonardi et al. (2023b) A. Lonardi, E. Facca, M. Putti, and C. De Bacco, Physical Review E 107, 024302 (2023b).
Folz et al. (2023) F. Folz, K. Mehlhorn, and G. Morigi, Physical Review Letters 130, 267401 (2023).
McOpt (2023) McOpt, https://github.com/aleable/McOpt (2023).
Rockafellar and Wets (1998) R. T. Rockafellar and R. J.-B. Wets, Variational Analysis (Springer Verlag Berlin-Heidelberg, 1998).
Nocedal and Wright (2006) J. Nocedal and S. J. Wright, Numerical Optimization, 2nd ed. (Springer, 2006).
Bertsekas (1999) D. P. Bertsekas, Nonlinear Programming, 2nd ed. (Athena Scientific, 1999).
Luenberger and Ye (2016) D. G. Luenberger and Y. Ye, Linear and Nonlinear Programming, 4th ed. (Springer, 2016).