(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–32 of 32 results for author: Muehlebach, M

.
  1. arXiv:2407.08432  [pdf, other

    cs.LG

    Subgroup-Specific Risk-Controlled Dose Estimation in Radiotherapy

    Authors: Paul Fischer, Hannah Willms, Moritz Schneider, Daniela Thorwarth, Michael Muehlebach, Christian F. Baumgartner

    Abstract: Cancer remains a leading cause of death, highlighting the importance of effective radiotherapy (RT). Magnetic resonance-guided linear accelerators (MR-Linacs) enable imaging during RT, allowing for inter-fraction, and perhaps even intra-fraction, adjustments of treatment plans. However, achieving this requires fast and accurate dose calculations. While Monte Carlo simulations offer accuracy, they… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: This work was accepted as a full paper at MICCAI 2024

  2. arXiv:2405.18100  [pdf, other

    cs.LG math.OC

    A Pontryagin Perspective on Reinforcement Learning

    Authors: Onno Eberhard, Claire Vernade, Michael Muehlebach

    Abstract: Reinforcement learning has traditionally focused on learning state-dependent policies to solve optimal control problems in a closed-loop fashion. In this work, we introduce the paradigm of open-loop reinforcement learning where a fixed action sequence is learned instead. We present three new algorithms: one robust model-based method and two sample-efficient model-free methods. Rather than basing o… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  3. arXiv:2405.10618  [pdf, other

    cs.LG math.OC stat.ML

    Distributed Event-Based Learning via ADMM

    Authors: Guner Dilsad Er, Sebastian Trimpe, Michael Muehlebach

    Abstract: We consider a distributed learning problem, where agents minimize a global objective function by exchanging information over a network. Our approach has two distinct features: (i) It substantially reduces communication by triggering communication only when necessary, and (ii) it is agnostic to the data-distribution among the different agents. We can therefore guarantee convergence even if the loca… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 29 pages, 12 figures

  4. arXiv:2404.05318  [pdf, other

    cs.LG cs.RO

    Stochastic Online Optimization for Cyber-Physical and Robotic Systems

    Authors: Hao Ma, Melanie Zeilinger, Michael Muehlebach

    Abstract: We propose a novel gradient-based online optimization framework for solving stochastic programming problems that frequently arise in the context of cyber-physical and robotic systems. Our problem formulation accommodates constraints that model the evolution of a cyber-physical system, which has, in general, a continuous state and action space, is nonlinear, and where the state is only partially ob… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 46 pages, 16 figures

  5. arXiv:2404.04355  [pdf, other

    math.OC eess.SY

    Gray-Box Nonlinear Feedback Optimization

    Authors: Zhiyu He, Saverio Bolognani, Michael Muehlebach, Florian Dörfler

    Abstract: Feedback optimization enables autonomous optimality seeking of a dynamical system through its closed-loop interconnection with iterative optimization algorithms. Among various iteration structures, model-based approaches require the input-output sensitivity of the system to construct gradients, whereas model-free approaches bypass this need by estimating gradients from real-time evaluations of the… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  6. arXiv:2403.12859  [pdf, other

    math.OC cs.LG stat.ML

    Primal Methods for Variational Inequality Problems with Functional Constraints

    Authors: Liang Zhang, Niao He, Michael Muehlebach

    Abstract: Constrained variational inequality problems are recognized for their broad applications across various fields including machine learning and operations research. First-order methods have emerged as the standard approach for solving these problems due to their simplicity and scalability. However, they typically rely on projection or linear minimization oracles to navigate the feasible set, which be… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  7. arXiv:2402.06012  [pdf, other

    eess.SY

    Balancing a 3D Inverted Pendulum using Remote Magnetic Manipulation

    Authors: Jasan Zughaibi, Bradley J. Nelson, Michael Muehlebach

    Abstract: Remote magnetic manipulation offers wireless control over magnetic objects, which has important medical applications, such as targeted drug delivery and minimally invasive surgeries. Magnetic manipulation systems are categorized into systems using permanent magnets and systems based on electromagnets. Electro-Magnetic Navigation Systems (eMNSs) are believed to have a superior actuation bandwidth,… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  8. arXiv:2401.14029  [pdf, other

    math.OC cs.LG eess.SY

    Towards a Systems Theory of Algorithms

    Authors: Florian Dörfler, Zhiyu He, Giuseppe Belgioioso, Saverio Bolognani, John Lygeros, Michael Muehlebach

    Abstract: Traditionally, numerical algorithms are seen as isolated pieces of code confined to an {\em in silico} existence. However, this perspective is not appropriate for many modern computational approaches in control, learning, or optimization, wherein {\em in vivo} algorithms interact with their environment. Examples of such {\em open algorithms} include various real-time optimization-based control str… ▽ More

    Submitted 30 April, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

  9. arXiv:2310.07665  [pdf, other

    cs.AI cs.LG stat.ML

    Deep Backtracking Counterfactuals for Causally Compliant Explanations

    Authors: Klaus-Rudolf Kladny, Julius von Kügelgen, Bernhard Schölkopf, Michael Muehlebach

    Abstract: Counterfactuals answer questions of what would have been observed under altered circumstances and can therefore offer valuable insights. Whereas the classical interventional interpretation of counterfactuals has been studied extensively, backtracking constitutes a less studied alternative where all causal laws are kept intact. In the present work, we introduce a practical method called deep backtr… ▽ More

    Submitted 9 February, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  10. arXiv:2309.04727  [pdf, other

    physics.soc-ph physics.class-ph

    Optimal transport with constraints: from mirror descent to classical mechanics

    Authors: Abdullahi Adinoyi Ibrahim, Michael Muehlebach, Caterina De Bacco

    Abstract: Finding optimal trajectories for multiple traffic demands in a congested network is a challenging task. Optimal transport theory is a principled approach that has been used successfully to study various transportation problems. Its usage is limited by the lack of principled and flexible ways to incorporate realistic constraints. We propose a principled physics-based approach to impose constraints… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

    Comments: 14 pages, 8 figures

  11. arXiv:2308.14562  [pdf, other

    cs.RO eess.SY

    Data-Efficient Online Learning of Ball Placement in Robot Table Tennis

    Authors: Philip Tobuschat, Hao Ma, Dieter Büchler, Bernhard Schölkopf, Michael Muehlebach

    Abstract: We present an implementation of an online optimization algorithm for hitting a predefined target when returning ping-pong balls with a table tennis robot. The online algorithm optimizes over so-called interception policies, which define the manner in which the robot arm intercepts the ball. In our case, these are composed of the state of the robot arm (position and velocity) at interception time.… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: 7 pages, 6 figures, to be published in proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2023

  12. arXiv:2307.02654  [pdf, other

    cs.RO

    Safe & Accurate at Speed with Tendons: A Robot Arm for Exploring Dynamic Motion

    Authors: Simon Guist, Jan Schneider, Hao Ma, Le Chen, Vincent Berenz, Julian Martus, Heiko Ott, Felix Grüninger, Michael Muehlebach, Jonathan Fiene, Bernhard Schölkopf, Dieter Büchler

    Abstract: Operating robots precisely and at high speeds has been a long-standing goal of robotics research. Balancing these competing demands is key to enabling the seamless collaboration of robots and humans and increasing task performance. However, traditional motor-driven systems often fall short in this balancing act. Due to their rigid and often heavy design exacerbated by positioning the motors into t… ▽ More

    Submitted 2 July, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

  13. arXiv:2306.06002  [pdf, other

    stat.ME cs.AI

    Causal Effect Estimation from Observational and Interventional Data Through Matrix Weighted Linear Estimators

    Authors: Klaus-Rudolf Kladny, Julius von Kügelgen, Bernhard Schölkopf, Michael Muehlebach

    Abstract: We study causal effect estimation from a mixture of observational and interventional data in a confounded linear regression model with multivariate treatments. We show that the statistical efficiency in terms of expected squared error can be improved by combining estimators arising from both the observational and interventional setting. To this end, we derive methods based on matrix weighted linea… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Journal ref: UAI 2023

  14. arXiv:2306.03655  [pdf, other

    cs.LG math.OC

    Online Learning under Adversarial Nonlinear Constraints

    Authors: Pavel Kolev, Georg Martius, Michael Muehlebach

    Abstract: In many applications, learning systems are required to process continuous non-stationary data streams. We study this problem in an online learning framework and propose an algorithm that can deal with adversarial time-varying and nonlinear constraints. As we show in our work, the algorithm called Constraint Violation Velocity Projection (CVV-Pro) achieves $\sqrt{T}$ regret and converges to the fea… ▽ More

    Submitted 13 October, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  15. arXiv:2305.15189  [pdf, other

    cs.RO cs.LG eess.SY

    Black-Box vs. Gray-Box: A Case Study on Learning Table Tennis Ball Trajectory Prediction with Spin and Impacts

    Authors: Jan Achterhold, Philip Tobuschat, Hao Ma, Dieter Buechler, Michael Muehlebach, Joerg Stueckler

    Abstract: In this paper, we present a method for table tennis ball trajectory filtering and prediction. Our gray-box approach builds on a physical model. At the same time, we use data to learn parameters of the dynamics model, of an extended Kalman filter, and of a neural model that infers the ball's initial condition. We demonstrate superior prediction performance of our approach over two black-box approac… ▽ More

    Submitted 12 June, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted for publication at the 5th Annual Conference on Learning for Dynamics and Control (L4DC) 2023 (camera-ready). With supplementary material

  16. arXiv:2305.08536  [pdf, other

    math.OC

    A Dynamical Systems Perspective on Discrete Optimization

    Authors: Tong Guanchun, Michael Muehlebach

    Abstract: We discuss a dynamical systems perspective on discrete optimization. Departing from the fact that many combinatorial optimization problems can be reformulated as finding low energy spin configurations in corresponding Ising models, we derive a penalized rank-two relaxation of the Ising formulation. It turns out that the associated gradient flow dynamics exactly correspond to a type of hardware sol… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  17. arXiv:2304.03321  [pdf, other

    cs.LG eess.SY

    Adaptive Decision-Making with Constraints and Dependent Losses: Performance Guarantees and Applications to Online and Nonlinear Identification

    Authors: Michael Muehlebach

    Abstract: We consider adaptive decision-making problems where an agent optimizes a cumulative performance objective by repeatedly choosing among a finite set of options. Compared to the classical prediction-with-expert-advice set-up, we consider situations where losses are constrained and derive algorithms that exploit the additional structure in optimal and computationally efficient ways. Our algorithm and… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: 8 pages

  18. arXiv:2303.09261  [pdf, other

    math.OC stat.ML

    Orthogonal Directions Constrained Gradient Method: from non-linear equality constraints to Stiefel manifold

    Authors: Sholom Schechtman, Daniil Tiapkin, Michael Muehlebach, Eric Moulines

    Abstract: We consider the problem of minimizing a non-convex function over a smooth manifold $\mathcal{M}$. We propose a novel algorithm, the Orthogonal Directions Constrained Gradient Method (ODCGM) which only requires computing a projection onto a vector space. ODCGM is infeasible but the iterates are constantly pulled towards the manifold, ensuring the convergence of ODCGM towards $\mathcal{M}$. ODCGM is… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

  19. arXiv:2302.00316  [pdf, other

    math.OC cs.LG eess.SP stat.ML

    Accelerated First-Order Optimization under Nonlinear Constraints

    Authors: Michael Muehlebach, Michael I. Jordan

    Abstract: We exploit analogies between first-order algorithms for constrained optimization and non-smooth dynamical systems to design a new class of accelerated first-order algorithms for constrained optimization. Unlike Frank-Wolfe or projected gradients, these algorithms avoid optimization over the entire feasible set at each iteration. We prove convergence to stationary points even in a nonconvex setting… ▽ More

    Submitted 2 January, 2024; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: 44 pages, 6 figures

  20. arXiv:2212.05781  [pdf, ps, other

    cs.LG eess.SY

    Robust Recurrent Neural Network to Identify Ship Motion in Open Water with Performance Guarantees -- Technical Report

    Authors: Daniel Frank, Decky Aspandi Latif, Michael Muehlebach, Benjamin Unger, Steffen Staab

    Abstract: Recurrent neural networks are capable of learning the dynamics of an unknown nonlinear system purely from input-output measurements. However, the resulting models do not provide any stability guarantees on the input-output mapping. In this work, we represent a recurrent neural network as a linear time-invariant system with nonlinear disturbances. By introducing constraints on the parameters, we ca… ▽ More

    Submitted 16 December, 2022; v1 submitted 12 December, 2022; originally announced December 2022.

  21. arXiv:2207.12992  [pdf, other

    stat.ME stat.AP

    Risk-Adjusted Incidence Modeling on Hierarchical Survival Data with Recurrent Events

    Authors: Xiaotong Jiang, William Stoudemire, Marianne S. Muhlebach, Michael R. Kosorok

    Abstract: There is a constant need for many healthcare programs to timely address problems with infection prevention and control (IP&C). For example, pathogens can be transmitted among patients with cystic fibrosis (CF) in both the inpatient and outpatient settings within the healthcare system even with the existing recommended IP&C practices, and these pathogens are often associated with negative clinical… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

  22. arXiv:2206.02953  [pdf, other

    math.OC cs.GT cs.LG stat.ML

    Sampling without Replacement Leads to Faster Rates in Finite-Sum Minimax Optimization

    Authors: Aniket Das, Bernhard Schölkopf, Michael Muehlebach

    Abstract: We analyze the convergence rates of stochastic gradient algorithms for smooth finite-sum minimax optimization and show that, for many such algorithms, sampling the data points without replacement leads to faster convergence compared to sampling with replacement. For the smooth and strongly convex-strongly concave setting, we consider gradient descent ascent and the proximal point method, and prese… ▽ More

    Submitted 10 October, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  23. arXiv:2107.08225  [pdf, other

    math.OC cs.LG eess.SY

    On Constraints in First-Order Optimization: A View from Non-Smooth Dynamical Systems

    Authors: Michael Muehlebach, Michael I. Jordan

    Abstract: We introduce a class of first-order methods for smooth constrained optimization that are based on an analogy to non-smooth dynamical systems. Two distinctive features of our approach are that (i) projections or optimizations over the entire feasible set are avoided, in stark contrast to projected gradient methods or the Frank-Wolfe method, and (ii) iterates are allowed to become infeasible, which… ▽ More

    Submitted 5 November, 2022; v1 submitted 17 July, 2021; originally announced July 2021.

    Comments: 47 pages, 11 figures

  24. arXiv:2002.12493  [pdf, other

    math.OC math.NA stat.ML

    Optimization with Momentum: Dynamical, Control-Theoretic, and Symplectic Perspectives

    Authors: Michael Muehlebach, Michael I. Jordan

    Abstract: We analyze the convergence rate of various momentum-based optimization algorithms from a dynamical systems point of view. Our analysis exploits fundamental topological properties, such as the continuous dependence of iterates on their initial conditions, to provide a simple characterization of convergence rates. In many cases, closed-form expressions are obtained that relate algorithm parameters t… ▽ More

    Submitted 12 April, 2021; v1 submitted 27 February, 2020; originally announced February 2020.

    Comments: 30 pages; 20 pages appendix and references

  25. arXiv:2002.03546  [pdf, ps, other

    math.OC eess.SY

    Continuous-time Lower Bounds for Gradient-based Algorithms

    Authors: Michael Muehlebach, Michael I. Jordan

    Abstract: This article derives lower bounds on the convergence rate of continuous-time gradient-based optimization algorithms. The algorithms are subjected to a time-normalization constraint that avoids a reparametrization of time in order to make the discussion of continuous-time convergence rates meaningful. We reduce the multi-dimensional problem to a single dimension, recover well-known lower bounds fro… ▽ More

    Submitted 3 August, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

    Comments: 13 pages

  26. arXiv:1908.07109  [pdf, other

    eess.SY

    The Silver Ratio and its Relation to Controllability

    Authors: Michael Muehlebach

    Abstract: This note investigates the controllability of two unstable second-order systems that are coupled through a common input. These dynamics occur for different types of inverted-pendulum systems. Controllability is quantified by the volume of the state-space that can be reached with unit energy, provided that the system starts and ends at the origin. It is shown that controllability is maximized when… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

  27. arXiv:1905.10866  [pdf, other

    physics.comp-ph cs.LG

    Physics-informed Autoencoders for Lyapunov-stable Fluid Flow Prediction

    Authors: N. Benjamin Erichson, Michael Muehlebach, Michael W. Mahoney

    Abstract: In addition to providing high-profile successes in computer vision and natural language processing, neural networks also provide an emerging set of techniques for scientific problems. Such data-driven models, however, typically ignore physical insights from the scientific system under consideration. Among other things, a physics-informed model formulation should encode some degree of stability or… ▽ More

    Submitted 26 May, 2019; originally announced May 2019.

  28. arXiv:1905.07436  [pdf, other

    math.OC cs.LG eess.SY stat.ML

    A Dynamical Systems Perspective on Nesterov Acceleration

    Authors: Michael Muehlebach, Michael I. Jordan

    Abstract: We present a dynamical system framework for understanding Nesterov's accelerated gradient method. In contrast to earlier work, our derivation does not rely on a vanishing step size argument. We show that Nesterov acceleration arises from discretizing an ordinary differential equation with a semi-implicit Euler integration scheme. We analyze both the underlying differential equation as well as the… ▽ More

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: 11 pages, 4 figures, to appear in the Proceedings of the 36th International Conference on Machine Learning

  29. arXiv:1903.07648  [pdf, other

    eess.SY

    A Method for Reducing the Complexity of Model Predictive Control in Robotics Applications

    Authors: Michael Muehlebach, Raffaello D'Andrea

    Abstract: This article describes an approach for parametrizing input and state trajectories in model predictive control. The parametrization is designed to be invariant to time shifts, which enables warm-starting the successive optimization problems and reduces the computational complexity of the online optimization. It is shown that in certain cases (e.g. for linear time-invariant dynamics with input and s… ▽ More

    Submitted 18 March, 2019; originally announced March 2019.

  30. arXiv:1803.05510  [pdf, other

    eess.SY

    On the Approximation of Constrained Linear Quadratic Regulator Problems and their Application to Model Predictive Control - Supplementary Notes

    Authors: Michael Muehlebach, Raffaello D'Andrea

    Abstract: By parametrizing input and state trajectories with basis functions different approximations to the constrained linear quadratic regulator problem are obtained. These notes present and discuss technical results that are intended to supplement a corresponding journal article. The results can be applied in a model predictive control context.

    Submitted 23 February, 2018; originally announced March 2018.

    Comments: 19 pages, 1 figure

  31. Distributed Event-Based State Estimation for Networked Systems: An LMI-Approach

    Authors: Michael Muehlebach, Sebastian Trimpe

    Abstract: In this work, a dynamic system is controlled by multiple sensor-actuator agents, each of them commanding and observing parts of the system's input and output. The different agents sporadically exchange data with each other via a common bus network according to local event-triggering protocols. From these data, each agent estimates the complete dynamic state of the system and uses its estimate for… ▽ More

    Submitted 6 July, 2017; originally announced July 2017.

    Comments: This is an extended version of an article to appear in the IEEE Transactions on Automatic Control (additional parts in the Appendix)

  32. arXiv:1608.08823  [pdf, other

    math.OC eess.SY

    Approximation of Continuous-Time Infinite-Horizon Optimal Control Problems Arising in Model Predictive Control - Supplementary Notes

    Authors: Michael Muehlebach, Raffaello D'Andrea

    Abstract: These notes present preliminary results regarding two different approximations of linear infinite-horizon optimal control problems arising in model predictive control. Input and state trajectories are parametrized with basis functions and a finite dimensional representation of the dynamics is obtained via a Galerkin approach. It is shown that the two approximations provide lower, respectively uppe… ▽ More

    Submitted 31 August, 2016; originally announced August 2016.

    Comments: Supplementary notes, 10 pages