Search | arXiv e-print repository

Showing 1–7 of 7 results for author: Krasowski, H

  1. arXiv:2406.03704  [pdf, other]

    cs.LG eess.SY

    Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking

    Authors: Roland Stolz, Hanna Krasowski, Jakob Thumm, Michael Eichelbeck, Philipp Gassert, Matthias Althoff

    Abstract: Continuous action spaces in reinforcement learning (RL) are commonly defined as interval sets. While intervals usually reflect the action boundaries for tasks well, they can be challenging for learning because the typically large global action space leads to frequent exploration of irrelevant actions. Yet, little task knowledge can be sufficient to identify significantly smaller state-specific set…

    Submitted 5 June, 2024; originally announced June 2024.

  2. Provable Traffic Rule Compliance in Safe Reinforcement Learning on the Open Sea

    Authors: Hanna Krasowski, Matthias Althoff

    Abstract: For safe operation, autonomous vehicles have to obey traffic rules that are set forth in legal documents formulated in natural language. Temporal logic is a suitable concept to formalize such traffic rules. Still, temporal logic rules often result in constraints that are hard to solve using optimization-based motion planners. Reinforcement learning (RL) is a promising method to find motion plans f…

    Submitted 16 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  3. arXiv:2307.01917  [pdf, other]

    eess.SY cs.AI cs.RO

    Stranding Risk for Underactuated Vessels in Complex Ocean Currents: Analysis and Controllers

    Authors: Andreas Doering, Marius Wiggert, Hanna Krasowski, Manan Doshi, Pierre F. J. Lermusiaux, Claire J. Tomlin

    Abstract: Low-propulsion vessels can take advantage of powerful ocean currents to navigate towards a destination. Recent results demonstrated that vessels can reach their destination with high probability despite forecast errors. However, these results do not consider the critical aspect of safety of such vessels: because of their low propulsion which is much smaller than the magnitude of currents, they mig…

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: 6 pages, 3 figures, submitted to the 2023 IEEE 62nd Annual Conference on Decision and Control (CDC). Andreas Doering and Marius Wiggert contributed equally to this work.

  4. arXiv:2307.01916  [pdf, other]

    eess.SY cs.AI cs.RO

    Maximizing Seaweed Growth on Autonomous Farms: A Dynamic Programming Approach for Underactuated Systems Navigating on Uncertain Ocean Currents

    Authors: Matthias Killer, Marius Wiggert, Hanna Krasowski, Manan Doshi, Pierre F. J. Lermusiaux, Claire J. Tomlin

    Abstract: Seaweed biomass offers significant potential for climate mitigation, but large-scale, autonomous open-ocean farms are required to fully exploit it. Such farms typically have low propulsion and are heavily influenced by ocean currents. We want to design a controller that maximizes seaweed growth over months by taking advantage of the non-linear time-varying ocean currents for reaching high-growth r…

    Submitted 29 August, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: 8 pages, submitted to the 2023 IEEE 62nd Annual Conference on Decision and Control (CDC). Matthias Killer and Marius Wiggert contributed equally to this work.

  5. arXiv:2212.06129  [pdf, other]

    cs.RO eess.SY

    Safe Reinforcement Learning with Probabilistic Guarantees Satisfying Temporal Logic Specifications in Continuous Action Spaces

    Authors: Hanna Krasowski, Prithvi Akella, Aaron D. Ames, Matthias Althoff

    Abstract: Vanilla Reinforcement Learning (RL) can efficiently solve complex tasks but does not provide any guarantees on system behavior. To bridge this gap, we propose a three-step safe RL procedure for continuous action spaces that provides probabilistic guarantees with respect to temporal logic specifications. First, our approach probabilistically verifies a candidate controller with respect to a tempora…

    Submitted 28 September, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

  6. arXiv:2210.10691  [pdf, ps, other]

    cs.RO cs.LG eess.SY

    Provably Safe Reinforcement Learning via Action Projection using Reachability Analysis and Polynomial Zonotopes

    Authors: Niklas Kochdumper, Hanna Krasowski, Xiao Wang, Stanley Bak, Matthias Althoff

    Abstract: While reinforcement learning produces very promising results for many applications, its main disadvantage is the lack of safety guarantees, which prevents its use in safety-critical systems. In this work, we address this issue by a safety shield for nonlinear continuous systems that solve reach-avoid tasks. Our safety shield prevents applying potentially unsafe actions from a reinforcement learnin…

    Submitted 14 March, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

  7. arXiv:2205.06750  [pdf, other]

    cs.LG

    Provably Safe Reinforcement Learning: Conceptual Analysis, Survey, and Benchmarking

    Authors: Hanna Krasowski, Jakob Thumm, Marlon Müller, Lukas Schäfer, Xiao Wang, Matthias Althoff

    Abstract: Ensuring the safety of reinforcement learning (RL) algorithms is crucial to unlock their potential for many real-world tasks. However, vanilla RL and most safe RL approaches do not guarantee safety. In recent years, several methods have been proposed to provide hard safety guarantees for RL, which is essential for applications where unsafe actions could have disastrous consequences. Nevertheless,…

    Submitted 18 November, 2023; v1 submitted 13 May, 2022; originally announced May 2022.

    Comments: The published paper is available at https://openreview.net/forum?id=mcN0ezbnzO

    Journal ref: Transactions on Machine Learning Research, 2023