(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 69 results for author: Schmidt, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02310  [pdf, other

    cs.CL

    Evaluating the Ability of LLMs to Solve Semantics-Aware Process Mining Tasks

    Authors: Adrian Rebmann, Fabian David Schmidt, Goran Glavaš, Han van der Aa

    Abstract: The process mining community has recently recognized the potential of large language models (LLMs) for tackling various process mining tasks. Initial studies report the capability of LLMs to support process analysis and even, to some extent, that they are able to reason about how processes work. This latter property suggests that LLMs could also be used to tackle process mining tasks that benefit… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Submitted to ICPM

  2. arXiv:2407.00492  [pdf, ps, other

    cs.LG stat.CO stat.ML

    Fast Gibbs sampling for the local and global trend Bayesian exponential smoothing model

    Authors: Xueying Long, Daniel F. Schmidt, Christoph Bergmeir, Slawek Smyl

    Abstract: In Smyl et al. [Local and global trend Bayesian exponential smoothing models. International Journal of Forecasting, 2024.], a generalised exponential smoothing model was proposed that is able to capture strong trends and volatility in time series. This method achieved state-of-the-art performance in many forecasting tasks, but its fitting procedure, which is based on the NUTS sampler, is very comp… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  3. arXiv:2406.12739  [pdf, other

    cs.CL

    Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages

    Authors: Fabian David Schmidt, Philipp Borchert, Ivan Vulić, Goran Glavaš

    Abstract: LLMs have become a go-to solution not just for text generation, but also for natural language understanding (NLU) tasks. Acquiring extensive knowledge through language modeling on web-scale corpora, they excel on English NLU, yet struggle to extend their NLU capabilities to underrepresented languages. In contrast, machine translation models (MT) produce excellent multilingual representations, resu… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  4. arXiv:2406.12634  [pdf, other

    cs.IR cs.AI

    News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News Recommendation

    Authors: Andreea Iana, Fabian David Schmidt, Goran Glavaš, Heiko Paulheim

    Abstract: Rapidly growing numbers of multilingual news consumers pose an increasing challenge to news recommender systems in terms of providing customized recommendations. First, existing neural news recommenders, even when powered by multilingual language models (LMs), suffer substantial performance losses in zero-shot cross-lingual transfer (ZS-XLT). Second, the current paradigm of fine-tuning the backbon… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    ACM Class: I.2.7; H.3.3

  5. arXiv:2404.19319  [pdf, other

    cs.CL

    Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget

    Authors: Minh Duc Bui, Fabian David Schmidt, Goran Glavaš, Katharina von der Wense

    Abstract: Compared to standard language model (LM) pretraining (i.e., from scratch), Knowledge Distillation (KD) entails an additional forward pass through a teacher model that is typically substantially larger than the target student model. As such, KD in LM pretraining materially slows down throughput of pretraining instances vis-a-vis pretraining from scratch. Scaling laws of LM pretraining suggest that… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Accepted to the 5th Workshop on Insights from Negative Results in NLP at NAACL 2024

  6. arXiv:2401.15610  [pdf, other

    cs.LG stat.ML

    Prevalidated ridge regression is a highly-efficient drop-in replacement for logistic regression for high-dimensional data

    Authors: Angus Dempster, Geoffrey I. Webb, Daniel F. Schmidt

    Abstract: Logistic regression is a ubiquitous method for probabilistic classification. However, the effectiveness of logistic regression depends upon careful and relatively computationally expensive tuning, especially for the regularisation hyperparameter, and especially in the context of high-dimensional data. We present a prevalidated ridge regression model that closely matches logistic regression in term… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: 13 pages, 11 figures

  7. arXiv:2312.12776  [pdf, other

    cs.CE

    Computational homogenization of phase-field fracture

    Authors: Felix Schmidt, Stefan Schuß, Christian Hesch

    Abstract: In this contribution we investigate the application of phase-field fracture models on non-linear multiscale computational homogenization schemes. In particular, we introduce different phase-fields on a two-scale problem and develop a thermodynamically consistent model. This allows on the one hand for the prediction of local micro-fracture patterns, which effectively acts as an anisotropic damage m… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  8. arXiv:2312.12771  [pdf, other

    cs.CE

    Variational formulation and monolithic solution of computational homogenization methods

    Authors: Christian Hesch, Felix Schmidt, Stefan Schuß

    Abstract: In this contribution, we derive a consistent variational formulation for computational homogenization methods and show that traditional FE2 and IGA2 approaches are special discretization and solution techniques of this most general framework. This allows us to enhance dramatically the numerical analysis as well as the solution of the arising algebraic system. In particular, we expand the dimension… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  9. arXiv:2311.15549  [pdf

    cond-mat.mtrl-sci cs.AI cs.LG

    From Prediction to Action: Critical Role of Performance Estimation for Machine-Learning-Driven Materials Discovery

    Authors: Mario Boley, Felix Luong, Simon Teshuva, Daniel F Schmidt, Lucas Foppa, Matthias Scheffler

    Abstract: Materials discovery driven by statistical property models is an iterative decision process, during which an initial data collection is extended with new data proposed by a model-informed acquisition function--with the goal to maximize a certain "reward" over time, such as the maximum property value discovered so far. While the materials science community achieved much progress in developing proper… ▽ More

    Submitted 6 December, 2023; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Simplified notation

  10. arXiv:2311.00993  [pdf, other

    cs.LG

    Scalable Probabilistic Forecasting in Retail with Gradient Boosted Trees: A Practitioner's Approach

    Authors: Xueying Long, Quang Bui, Grady Oktavian, Daniel F. Schmidt, Christoph Bergmeir, Rakshitha Godahewa, Seong Per Lee, Kaifeng Zhao, Paul Condylis

    Abstract: The recent M5 competition has advanced the state-of-the-art in retail forecasting. However, we notice important differences between the competition challenge and the challenges we face in a large e-commerce company. The datasets in our scenario are larger (hundreds of thousands of time series), and e-commerce can afford to have a larger assortment than brick-and-mortar retailers, leading to more i… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  11. arXiv:2310.18860  [pdf, other

    stat.ML cs.LG

    Bayes beats Cross Validation: Efficient and Accurate Ridge Regression via Expectation Maximization

    Authors: Shu Yu Tew, Mario Boley, Daniel F. Schmidt

    Abstract: We present a novel method for tuning the regularization hyper-parameter, $λらむだ$, of a ridge regression that is faster to compute than leave-one-out cross-validation (LOOCV) while yielding estimates of the regression parameters of equal, or particularly in the setting of sparse covariates, superior quality to those obtained by minimising the LOOCV risk. The LOOCV risk can suffer from multiple and bad… ▽ More

    Submitted 2 November, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

  12. arXiv:2310.10532  [pdf, other

    cs.CL

    One For All & All For One: Bypassing Hyperparameter Tuning with Model Averaging For Cross-Lingual Transfer

    Authors: Fabian David Schmidt, Ivan Vulić, Goran Glavaš

    Abstract: Multilingual language models enable zero-shot cross-lingual transfer (ZS-XLT): fine-tuned on sizable source-language task data, they perform the task in target languages without labeled instances. The effectiveness of ZS-XLT hinges on the linguistic proximity between languages and the amount of pretraining data for a language. Because of this, model selection based on source-language validation is… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted to findings of EMNLP 2023

  13. arXiv:2310.09129  [pdf, other

    cs.LG stat.ML

    Computing Marginal and Conditional Divergences between Decomposable Models with Applications

    Authors: Loong Kuan Lee, Geoffrey I. Webb, Daniel F. Schmidt, Nico Piatkowski

    Abstract: The ability to compute the exact divergence between two high-dimensional distributions is useful in many applications but doing so naively is intractable. Computing the alpha-beta divergence -- a family of divergences that includes the Kullback-Leibler divergence and Hellinger distance -- between the joint distribution of two decomposable models, i.e chordal Markov networks, can be done in time ex… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 10 pages, 8 figures, Accepted at the IEEE International Conference on Data Mining (ICDM) 2023

  14. arXiv:2308.03375  [pdf, other

    cs.CV

    VR-based body tracking to stimulate musculoskeletal training

    Authors: M. Neidhardt, S. Gerlach F. N. Schmidt, I. A. K. Fiedler, S. Grube, B. Busse, A. Schlaefer

    Abstract: Training helps to maintain and improve sufficient muscle function, body control, and body coordination. These are important to reduce the risk of fracture incidents caused by falls, especially for the elderly or people recovering from injury. Virtual reality training can offer a cost-effective and individualized training experience. We present an application for the HoloLens 2 to enable musculoske… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: Conference

  15. arXiv:2308.00928  [pdf, other

    cs.LG

    QUANT: A Minimalist Interval Method for Time Series Classification

    Authors: Angus Dempster, Daniel F. Schmidt, Geoffrey I. Webb

    Abstract: We show that it is possible to achieve the same accuracy, on average, as the most accurate existing interval methods for time series classification on a standard set of benchmark datasets using a single type of feature (quantiles), fixed intervals, and an 'off the shelf' classifier. This distillation of interval-based approaches represents a fast and accurate method for time series classification,… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: 26 pages, 20 figures

  16. arXiv:2307.13078  [pdf, other

    cs.LG cs.AI cs.CV

    Adaptive Certified Training: Towards Better Accuracy-Robustness Tradeoffs

    Authors: Zhakshylyk Nurlanov, Frank R. Schmidt, Florian Bernard

    Abstract: As deep learning models continue to advance and are increasingly utilized in real-world systems, the issue of robustness remains a major challenge. Existing certified training methods produce models that achieve high provable robustness guarantees at certain perturbation levels. However, the main problem of such models is a dramatically low standard accuracy, i.e. accuracy on clean unperturbed dat… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: Presented at ICML 2023 workshop "New Frontiers in Adversarial Machine Learning"

  17. arXiv:2305.16834  [pdf, other

    cs.CL

    Free Lunch: Robust Cross-Lingual Transfer via Model Checkpoint Averaging

    Authors: Fabian David Schmidt, Ivan Vulić, Goran Glavaš

    Abstract: Massively multilingual language models have displayed strong performance in zero-shot (ZS-XLT) and few-shot (FS-XLT) cross-lingual transfer setups, where models fine-tuned on task data in a source language are transferred without any or with only a few annotated instances to the target language(s). However, current work typically overestimates model performance as fine-tuned models are frequently… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted To Appear In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics

  18. arXiv:2305.11921  [pdf, other

    stat.ME cs.AI cs.LG cs.PF

    An Approach to Multiple Comparison Benchmark Evaluations that is Stable Under Manipulation of the Comparate Set

    Authors: Ali Ismail-Fawaz, Angus Dempster, Chang Wei Tan, Matthieu Herrmann, Lynn Miller, Daniel F. Schmidt, Stefano Berretti, Jonathan Weber, Maxime Devanne, Germain Forestier, Geoffrey I. Webb

    Abstract: The measurement of progress using benchmarks evaluations is ubiquitous in computer science and machine learning. However, common approaches to analyzing and presenting the results of benchmark comparisons of multiple algorithms over multiple datasets, such as the critical difference diagram introduced by Demšar (2006), have important shortcomings and, we show, are open to both inadvertent and inte… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  19. arXiv:2303.11734  [pdf, other

    cs.LG cs.AI

    Unlocking Layer-wise Relevance Propagation for Autoencoders

    Authors: Kenyu Kobayashi, Renata Khasanova, Arno Schneuwly, Felix Schmidt, Matteo Casserini

    Abstract: Autoencoders are a powerful and versatile tool often used for various problems such as anomaly detection, image processing and machine translation. However, their reconstructions are not always trivial to explain. Therefore, we propose a fast explainability solution by extending the Layer-wise Relevance Propagation method with the help of Deep Taylor Decomposition framework. Furthermore, we introd… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

  20. arXiv:2302.02641  [pdf, other

    astro-ph.EP cs.CV physics.ao-ph physics.ins-det

    Approximation of radiative transfer for surface spectral features

    Authors: Frédéric Schmidt

    Abstract: Remote sensing hyperspectral and more generally spectral instruments are common tools to decipher surface features in Earth and Planetary science. While linear mixture is the most common approximation for compounds detection (mineral, water, ice, etc...), the transfer of light in surface and atmospheric medium are highly non-linear. The exact simulation of non-linearities can be estimated at very… ▽ More

    Submitted 13 April, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: 4 pages, 3 figures, submitted 21st october 2022 to IEEE Geoscience and Remote Sensing Letters

    Journal ref: IEEE Geoscience and Remote Sensing Letters, 2023, 20, 1-3

  21. arXiv:2212.00780  [pdf

    cs.CV cs.AI

    Universe Points Representation Learning for Partial Multi-Graph Matching

    Authors: Zhakshylyk Nurlanov, Frank R. Schmidt, Florian Bernard

    Abstract: Many challenges from natural world can be formulated as a graph matching problem. Previous deep learning-based methods mainly consider a full two-graph matching setting. In this work, we study the more general partial matching problem with multi-graph cycle consistency guarantees. Building on a recent progress in deep learning on graphs, we propose a novel data-driven method (URL) for partial mult… ▽ More

    Submitted 7 December, 2022; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: To appear in AAAI 2023

  22. arXiv:2211.03248  [pdf, other

    stat.ML cs.LG

    Sparse Horseshoe Estimation via Expectation-Maximisation

    Authors: Shu Yu Tew, Daniel F. Schmidt, Enes Makalic

    Abstract: The horseshoe prior is known to possess many desirable properties for Bayesian estimation of sparse parameter vectors, yet its density function lacks an analytic form. As such, it is challenging to find a closed-form solution for the posterior mode. Conventional horseshoe estimators use the posterior mean to estimate the parameters, but these estimates are not sparse. We propose a novel expectatio… ▽ More

    Submitted 6 November, 2022; originally announced November 2022.

  23. arXiv:2205.04164  [pdf, other

    cs.RO

    Robotic Maintenance of Road Infrastructures: The HERON Project

    Authors: Iason Katsamenis, Matthaios Bimpas, Eftychios Protopapadakis, Charalampos Zafeiropoulos, Dimitris Kalogeras, Anastasios Doulamis, Nikolaos Doulamis, Carlos Martín-Portugués Montoliu, Yannis Handanos, Franziska Schmidt, Lionel Ott, Miquel Cantero, Rafael Lopez

    Abstract: Of all public assets, road infrastructure tops the list. Roads are crucial for economic development and growth, providing access to education, health, and employment. The maintenance, repair, and upgrade of roads are therefore vital to road users' health and safety as well as to a well-functioning and prosperous modern economy. The EU-funded HERON project will develop an integrated automated syste… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: 13 pages, 6 figures, 1 table

  24. arXiv:2203.13652  [pdf, other

    cs.LG

    HYDRA: Competing convolutional kernels for fast and accurate time series classification

    Authors: Angus Dempster, Daniel F. Schmidt, Geoffrey I. Webb

    Abstract: We demonstrate a simple connection between dictionary methods for time series classification, which involve extracting and counting symbolic patterns in time series, and methods based on transforming input time series using convolutional kernels, namely ROCKET and its variants. We show that by adjusting a single hyperparameter it is possible to move by degrees between models resembling dictionary… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: 27 pages, 18 figures

  25. arXiv:2203.05362  [pdf, other

    cs.DC

    Efficient Runtime Profiling for Black-box Machine Learning Services on Sensor Streams

    Authors: Soeren Becker, Dominik Scheinert, Florian Schmidt, Odej Kao

    Abstract: In highly distributed environments such as cloud, edge and fog computing, the application of machine learning for automating and optimizing processes is on the rise. Machine learning jobs are frequently applied in streaming conditions, where models are used to analyze data streams originating from e.g. video streams or sensory data. Often the results for particular data samples need to be provided… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: Accepted as a short paper at the 6th IEEE International Conference on Fog and Edge Computing 2022

  26. Towards a trustworthy, secure and reliable enclave for machine learning in a hospital setting: The Essen Medical Computing Platform (EMCP)

    Authors: Hendrik F. R. Schmidt, Jörg Schlötterer, Marcel Bargull, Enrico Nasca, Ryan Aydelott, Christin Seifert, Folker Meyer

    Abstract: AI/Computing at scale is a difficult problem, especially in a health care setting. We outline the requirements, planning and implementation choices as well as the guiding principles that led to the implementation of our secure research computing enclave, the Essen Medical Computing Platform (EMCP), affiliated with a major German hospital. Compliance, data privacy and usability were the immutable r… ▽ More

    Submitted 13 January, 2022; originally announced January 2022.

    Comments: 9 pages, 5 figures, to be published in the proceedings of the 2021 IEEE CogMI conference. Christin Seifert and Folker Meyer are co-senior authors

  27. Computational homogenization of higher-order continua

    Authors: Felix Schmidt, Melanie Krüger, Marc-Andre Keip, Christian Hesch

    Abstract: We introduce a novel computational framework for the multiscale simulation of higher-order continua that allows for the consideration of first-, second- and third- order effects at both micro- and macro-level. In line with classical two-scale approaches, we describe the microstructure via representative volume elements (RVE) that are attached at each integration point of the macroscopic problem. T… ▽ More

    Submitted 7 March, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Journal ref: Int J Numer Methods Eng. 2022;1-31

  28. arXiv:2109.13009  [pdf, other

    cs.DC

    LOS: Local-Optimistic Scheduling of Periodic Model Training For Anomaly Detection on Sensor Data Streams in Meshed Edge Networks

    Authors: Soeren Becker, Florian Schmidt, Lauritz Thamsen, Ana Juan Ferrer, Odej Kao

    Abstract: Anomaly detection is increasingly important to handle the amount of sensor data in Edge and Fog environments, Smart Cities, as well as in Industry 4.0. To ensure good results, the utilized ML models need to be updated periodically to adapt to seasonal changes and concept drifts in the sensor data. Although the increasing resource availability at the edge can allow for in-situ execution of model tr… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: 2nd IEEE International Conference on Autonomic Computing and Self-Organizing Systems - ACSOS 2021

  29. EdgePier: P2P-based Container Image Distribution in Edge Computing Environments

    Authors: Soeren Becker, Florian Schmidt, Odej Kao

    Abstract: Edge and fog computing architectures utilize container technologies in order to offer a lightweight application deployment. Container images are stored in registry services and operated by orchestration platforms to download and start the respective applications on nodes of the infrastructure. During large application rollouts, the connection to the registry is prone to become a bottleneck, which… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: 40th IEEE International Performance Computing and Communications Conference 2021

  30. Multi-Objective Reconstruction Of Software Architecture

    Authors: Frederick Schmidt, Stephen MacDonell, Andy M. Connor

    Abstract: Design erosion is a persistent problem within the software engineering discipline. Software designs tend to deteriorate over time and there is a need for tools and techniques that support software architects when dealing with legacy systems. This paper presents an evaluation of a Search Based Software Engineering (SBSE) approach intended to recover high-level architecture designs of software syste… ▽ More

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: Journal paper, 12 pages, 5 figures, 3 tables

    Journal ref: International Journal of Software Engineering and Knowledge Engineering 28(6)(2018), pp.869-892

  31. arXiv:2103.06026  [pdf, other

    cs.NI

    Towards a Cognitive Compute Continuum: An Architecture for Ad-Hoc Self-Managed Swarms

    Authors: Ana Juan Ferrer, Soeren Becker, Florian Schmidt, Lauritz Thamsen, Odej Kao

    Abstract: In this paper we introduce our vision of a Cognitive Computing Continuum to address the changing IT service provisioning towards a distributed, opportunistic, self-managed collaboration between heterogeneous devices outside the traditional data center boundaries. The focal point of this continuum are cognitive devices, which have to make decisions autonomously using their on-board computation and… ▽ More

    Submitted 10 March, 2021; originally announced March 2021.

    Comments: 8 pages, CCGrid 2021 Cloud2Things Workshop

  32. Towards AIOps in Edge Computing Environments

    Authors: Soeren Becker, Florian Schmidt, Anton Gulenko, Alexander Acker, Odej Kao

    Abstract: Edge computing was introduced as a technical enabler for the demanding requirements of new network technologies like 5G. It aims to overcome challenges related to centralized cloud computing environments by distributing computational resources to the edge of the network towards the customers. The complexity of the emerging infrastructures increases significantly, together with the ramifications of… ▽ More

    Submitted 12 February, 2021; originally announced February 2021.

  33. arXiv:2101.10037  [pdf, other

    cs.LG stat.ML

    Optimizing Convergence for Iterative Learning of ARIMA for Stationary Time Series

    Authors: Kevin Styp-Rekowski, Florian Schmidt, Odej Kao

    Abstract: Forecasting of time series in continuous systems becomes an increasingly relevant task due to recent developments in IoT and 5G. The popular forecasting model ARIMA is applied to a large variety of applications for decades. An online variant of ARIMA applies the Online Newton Step in order to learn the underlying process of the time series. This optimization method has pitfalls concerning the comp… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

  34. arXiv:2101.06054  [pdf, other

    cs.LG cs.SE

    Artificial Intelligence for IT Operations (AIOPS) Workshop White Paper

    Authors: Jasmin Bogatinovski, Sasho Nedelkoski, Alexander Acker, Florian Schmidt, Thorsten Wittkopp, Soeren Becker, Jorge Cardoso, Odej Kao

    Abstract: Artificial Intelligence for IT Operations (AIOps) is an emerging interdisciplinary field arising in the intersection between the research areas of machine learning, big data, streaming analytics, and the management of IT operations. AIOps, as a field, is a candidate to produce the future standard for IT operation management. To that end, AIOps has several challenges. First, it needs to combine sep… ▽ More

    Submitted 15 January, 2021; originally announced January 2021.

    Comments: 8 pages, white paper for the AIOPS 2020 workshop at ICSOC 2020

  35. Multidimensional coupling: A variationally consistent approach to fiber-reinforced materials

    Authors: Ustim Khristenko, Stefan Schuß, Melanie Krüger, Felix Schmidt, Barbara Wohlmuth, Christian Hesch

    Abstract: A novel mathematical model for fiber-reinforced materials is proposed. It is based on a 1-dimensional beam model for the thin fiber structures, a flexible and general 3-dimensional elasticity model for the matrix and an overlapping domain decomposition approach. From a computational point of view, this is motivated by the fact that matrix and fibers can easily meshed independently. Our main intere… ▽ More

    Submitted 10 April, 2021; v1 submitted 7 January, 2021; originally announced January 2021.

  36. arXiv:2012.09478  [pdf, other

    cs.SD cs.CL eess.AS

    The voice of COVID-19: Acoustic correlates of infection

    Authors: Katrin D. Bartl-Pokorny, Florian B. Pokorny, Anton Batliner, Shahin Amiriparian, Anastasia Semertzidou, Florian Eyben, Elena Kramer, Florian Schmidt, Rainer Schönweiler, Markus Wehler, Björn W. Schuller

    Abstract: COVID-19 is a global health crisis that has been affecting many aspects of our daily lives throughout the past year. The symptomatology of COVID-19 is heterogeneous with a severity continuum. A considerable proportion of symptoms are related to pathological changes in the vocal system, leading to the assumption that COVID-19 may also affect voice production. For the very first time, the present st… ▽ More

    Submitted 17 December, 2020; originally announced December 2020.

    Comments: 8 pages

    MSC Class: 68T01 ACM Class: J.3

  37. MINIROCKET: A Very Fast (Almost) Deterministic Transform for Time Series Classification

    Authors: Angus Dempster, Daniel F. Schmidt, Geoffrey I. Webb

    Abstract: Until recently, the most accurate methods for time series classification were limited by high computational complexity. ROCKET achieves state-of-the-art accuracy with a fraction of the computational expense of most existing methods by transforming input time series using random convolutional kernels, and using the transformed features to train a linear classifier. We reformulate ROCKET into a new… ▽ More

    Submitted 14 July, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: 10 pages, 11 figures; Updated to accepted version

  38. arXiv:2012.08175  [pdf, other

    astro-ph.EP astro-ph.IM cs.LG

    Machine Learning for automatic identification of new minor species

    Authors: Frederic Schmidt, Guillaume Cruz Mermy, Justin Erwin, Severine Robert, Lori Neary, Ian R. Thomas, Frank Daerden, Bojan Ristic, Manish R. Patel, Giancarlo Bellucci, Jose-Juan Lopez-Moreno, Ann-Carine Vandaele

    Abstract: One of the main difficulties to analyze modern spectroscopic datasets is due to the large amount of data. For example, in atmospheric transmittance spectroscopy, the solar occultation channel (SO) of the NOMAD instrument onboard the ESA ExoMars2016 satellite called Trace Gas Orbiter (TGO) had produced $\sim$10 millions of spectra in 20000 acquisition sequences since the beginning of the mission in… ▽ More

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: 26 pages, 10 figures

    Journal ref: Quantitative Spectroscopy and Radiative Transfer, 2021, 259, 107361

  39. 3D Bird Reconstruction: a Dataset, Model, and Shape Recovery from a Single View

    Authors: Marc Badger, Yufu Wang, Adarsh Modh, Ammon Perkes, Nikos Kolotouros, Bernd G. Pfrommer, Marc F. Schmidt, Kostas Daniilidis

    Abstract: Automated capture of animal pose is transforming how we study neuroscience and social behavior. Movements carry important social cues, but current methods are not able to robustly estimate pose and shape of animals, particularly for social animals such as birds, which are often occluded by each other and objects in the environment. To address this problem, we first introduce a model and multi-view… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: In ECCV 2020

    ACM Class: I.4.8

    Journal ref: ECCV 2020, vol 12363, pp 1-17

  40. A strain-gradient formulation for fiber reinforced polymers: Hybrid phase-field model for porous-ductile fracture

    Authors: Maik Dittman, Jonathan Schult, Felix Schmidt, Christian Hesch

    Abstract: A novel numerical approach to analyze the mechanical behavior within composite materials including the inelastic regime up to final failure is presented. Therefore, a second-gradient theory is combined with phase-field methods to fracture. In particular, we assume that the polymeric matrix material undergoes ductile fracture, whereas continuously embedded fibers undergo brittle fracture as it is t… ▽ More

    Submitted 19 April, 2021; v1 submitted 21 July, 2020; originally announced July 2020.

  41. arXiv:2007.00147  [pdf, other

    cs.LG stat.ML

    Neural Network Virtual Sensors for Fuel Injection Quantities with Provable Performance Specifications

    Authors: Eric Wong, Tim Schneider, Joerg Schmitt, Frank R. Schmidt, J. Zico Kolter

    Abstract: Recent work has shown that it is possible to learn neural networks with provable guarantees on the output of the model when subject to input perturbations, however these works have focused primarily on defending against adversarial examples for image classifiers. In this paper, we study how these provable guarantees can be naturally applied to other real world settings, namely getting performance… ▽ More

    Submitted 30 June, 2020; originally announced July 2020.

  42. arXiv:2006.08368  [pdf

    cs.CY eess.SP

    Sensor Artificial Intelligence and its Application to Space Systems -- A White Paper

    Authors: Anko Börner, Heinz-Wilhelm Hübers, Odej Kao, Florian Schmidt, Sören Becker, Joachim Denzler, Daniel Matolin, David Haber, Sergio Lucia, Wojciech Samek, Rudolph Triebel, Sascha Eichstädt, Felix Biessmann, Anna Kruspe, Peter Jung, Manon Kok, Guillermo Gallego, Ralf Berger

    Abstract: Information and communication technologies have accompanied our everyday life for years. A steadily increasing number of computers, cameras, mobile devices, etc. generate more and more data, but at the same time we realize that the data can only partially be analyzed with classical approaches. The research and development of methods based on artificial intelligence (AI) made enormous progress in t… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

    Comments: 4 pages. 1st Workshop on Sensor Artificial Intelligence, Apr. 2020, Berlin, Germany

  43. arXiv:2003.02738  [pdf, other

    cs.LG cs.CL stat.ML

    BERT as a Teacher: Contextual Embeddings for Sequence-Level Reward

    Authors: Florian Schmidt, Thomas Hofmann

    Abstract: Measuring the quality of a generated sequence against a set of references is a central problem in many learning frameworks, be it to compute a score, to assign a reward, or to perform discrimination. Despite great advances in model architectures, metrics that scale independently of the number of references are still based on n-gram estimates. We show that the underlying operations, counting words… ▽ More

    Submitted 5 March, 2020; originally announced March 2020.

  44. arXiv:1911.11490  [pdf, ps, other

    cs.IT cs.NI

    Outage Duration in Poisson Networks and its Application to Erasure Codes

    Authors: Udo Schilcher, Siddhartha Borkotoky, Jorge F. Schmidt, Christian Bettstetter

    Abstract: We derive the probability distribution of the link outage duration at a typical receiver in a wireless network with Poisson distributed interferers sending messages with slotted random access over a Rayleigh fading channel. This result is used to analyze the performance of random linear network coding, showing that there is an optimum code rate and that interference correlation affects the decodin… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

  45. arXiv:1910.00292  [pdf, other

    cs.LG cs.CL stat.ML

    Generalization in Generation: A closer look at Exposure Bias

    Authors: Florian Schmidt

    Abstract: Exposure bias refers to the train-test discrepancy that seemingly arises when an autoregressive generative model uses only ground-truth contexts at training time but generated ones at test time. We separate the contributions of the model and the learning framework to clarify the debate on consequences and review proposed counter-measures. In this light, we argue that generalization is the underlyi… ▽ More

    Submitted 7 November, 2019; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: wngt2019 camera ready

  46. InceptionTime: Finding AlexNet for Time Series Classification

    Authors: Hassan Ismail Fawaz, Benjamin Lucas, Germain Forestier, Charlotte Pelletier, Daniel F. Schmidt, Jonathan Weber, Geoffrey I. Webb, Lhassane Idoumghar, Pierre-Alain Muller, François Petitjean

    Abstract: This paper brings deep learning at the forefront of research into Time Series Classification (TSC). TSC is the area of machine learning tasked with the categorization (or labelling) of time series. The last few decades of work in this area have led to significant progress in the accuracy of classifiers, with the state of the art now represented by the HIVE-COTE algorithm. While extremely accurate,… ▽ More

    Submitted 5 December, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

  47. arXiv:1908.11658  [pdf, ps, other

    cs.LG cs.CL stat.ML

    Autoregressive Text Generation Beyond Feedback Loops

    Authors: Florian Schmidt, Stephan Mandt, Thomas Hofmann

    Abstract: Autoregressive state transitions, where predictions are conditioned on past predictions, are the predominant choice for both deterministic and stochastic sequential models. However, autoregressive feedback exposes the evolution of the hidden state trajectory to potential biases from well-known train-test discrepancies. In this paper, we combine a latent state space model with a CRF observation mod… ▽ More

    Submitted 30 August, 2019; originally announced August 2019.

    Comments: emnlp camera ready

  48. arXiv:1904.00759  [pdf, other

    cs.CV cs.CR cs.LG stat.ML

    Adversarial camera stickers: A physical camera-based attack on deep learning systems

    Authors: Juncheng Li, Frank R. Schmidt, J. Zico Kolter

    Abstract: Recent work has documented the susceptibility of deep learning systems to adversarial examples, but most such attacks directly manipulate the digital input to a classifier. Although a smaller line of work considers physical adversarial attacks, in all cases these involve manipulating the object of interest, e.g., putting a physical sticker on an object to misclassify it, or manufacturing an object… ▽ More

    Submitted 8 June, 2019; v1 submitted 21 March, 2019; originally announced April 2019.

    Journal ref: Proceedings of the 36th International Conference on Machine Learning, PMLR 97:3896-3904, 2019

  49. arXiv:1904.00671  [pdf, other

    cs.NI

    Application-Agnostic Offloading of Packet Processing

    Authors: Oliver Hohlfeld, Helge Reelfs, Jan Rüth, Florian Schmidt, Torsten Zimmermann, Jens Hiller, Klaus Wehrle

    Abstract: As network speed increases, servers struggle to serve all requests directed at them. This challenge is rooted in a partitioned data path where the split between the kernel space networking stack and user space applications induces overheads. To address this challenge, we propose Santa, a new architecture to optimize the data path by enabling server applications to partially offload packet processi… ▽ More

    Submitted 1 April, 2019; originally announced April 2019.

    Comments: Technical Report, RWTH Aachen University, Chair of Communication and Distributed Systems

  50. arXiv:1903.10899  [pdf, ps, other

    cs.LG cs.NI eess.SP stat.ML

    Interference Prediction in Wireless Networks: Stochastic Geometry meets Recursive Filtering

    Authors: Jorge F. Schmidt, Udo Schilcher, Mahin K. Atiq, Christian Bettstetter

    Abstract: This article proposes and evaluates a technique to predict the level of interference in wireless networks. We design a recursive predictor that estimates future interference values by filtering measured interference at a given location. The predictor's parameterization is done offline by translating the autocorrelation of interference into an autoregressive moving average (ARMA) representation. Th… ▽ More

    Submitted 10 February, 2021; v1 submitted 26 March, 2019; originally announced March 2019.