(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–20 of 20 results for author: Löser, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.14490  [pdf, other

    cs.CL

    LongHealth: A Question Answering Benchmark with Long Clinical Documents

    Authors: Lisa Adams, Felix Busch, Tianyu Han, Jean-Baptiste Excoffier, Matthieu Ortala, Alexander Löser, Hugo JWL. Aerts, Jakob Nikolas Kather, Daniel Truhn, Keno Bressem

    Abstract: Background: Recent advancements in large language models (LLMs) offer potential benefits in healthcare, particularly in processing extensive patient records. However, existing benchmarks do not fully assess LLMs' capability in handling real-world, lengthy clinical data. Methods: We present the LongHealth benchmark, comprising 20 detailed fictional patient cases across various diseases, with each… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 11 pages, 3 figures, 5 tables

  2. arXiv:2304.08247  [pdf, ps, other

    cs.CL cs.AI

    MedAlpaca -- An Open-Source Collection of Medical Conversational AI Models and Training Data

    Authors: Tianyu Han, Lisa C. Adams, Jens-Michalis Papaioannou, Paul Grundmann, Tom Oberhauser, Alexander Löser, Daniel Truhn, Keno K. Bressem

    Abstract: As large language models (LLMs) like OpenAI's GPT series continue to make strides, we witness the emergence of artificial intelligence applications in an ever-expanding range of fields. In medicine, these LLMs hold considerable promise for improving medical workflows, diagnostics, patient care, and education. Yet, there is an urgent need for open-source models that can be deployed on-premises to s… ▽ More

    Submitted 4 October, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

  3. MEDBERT.de: A Comprehensive German BERT Model for the Medical Domain

    Authors: Keno K. Bressem, Jens-Michalis Papaioannou, Paul Grundmann, Florian Borchert, Lisa C. Adams, Leonhard Liu, Felix Busch, Lina Xu, Jan P. Loyen, Stefan M. Niehues, Moritz Augustin, Lennart Grosser, Marcus R. Makowski, Hugo JWL. Aerts, Alexander Löser

    Abstract: This paper presents medBERTde, a pre-trained German BERT model specifically designed for the German medical domain. The model has been trained on a large corpus of 4.7 Million German medical documents and has been shown to achieve new state-of-the-art performance on eight different medical benchmarks covering a wide range of disciplines and medical document types. In addition to evaluating the ove… ▽ More

    Submitted 24 March, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: Keno K. Bressem and Jens-Michalis Papaioannou and Paul Grundmann contributed equally

    Journal ref: Expert Systems with Applications 2024;237(21):121598

  4. arXiv:2210.08500  [pdf, other

    cs.CL

    This Patient Looks Like That Patient: Prototypical Networks for Interpretable Diagnosis Prediction from Clinical Text

    Authors: Betty van Aken, Jens-Michalis Papaioannou, Marcel G. Naik, Georgios Eleftheriadis, Wolfgang Nejdl, Felix A. Gers, Alexander Löser

    Abstract: The use of deep neural models for diagnosis prediction from clinical text has shown promising results. However, in clinical practice such models must not only be accurate, but provide doctors with interpretable and helpful results. We introduce ProtoPatient, a novel method based on prototypical networks and label-wise attention with both of these abilities. ProtoPatient makes predictions based on… ▽ More

    Submitted 16 October, 2022; originally announced October 2022.

    Comments: AACL-IJCNLP 2022 Main Conference (Long Paper)

  5. arXiv:2208.01912  [pdf, other

    cs.CL

    Cross-Lingual Knowledge Transfer for Clinical Phenotyping

    Authors: Jens-Michalis Papaioannou, Paul Grundmann, Betty van Aken, Athanasios Samaras, Ilias Kyparissidis, George Giannakoulas, Felix Gers, Alexander Löser

    Abstract: Clinical phenotyping enables the automatic extraction of clinical conditions from patient records, which can be beneficial to doctors and clinics worldwide. However, current state-of-the-art models are mostly applicable to clinical notes written in English. We therefore investigate cross-lingual knowledge transfer strategies to execute this task for clinics that do not use the English language and… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: LREC 2022 submmision: January 2022

    ACM Class: I.2.7; J.3

    Journal ref: Proceedings of the Language Resources and Evaluation Conference. 2022; 900-909

  6. arXiv:2111.15512  [pdf, other

    cs.CL cs.LG

    What Do You See in this Patient? Behavioral Testing of Clinical NLP Models

    Authors: Betty van Aken, Sebastian Herrmann, Alexander Löser

    Abstract: Decision support systems based on clinical notes have the potential to improve patient care by pointing doctors towards overseen risks. Predicting a patient's outcome is an essential part of such systems, for which the use of deep neural networks has shown promising results. However, the patterns learned by these networks are mostly opaque and previous work revealed flaws regarding the reproductio… ▽ More

    Submitted 30 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021 Research2Clinics Workshop, Bridging the Gap: From Machine Learning Research to Clinical Practice

  7. arXiv:2108.00775  [pdf, other

    cs.IR cs.CL

    Self-supervised Answer Retrieval on Clinical Notes

    Authors: Paul Grundmann, Sebastian Arnold, Alexander Löser

    Abstract: Retrieving answer passages from long documents is a complex task requiring semantic understanding of both discourse and document context. We approach this challenge specifically in a clinical scenario, where doctors retrieve cohorts of patients based on diagnoses and other latent medical aspects. We introduce CAPR, a rule-based self-supervision objective for training Transformer language models fo… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

  8. arXiv:2103.15509  [pdf, other

    cs.DB

    Automatic Clustering in Hyrise

    Authors: Alexander Löser

    Abstract: Physical data layout is an important performance factor for modern databases. Clustering, i.e., storing similar values in proximity, can lead to performance gains in several ways. We present an automated model to determine beneficial clustering columns and a clustering algorithm for the column-oriented, memory-resident database Hyrise. To automatically select clustering columns, the model analyzes… ▽ More

    Submitted 29 March, 2021; originally announced March 2021.

  9. arXiv:2102.04110  [pdf, other

    cs.CL

    Clinical Outcome Prediction from Admission Notes using Self-Supervised Knowledge Integration

    Authors: Betty van Aken, Jens-Michalis Papaioannou, Manuel Mayrdorfer, Klemens Budde, Felix A. Gers, Alexander Löser

    Abstract: Outcome prediction from clinical text can prevent doctors from overlooking possible risks and help hospitals to plan capacities. We simulate patients at admission time, when decision support can be especially valuable, and contribute a novel admission to discharge task with four common outcome prediction targets: Diagnoses at discharge, procedures performed, in-hospital mortality and length-of-sta… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

    Comments: EACL 2021

  10. arXiv:2011.04507  [pdf, other

    cs.CL

    VisBERT: Hidden-State Visualizations for Transformers

    Authors: Betty van Aken, Benjamin Winter, Alexander Löser, Felix A. Gers

    Abstract: Explainability and interpretability are two important concepts, the absence of which can and should impede the application of well-performing neural networks to real-world problems. At the same time, they are difficult to incorporate into the large, black-box models that achieve state-of-the-art results in a multitude of NLP tasks. Bidirectional Encoder Representations from Transformers (BERT) is… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: Published in WWW '20: Companion Proceedings of the Web Conference 2020

    Journal ref: Companion Proceedings of the Web Conference 2020

  11. Learning Contextualized Document Representations for Healthcare Answer Retrieval

    Authors: Sebastian Arnold, Betty van Aken, Paul Grundmann, Felix A. Gers, Alexander Löser

    Abstract: We present Contextual Discourse Vectors (CDV), a distributed document representation for efficient answer retrieval from long healthcare documents. Our approach is based on structured query tuples of entities and aspects from free text and medical taxonomies. Our model leverages a dual encoder architecture with hierarchical LSTM layers and multi-task training to encode the position of clinical ent… ▽ More

    Submitted 3 February, 2020; originally announced February 2020.

    Comments: The Web Conference 2020 (WWW '20)

  12. arXiv:1911.02453  [pdf, other

    cs.DS

    From Symmetry to Asymmetry: Generalizing TSP Approximations by Parametrization

    Authors: Lukas Behrendt, Katrin Casel, Tobias Friedrich, J. A. Gregor Lagodzinski, Alexander Löser, Marcus Wilhelm

    Abstract: We generalize the tree doubling and Christofides algorithm, the two most common approximations for TSP, to parameterized approximations for ATSP. The parameters we consider for the respective parameterizations are upper bounded by the number of asymmetric distances in the given instance, which yields algorithms to efficiently compute constant factor approximations also for moderately asymmetric TS… ▽ More

    Submitted 26 February, 2020; v1 submitted 6 November, 2019; originally announced November 2019.

  13. How Does BERT Answer Questions? A Layer-Wise Analysis of Transformer Representations

    Authors: Betty van Aken, Benjamin Winter, Alexander Löser, Felix A. Gers

    Abstract: Bidirectional Encoder Representations from Transformers (BERT) reach state-of-the-art results in a variety of Natural Language Processing tasks. However, understanding of their internal functioning is still insufficient and unsatisfactory. In order to better understand BERT and other Transformer-based models, we present a layer-wise analysis of BERT's hidden states. Unlike previous research, which… ▽ More

    Submitted 11 September, 2019; originally announced September 2019.

    Comments: Accepted at CIKM 2019

  14. arXiv:1902.04793  [pdf, other

    cs.CL

    SECTOR: A Neural Model for Coherent Topic Segmentation and Classification

    Authors: Sebastian Arnold, Rudolf Schneider, Philippe Cudré-Mauroux, Felix A. Gers, Alexander Löser

    Abstract: When searching for information, a human reader first glances over a document, spots relevant sections and then focuses on a few sentences for resolving her intention. However, the high variance of document structure complicates to identify the salient topic of a given section at a glance. To tackle this challenge, we present SECTOR, a model to support machine reading systems by segmenting document… ▽ More

    Submitted 13 February, 2019; originally announced February 2019.

    Comments: Author's final version, accepted for publication at TACL, 2019

  15. arXiv:1809.07572  [pdf, other

    cs.CL

    Challenges for Toxic Comment Classification: An In-Depth Error Analysis

    Authors: Betty van Aken, Julian Risch, Ralf Krestel, Alexander Löser

    Abstract: Toxic comment classification has become an active research field with many recently proposed approaches. However, while these approaches address some of the task's challenges others still remain unsolved and directions for further research are needed. To this end, we compare different deep learning and shallow approaches on a new, large comment dataset and propose an ensemble that outperforms all… ▽ More

    Submitted 20 September, 2018; originally announced September 2018.

    Comments: ALW2: 2nd Workshop on Abusive Language Online to be held at EMNLP 2018 (Brussels, Belgium), October 31st, 2018

  16. arXiv:1805.09648  [pdf, other

    cs.CL

    Crowd-Labeling Fashion Reviews with Quality Control

    Authors: Iurii Chernushenko, Felix A. Gers, Alexander Löser, Alessandro Checco

    Abstract: We present a new methodology for high-quality labeling in the fashion domain with crowd workers instead of experts. We focus on the Aspect-Based Sentiment Analysis task. Our methods filter out inaccurate input from crowd workers but we preserve different worker labeling to capture the inherent high variability of the opinions. We demonstrate the quality of labeled data based on Facebook's FastText… ▽ More

    Submitted 5 April, 2018; originally announced May 2018.

  17. arXiv:1803.04884  [pdf, other

    cs.DB cs.CL cs.NE

    IDEL: In-Database Entity Linking with Neural Embeddings

    Authors: Torsten Kilias, Alexander Löser, Felix A. Gers, Richard Koopmanschap, Ying Zhang, Martin Kersten

    Abstract: We present a novel architecture, In-Database Entity Linking (IDEL), in which we integrate the analytics-optimized RDBMS MonetDB with neural text mining abilities. Our system design abstracts core tasks of most neural entity linking systems for MonetDB. To the best of our knowledge, this is the first defacto implemented system integrating entity-linking in a database. We leverage the ability of Mon… ▽ More

    Submitted 13 March, 2018; originally announced March 2018.

    Comments: This manuscript is a preprint for a paper submitted to VLDB2018

  18. arXiv:1710.09788  [pdf, other

    cs.AI

    FashionBrain Project: A Vision for Understanding Europe's Fashion Data Universe

    Authors: Alessandro Checco, Gianluca Demartini, Alexander Loeser, Ines Arous, Mourad Khayati, Matthias Dantone, Richard Koopmanschap, Svetlin Stalinov, Martin Kersten, Ying Zhang

    Abstract: A core business in the fashion industry is the understanding and prediction of customer needs and trends. Search engines and social networks are at the same time a fundamental bridge and a costly middleman between the customer's purchase intention and the retailer. To better exploit Europe's distinctive characteristics e.g., multiple languages, fashion and cultural differences, it is pivotal to re… ▽ More

    Submitted 26 October, 2017; originally announced October 2017.

  19. arXiv:1707.07499  [pdf, other

    cs.CL

    Analysing Errors of Open Information Extraction Systems

    Authors: Rudolf Schneider, Tom Oberhauser, Tobias Klatt, Felix A. Gers, Alexander Löser

    Abstract: We report results on benchmarking Open Information Extraction (OIE) systems using RelVis, a toolkit for benchmarking Open Information Extraction systems. Our comprehensive benchmark contains three data sets from the news domain and one data set from Wikipedia with overall 4522 labeled sentences and 11243 binary or n-ary OIE relations. In our analysis on these data sets we compared the performance… ▽ More

    Submitted 24 July, 2017; originally announced July 2017.

    Comments: Accepted at Building Linguistically Generalizable NLP Systems at EMNLP 2017

  20. arXiv:1608.06757  [pdf, other

    cs.CL

    Robust Named Entity Recognition in Idiosyncratic Domains

    Authors: Sebastian Arnold, Felix A. Gers, Torsten Kilias, Alexander Löser

    Abstract: Named entity recognition often fails in idiosyncratic domains. That causes a problem for depending tasks, such as entity linking and relation extraction. We propose a generic and robust approach for high-recall named entity recognition. Our approach is easy to train and offers strong generalization over diverse domain-specific language, such as news documents (e.g. Reuters) or biomedical text (e.g… ▽ More

    Submitted 24 August, 2016; originally announced August 2016.

    Comments: 8 pages, 1 figure