(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–11 of 11 results for author: Coppock, H

.
  1. arXiv:2309.15024  [pdf, other

    cs.SD cs.LG eess.AS

    Synthia's Melody: A Benchmark Framework for Unsupervised Domain Adaptation in Audio

    Authors: Chia-Hsin Lin, Charles Jones, Björn W. Schuller, Harry Coppock

    Abstract: Despite significant advancements in deep learning for vision and natural language, unsupervised domain adaptation in audio remains relatively unexplored. We, in part, attribute this to the lack of an appropriate benchmark dataset. To address this gap, we present Synthia's melody, a novel audio data generation framework capable of simulating an infinite variety of 4-second melodies with user-specif… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

  2. arXiv:2212.08571  [pdf, other

    cs.SD cs.LG eess.AS stat.AP

    Statistical Design and Analysis for Robust Machine Learning: A Case Study from COVID-19

    Authors: Davide Pigoli, Kieran Baker, Jobie Budd, Lorraine Butler, Harry Coppock, Sabrina Egglestone, Steven G. Gilmour, Chris Holmes, David Hurley, Radka Jersakova, Ivan Kiskin, Vasiliki Koutra, Jonathon Mellor, George Nicholson, Joe Packham, Selina Patel, Richard Payne, Stephen J. Roberts, Björn W. Schuller, Ana Tendero-Cañadas, Tracey Thornley, Alexander Titcomb

    Abstract: Since early in the coronavirus disease 2019 (COVID-19) pandemic, there has been interest in using artificial intelligence methods to predict COVID-19 infection status based on vocal audio signals, for example cough recordings. However, existing studies have limitations in terms of data collection and of the assessment of the performances of the proposed predictive models. This paper rigorously ass… ▽ More

    Submitted 27 February, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

  3. arXiv:2212.08570  [pdf, other

    cs.SD cs.LG eess.AS

    Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers

    Authors: Harry Coppock, George Nicholson, Ivan Kiskin, Vasiliki Koutra, Kieran Baker, Jobie Budd, Richard Payne, Emma Karoune, David Hurley, Alexander Titcomb, Sabrina Egglestone, Ana Tendero Cañadas, Lorraine Butler, Radka Jersakova, Jonathon Mellor, Selina Patel, Tracey Thornley, Peter Diggle, Sylvia Richardson, Josef Packham, Björn W. Schuller, Davide Pigoli, Steven Gilmour, Stephen Roberts, Chris Holmes

    Abstract: Recent work has reported that AI classifiers trained on audio recordings can accurately predict severe acute respiratory syndrome coronavirus 2 (SARSCoV2) infection status. Here, we undertake a large scale study of audio-based deep learning classifiers, as part of the UK governments pandemic response. We collect and analyse a dataset of audio recordings from 67,842 individuals with linked metadata… ▽ More

    Submitted 2 March, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

  4. arXiv:2212.07738  [pdf

    cs.SD cs.LG eess.AS

    A large-scale and PCR-referenced vocal audio dataset for COVID-19

    Authors: Jobie Budd, Kieran Baker, Emma Karoune, Harry Coppock, Selina Patel, Ana Tendero Cañadas, Alexander Titcomb, Richard Payne, David Hurley, Sabrina Egglestone, Lorraine Butler, Jonathon Mellor, George Nicholson, Ivan Kiskin, Vasiliki Koutra, Radka Jersakova, Rachel A. McKendry, Peter Diggle, Sylvia Richardson, Björn W. Schuller, Steven Gilmour, Davide Pigoli, Stephen Roberts, Josef Packham, Tracey Thornley , et al. (1 additional authors not shown)

    Abstract: The UK COVID-19 Vocal Audio Dataset is designed for the training and evaluation of machine learning models that classify SARS-CoV-2 infection status or associated respiratory symptoms using vocal audio. The UK Health Security Agency recruited voluntary participants through the national Test and Trace programme and the REACT-1 survey in England from March 2021 to March 2022, during dominant transmi… ▽ More

    Submitted 3 November, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: 39 pages, 4 figures

  5. arXiv:2209.14345  [pdf, other

    cs.SD cs.LG eess.AS

    Audio Barlow Twins: Self-Supervised Audio Representation Learning

    Authors: Jonah Anton, Harry Coppock, Pancham Shukla, Bjorn W. Schuller

    Abstract: The Barlow Twins self-supervised learning objective requires neither negative samples or asymmetric learning updates, achieving results on a par with the current state-of-the-art within Computer Vision. As such, we present Audio Barlow Twins, a novel self-supervised audio representation learning approach, adapting Barlow Twins to the audio domain. We pre-train on the large-scale audio dataset Audi… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: 15 pages (4 main text, rest references + appendices)

  6. arXiv:2205.06799  [pdf, other

    cs.SD cs.LG eess.AS

    The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, & Mosquitoes

    Authors: Björn W. Schuller, Anton Batliner, Shahin Amiriparian, Christian Bergler, Maurice Gerczuk, Natalie Holz, Pauline Larrouy-Maestri, Sebastian P. Bayerl, Korbinian Riedhammer, Adria Mallol-Ragolta, Maria Pateraki, Harry Coppock, Ivan Kiskin, Marianne Sinka, Stephen Roberts

    Abstract: The ACM Multimedia 2022 Computational Paralinguistics Challenge addresses four different problems for the first time in a research competition under well-defined conditions: In the Vocalisations and Stuttering Sub-Challenges, a classification on human non-verbal vocalisations and speech has to be made; the Activity Sub-Challenge aims at beyond-audio human activity recognition from smartwatch senso… ▽ More

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: 5 pages, part of the ACM Multimedia 2022 Grand Challenge "The ACM Multimedia 2022 Computational Paralinguistics Challenge (ComParE 2022)"

    MSC Class: 68 ACM Class: I.2.7; I.5.0; J.3

  7. arXiv:2203.06064  [pdf, other

    cs.SD cs.LG

    Climate Change & Computer Audition: A Call to Action and Overview on Audio Intelligence to Help Save the Planet

    Authors: Björn W. Schuller, Alican Akman, Yi Chang, Harry Coppock, Alexander Gebhard, Alexander Kathan, Esther Rituerto-González, Andreas Triantafyllopoulos, Florian B. Pokorny

    Abstract: Among the seventeen Sustainable Development Goals (SDGs) proposed within the 2030 Agenda and adopted by all the United Nations member states, the 13$^{th}$ SDG is a call for action to combat climate change for a better world. In this work, we provide an overview of areas in which audio intelligence -- a powerful but in this context so far hardly considered technology -- can contribute to overcome… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

  8. arXiv:2202.08981  [pdf, other

    cs.SD cs.LG eess.AS

    A Summary of the ComParE COVID-19 Challenges

    Authors: Harry Coppock, Alican Akman, Christian Bergler, Maurice Gerczuk, Chloë Brown, Jagmohan Chauhan, Andreas Grammenos, Apinan Hasthanasombat, Dimitris Spathis, Tong Xia, Pietro Cicuta, Jing Han, Shahin Amiriparian, Alice Baird, Lukas Stappen, Sandra Ottl, Panagiotis Tzirakis, Anton Batliner, Cecilia Mascolo, Björn W. Schuller

    Abstract: The COVID-19 pandemic has caused massive humanitarian and economic damage. Teams of scientists from a broad range of disciplines have searched for methods to help governments and communities combat the disease. One avenue from the machine learning field which has been explored is the prospect of a digital mass test which can detect COVID-19 from infected individuals' respiratory sounds. We present… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: 18 pages, 13 figures

  9. arXiv:2107.14549  [pdf, other

    cs.SD cs.LG eess.AS

    Evaluating the COVID-19 Identification ResNet (CIdeR) on the INTERSPEECH COVID-19 from Audio Challenges

    Authors: Alican Akman, Harry Coppock, Alexander Gaskell, Panagiotis Tzirakis, Lyn Jones, Björn W. Schuller

    Abstract: We report on cross-running the recent COVID-19 Identification ResNet (CIdeR) on the two Interspeech 2021 COVID-19 diagnosis from cough and speech audio challenges: ComParE and DiCOVA. CIdeR is an end-to-end deep learning neural network originally designed to classify whether an individual is COVID-positive or COVID-negative based on coughing and breathing audio recordings from a published crowdsou… ▽ More

    Submitted 30 July, 2021; originally announced July 2021.

    Comments: 5 pages, 1 figure

  10. arXiv:2102.08359  [pdf, other

    cs.SD cs.LG eess.AS

    End-2-End COVID-19 Detection from Breath & Cough Audio

    Authors: Harry Coppock, Alexander Gaskell, Panagiotis Tzirakis, Alice Baird, Lyn Jones, Björn W. Schuller

    Abstract: Our main contributions are as follows: (I) We demonstrate the first attempt to diagnose COVID-19 using end-to-end deep learning from a crowd-sourced dataset of audio samples, achieving ROC-AUC of 0.846; (II) Our model, the COVID-19 Identification ResNet, (CIdeR), has potential for rapid scalability, minimal cost and improving performance as more data becomes available. This could enable regular CO… ▽ More

    Submitted 6 January, 2021; originally announced February 2021.

    Comments: 5 pages

    MSC Class: 68T11 ACM Class: I.2; I.5; J.3

  11. arXiv:2012.14553  [pdf, other

    cs.SD cs.LG eess.AS

    Detecting COVID-19 from Breathing and Coughing Sounds using Deep Neural Networks

    Authors: Björn W. Schuller, Harry Coppock, Alexander Gaskell

    Abstract: The COVID-19 pandemic has affected the world unevenly; while industrial economies have been able to produce the tests necessary to track the spread of the virus and mostly avoided complete lockdowns, developing countries have faced issues with testing capacity. In this paper, we explore the usage of deep learning models as a ubiquitous, low-cost, pre-testing method for detecting COVID-19 from audi… ▽ More

    Submitted 28 December, 2020; originally announced December 2020.

    Comments: 6 pages

    MSC Class: 68T11 ACM Class: I.2; I.5; J.3