(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–7 of 7 results for author: Togneri, R

Searching in archive eess. Search in all archives.
.
  1. arXiv:2308.12792  [pdf, other

    cs.SD eess.AS

    Sparks of Large Audio Models: A Survey and Outlook

    Authors: Siddique Latif, Moazzam Shoukat, Fahad Shamshad, Muhammad Usama, Yi Ren, Heriberto Cuayáhuitl, Wenwu Wang, Xulong Zhang, Roberto Togneri, Erik Cambria, Björn W. Schuller

    Abstract: This survey paper provides a comprehensive overview of the recent advancements and challenges in applying large language models to the field of audio signal processing. Audio processing, with its diverse signal representations and a wide range of sources--from human voices to musical instruments and environmental sounds--poses challenges distinct from those found in traditional Natural Language Pr… ▽ More

    Submitted 21 September, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: Under review, Repo URL: https://github.com/EmulationAI/awesome-large-audio-models

  2. arXiv:2210.11658  [pdf, other

    eess.SP

    A New Approach to Extract Fetal Electrocardiogram Using Affine Combination of Adaptive Filters

    Authors: Yu Xuan, Xiangyu Zhang, Shuyue Stella Li, Zihan Shen, Xin Xie, Leibny Paola Garcia, Roberto Togneri

    Abstract: The detection of abnormal fetal heartbeats during pregnancy is important for monitoring the health conditions of the fetus. While adult ECG has made several advances in modern medicine, noninvasive fetal electrocardiography (FECG) remains a great challenge. In this paper, we introduce a new method based on affine combinations of adaptive filters to extract FECG signals. The affine combination of m… ▽ More

    Submitted 26 February, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: 5 pages, 4 figures, 3 tables

  3. arXiv:2209.13112  [pdf, other

    eess.AS cs.SD

    Automated Sex Classification of Children's Voices and Changes in Differentiating Factors with Age

    Authors: Fuling Chen, Roberto Togneri, Murray Maybery, Diana Weiting Tan

    Abstract: Sex classification of children's voices allows for an investigation of the development of secondary sex characteristics which has been a key interest in the field of speech analysis. This research investigated a broad range of acoustic features from scripted and spontaneous speech and applied a hierarchical clustering-based machine learning model to distinguish the sex of children aged between 5 a… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

  4. arXiv:2209.12702  [pdf, other

    eess.AS cs.SD

    End-to-End Lyrics Recognition with Self-supervised Learning

    Authors: Xiangyu Zhang, Shuyue Stella Li, Zhanhong He, Roberto Togneri, Leibny Paola Garcia

    Abstract: Lyrics recognition is an important task in music processing. Despite traditional algorithms such as the hybrid HMM- TDNN model achieving good performance, studies on applying end-to-end models and self-supervised learning (SSL) are limited. In this paper, we first establish an end-to-end baseline for lyrics recognition and then explore the performance of SSL models on lyrics recognition task. We e… ▽ More

    Submitted 26 October, 2022; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: 4 pages, 2 figures, 3 tables

  5. arXiv:2102.07982  [pdf, other

    cs.SD eess.AS

    Voice Gender Scoring and Independent Acoustic Characterization of Perceived Masculinity and Femininity

    Authors: Fuling Chen, Roberto Togneri, Murray Maybery, Diana Tan

    Abstract: Previous research has found that voices can provide reliable information to be used for gender classification with a high level of accuracy. In social psychology, perceived masculinity and femininity (masculinity and femininity rated by humans) has often been considered an important feature when investigating the influence of vocal features on social behaviours. While previous studies have charact… ▽ More

    Submitted 4 August, 2022; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: 24 pages, 7 figures, journal

  6. arXiv:2012.03154  [pdf, other

    eess.AS cs.CR cs.SD

    Multi-task Learning Based Spoofing-Robust Automatic Speaker Verification System

    Authors: Yuanjun Zhao, Roberto Togneri, Victor Sreeram

    Abstract: Spoofing attacks posed by generating artificial speech can severely degrade the performance of a speaker verification system. Recently, many anti-spoofing countermeasures have been proposed for detecting varying types of attacks from synthetic speech to replay presentations. While there are numerous effective defenses reported on standalone anti-spoofing solutions, the integration for speaker veri… ▽ More

    Submitted 5 December, 2020; originally announced December 2020.

    Comments: 12 pages, 6 figures, codes used in the experimental section can be found at https://github.com/zhaoyj1122/SRASV

  7. arXiv:1801.00410  [pdf, other

    math.OC eess.SY stat.OT

    Enhanced ${q}$-Least Mean Square

    Authors: Shujaat Khan, Alishba Sadiq, Imran Naseem, Roberto Togneri, Mohammed Bennamoun

    Abstract: In this work, a new class of stochastic gradient algorithm is developed based on $q$-calculus. Unlike the existing $q$-LMS algorithm, the proposed approach fully utilizes the concept of $q$-calculus by incorporating time-varying $q$ parameter. The proposed enhanced $q$-LMS ($Eq$-LMS) algorithm utilizes a novel, parameterless concept of error-correlation energy and normalization of signal to ensure… ▽ More

    Submitted 1 January, 2018; originally announced January 2018.