Search | arXiv e-print repository

Showing 1–8 of 8 results for author: Shamshad, F

Searching in archive eess. Search in all archives.
  1. arXiv:2308.12792  [pdf, other]

    cs.SD eess.AS

    Sparks of Large Audio Models: A Survey and Outlook

    Authors: Siddique Latif, Moazzam Shoukat, Fahad Shamshad, Muhammad Usama, Yi Ren, Heriberto Cuayáhuitl, Wenwu Wang, Xulong Zhang, Roberto Togneri, Erik Cambria, Björn W. Schuller

    Abstract: This survey paper provides a comprehensive overview of the recent advancements and challenges in applying large language models to the field of audio signal processing. Audio processing, with its diverse signal representations and a wide range of sources--from human voices to musical instruments and environmental sounds--poses challenges distinct from those found in traditional Natural Language Pr…

    Submitted 21 September, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: Under review, Repo URL: https://github.com/EmulationAI/awesome-large-audio-models

  2. arXiv:2303.11607  [pdf, other]

    cs.CL cs.SD eess.AS

    Transformers in Speech Processing: A Survey

    Authors: Siddique Latif, Aun Zaidi, Heriberto Cuayahuitl, Fahad Shamshad, Moazzam Shoukat, Junaid Qadir

    Abstract: The remarkable success of transformers in the field of natural language processing has sparked the interest of the speech-processing community, leading to an exploration of their potential for modeling long-range dependencies within speech sequences. Recently, transformers have gained prominence across various speech-related domains, including automatic speech recognition, speech synthesis, speech…

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: under-review

  3. arXiv:2201.09873  [pdf, other]

    eess.IV cs.CV

    Transformers in Medical Imaging: A Survey

    Authors: Fahad Shamshad, Salman Khan, Syed Waqas Zamir, Muhammad Haris Khan, Munawar Hayat, Fahad Shahbaz Khan, Huazhu Fu

    Abstract: Following unprecedented success on the natural language tasks, Transformers have been successfully applied to several computer vision problems, achieving state-of-the-art results and prompting researchers to reconsider the supremacy of convolutional neural networks (CNNs) as de facto operators. Capitalizing on these advances in computer vision, the medical imaging field has also witnessed growin…

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: 41 pages, https://github.com/fahadshamshad/awesome-transformers-in-medical-imaging

  4. arXiv:2101.00240  [pdf, other]

    cs.SD cs.LG eess.AS

    A Survey on Deep Reinforcement Learning for Audio-Based Applications

    Authors: Siddique Latif, Heriberto Cuayáhuitl, Farrukh Pervez, Fahad Shamshad, Hafiz Shehbaz Ali, Erik Cambria

    Abstract: Deep reinforcement learning (DRL) is poised to revolutionise the field of artificial intelligence (AI) by endowing autonomous systems with high levels of understanding of the real world. Currently, deep learning (DL) is enabling DRL to effectively solve various intractable problems in various fields. Most importantly, DRL algorithms are also being employed in audio signal processing to learn direc…

    Submitted 1 January, 2021; originally announced January 2021.

    Comments: Under Review

  5. arXiv:2005.07026  [pdf, other]

    eess.IV cs.CV cs.IR cs.LG eess.SP

    Subsampled Fourier Ptychography using Pretrained Invertible and Untrained Network Priors

    Authors: Fahad Shamshad, Asif Hanif, Ali Ahmed

    Abstract: Recently pretrained generative models have shown promising results for subsampled Fourier Ptychography (FP) in terms of quality of reconstruction for extremely low sampling rate and high noise. However, one of the significant drawbacks of these pretrained generative priors is their limited representation capabilities. Moreover, training these generative models requires access to a large number of…

    Submitted 13 May, 2020; originally announced May 2020.

    Comments: Part of this work has been accepted in NeurIPS Deep Inverse Workshop, 2019

  6. arXiv:2002.12578  [pdf, other]

    eess.IV cs.CV cs.LG eess.SP stat.ML

    Class-Specific Blind Deconvolutional Phase Retrieval Under a Generative Prior

    Authors: Fahad Shamshad, Ali Ahmed

    Abstract: In this paper, we consider the highly ill-posed problem of jointly recovering two real-valued signals from the phaseless measurements of their circular convolution. The problem arises in various imaging modalities such as Fourier ptychography, X-ray crystallography, and in visible light communication. We propose to solve this inverse problem using alternating gradient descent algorithm under two p…

    Submitted 28 February, 2020; originally announced February 2020.

    Comments: 10 pages

  7. arXiv:1910.08792  [pdf, other]

    cs.IT eess.SP

    Sub-Nyquist Sampling of Sparse and Correlated Signals in Array Processing

    Authors: Ali Ahmed, Fahad Shamshad, Humera Hameed

    Abstract: This paper considers efficient sampling of simultaneously sparse and correlated (S&C) signals. Such signals arise in various applications in array processing. We propose an implementable sampling architecture for the acquisition of S&C at a sub-Nyquist rate. We prove a sampling theorem showing exact and stable reconstruction of the acquired signals even when the sampling rate is smaller than…

    Submitted 18 January, 2023; v1 submitted 19 October, 2019; originally announced October 2019.

  8. arXiv:1812.11065  [pdf, other]

    cs.LG eess.IV eess.SP stat.ML

    Deep Ptych: Subsampled Fourier Ptychography using Generative Priors

    Authors: Fahad Shamshad, Farwa Abbas, Ali Ahmed

    Abstract: This paper proposes a novel framework to regularize the highly ill-posed and non-linear Fourier ptychography problem using generative models. We demonstrate experimentally that our proposed algorithm, Deep Ptych, outperforms the existing Fourier ptychography techniques, in terms of quality of reconstruction and robustness against noise, using far fewer samples. We further modify the proposed appro…

    Submitted 22 December, 2018; originally announced December 2018.