Search | arXiv e-print repository

Showing 1–6 of 6 results for author: Cui, F

Searching in archive eess. Search in all archives.
  1. arXiv:2311.04685  [pdf, other]

    eess.IV

    An End-Cloud Computing Enabled Surveillance Video Transmission System

    Authors: Dingxi Yang, Zhijin Qin, Liting Wang, Xiaoming Tao, Fang Cui, Hengjiang Wang

    Abstract: The enormous data volume of video poses a significant burden on the network. Particularly, transferring high-definition surveillance videos to the cloud consumes a significant amount of spectrum resources. To address these issues, we propose a surveillance video transmission system enabled by end-cloud computing. Specifically, the cameras actively down-sample the original video and then a redundan…

    Submitted 8 November, 2023; originally announced November 2023.

  2. arXiv:2303.10912  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Exploring Representation Learning for Small-Footprint Keyword Spotting

    Authors: Fan Cui, Liyong Guo, Quandong Wang, Peng Gao, Yujun Wang

    Abstract: In this paper, we investigate representation learning for low-resource keyword spotting (KWS). The main challenges of KWS are limited labeled data and limited available device resources. To address those challenges, we explore representation learning for KWS by self-supervised contrastive learning and self-training with pretrained model. First, local-global contrastive siamese networks (LGCSiam) a…

    Submitted 20 March, 2023; originally announced March 2023.

  3. arXiv:2303.10897  [pdf, other]

    cs.SD cs.CL eess.AS q-bio.NC

    Relate auditory speech to EEG by shallow-deep attention-based network

    Authors: Fan Cui, Liyong Guo, Lang He, Jiyao Liu, ErCheng Pei, Yujun Wang, Dongmei Jiang

    Abstract: Electroencephalography (EEG) plays a vital role in detecting how brain responses to different stimulus. In this paper, we propose a novel Shallow-Deep Attention-based Network (SDANet) to classify the correct auditory stimulus evoking the EEG signal. It adopts the Attention-based Correlation Module (ACM) to discover the connection between auditory speech and EEG from global aspect, and the Shallow-…

    Submitted 20 March, 2023; originally announced March 2023.

  4. arXiv:2303.05678  [pdf, other]

    cs.SD cs.LG eess.AS

    Improving Weakly Supervised Sound Event Detection with Causal Intervention

    Authors: Yifei Xin, Dongchao Yang, Fan Cui, Yujun Wang, Yuexian Zou

    Abstract: Existing weakly supervised sound event detection (WSSED) work has not explored both types of co-occurrences simultaneously, i.e., some sound events often co-occur, and their occurrences are usually accompanied by specific background sounds, so they would be inevitably entangled, causing misclassification and biased localization results with only clip-level supervision. To tackle this issue, we fir…

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP2023

  5. arXiv:2211.00508  [pdf, other]

    eess.AS cs.CL cs.SD

    Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation

    Authors: Liyong Guo, Xiaoyu Yang, Quandong Wang, Yuxiang Kong, Zengwei Yao, Fan Cui, Fangjun Kuang, Wei Kang, Long Lin, Mingshuang Luo, Piotr Zelasko, Daniel Povey

    Abstract: Knowledge distillation (KD) is a common approach to improve model performance in automatic speech recognition (ASR), where a student model is trained to imitate the output behaviour of a teacher model. However, traditional KD methods suffer from teacher label storage issue, especially when the training corpora are large. Although on-the-fly teacher label generation tackles this issue, the training…

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: Submitted to ICASSP 2022

  6. arXiv:2112.10153  [pdf, other]

    cs.SD eess.AS

    Detect what you want: Target Sound Detection

    Authors: Dongchao Yang, Helin Wang, Yuexian Zou, Fan Cui, Yujun Wang

    Abstract: Human beings can perceive a target sound type from a multi-source mixture signal by the selective auditory attention, however, such functionality was hardly ever explored in machine hearing. This paper addresses the target sound detection (TSD) task, which aims to detect the target sound signal from a mixture audio when a target sound's reference audio is given. We present a novel target sound det…

    Submitted 7 July, 2022; v1 submitted 19 December, 2021; originally announced December 2021.

    Comments: Submitted to DCASE workshop2022