(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–5 of 5 results for author: Yin, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2312.15197  [pdf, other

    cs.SD cs.CL cs.CV eess.AS

    TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation

    Authors: Xize Cheng, Rongjie Huang, Linjun Li, Tao Jin, Zehan Wang, Aoxiong Yin, Minglei Li, Xinyu Duan, changpeng yang, Zhou Zhao

    Abstract: Direct speech-to-speech translation achieves high-quality results through the introduction of discrete units obtained from self-supervised learning. This approach circumvents delays and cascading errors associated with model cascading. However, talking head translation, converting audio-visual speech (i.e., talking head video) from one language into another, still confronts several challenges comp… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

  2. arXiv:2305.14381  [pdf, other

    cs.LG cs.AI cs.CV cs.MM cs.SD eess.AS

    Connecting Multi-modal Contrastive Representations

    Authors: Zehan Wang, Yang Zhao, Xize Cheng, Haifeng Huang, Jiageng Liu, Li Tang, Linjun Li, Yongqi Wang, Aoxiong Yin, Ziang Zhang, Zhou Zhao

    Abstract: Multi-modal Contrastive Representation learning aims to encode different modalities into a semantically aligned shared space. This paradigm shows remarkable generalization ability on numerous downstream tasks across various modalities. However, the reliance on massive high-quality data pairs limits its further development on more modalities. This paper proposes a novel training-efficient method fo… ▽ More

    Submitted 18 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  3. arXiv:2204.03990  [pdf, other

    eess.SP

    Machine Learning aided Precise Indoor Positioning

    Authors: Anqi Yin, Zihuai Lin

    Abstract: This study describes a UWB and Machine Learning (ML)-based indoor positioning system. We propose a simple mathematical strategy to create data to reduce the job of measurements for fingerprint-based indoor localization systems. A considerable number of measurements can be avoided this way. The paper compares and contrasts the performance of four distinct models. Most test locations' average error… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

  4. arXiv:2202.03433  [pdf, other

    eess.IV cs.CV

    A Coarse-to-fine Morphological Approach With Knowledge-based Rules and Self-adapting Correction for Lung Nodules Segmentation

    Authors: Xinliang Fu, Jiayin Zheng, Juanyun Mai, Yanbo Shao, Minghao Wang, Linyu Li, Zhaoqi Diao, Yulong Chen, Jianyu Xiao, Jian You, Airu Yin, Yang Yang, Xiangcheng Qiu, Jinsheng Tao, Bo Wang, Hua Ji

    Abstract: The segmentation module which precisely outlines the nodules is a crucial step in a computer-aided diagnosis(CADきゃど) system. The most challenging part of such a module is how to achieve high accuracy of the segmentation, especially for the juxtapleural, non-solid and small nodules. In this research, we present a coarse-to-fine methodology that greatly improves the thresholding method performance with… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

  5. arXiv:2201.13392   

    eess.IV cs.CV

    MHSnet: Multi-head and Spatial Attention Network with False-Positive Reduction for Pulmonary Nodules Detection

    Authors: Juanyun Mai, Minghao Wang, Jiayin Zheng, Yanbo Shao, Zhaoqi Diao, Xinliang Fu, Yulong Chen, Jianyu Xiao, Jian You, Airu Yin, Yang Yang, Xiangcheng Qiu, Jinsheng Tao, Bo Wang, Hua Ji

    Abstract: The mortality of lung cancer has ranked high among cancers for many years. Early detection of lung cancer is critical for disease prevention, cure, and mortality rate reduction. However, existing detection methods on pulmonary nodules introduce an excessive number of false positive proposals in order to achieve high sensitivity, which is not practical in clinical situations. In this paper, we prop… ▽ More

    Submitted 12 May, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: We have to revise the experiment results and conclusions