(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 109 results for author: Lin, X

Searching in archive eess. Search in all archives.
.
  1. arXiv:2407.07667  [pdf, other

    cs.CV eess.IV

    VEnhancer: Generative Space-Time Enhancement for Video Generation

    Authors: Jingwen He, Tianfan Xue, Dongyang Liu, Xinqi Lin, Peng Gao, Dahua Lin, Yu Qiao, Wanli Ouyang, Ziwei Liu

    Abstract: We present VEnhancer, a generative space-time enhancement framework that improves the existing text-to-video results by adding more details in spatial domain and synthetic detailed motion in temporal domain. Given a generated low-quality video, our approach can increase its spatial and temporal resolution simultaneously with arbitrary up-sampling space and time scales through a unified video diffu… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: technical report

  2. arXiv:2407.02744  [pdf, other

    eess.IV cs.CV

    Highly Accelerated MRI via Implicit Neural Representation Guided Posterior Sampling of Diffusion Models

    Authors: Jiayue Chu, Chenhe Du, Xiyue Lin, Yuyao Zhang, Hongjiang Wei

    Abstract: Reconstructing high-fidelity magnetic resonance (MR) images from under-sampled k-space is a commonly used strategy to reduce scan time. The posterior sampling of diffusion models based on the real measurement data holds significant promise of improved reconstruction accuracy. However, traditional posterior sampling methods often lack effective data consistency guidance, leading to inaccurate and u… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  3. arXiv:2406.14264  [pdf, other

    eess.IV cs.CV

    Zero-Shot Image Denoising for High-Resolution Electron Microscopy

    Authors: Xuanyu Tian, Zhuoya Dong, Xiyue Lin, Yue Gao, Hongjiang Wei, Yanhang Ma, Jingyi Yu, Yuyao Zhang

    Abstract: High-resolution electron microscopy (HREM) imaging technique is a powerful tool for directly visualizing a broad range of materials in real-space. However, it faces challenges in denoising due to ultra-low signal-to-noise ratio (SNR) and scarce data availability. In this work, we propose Noise2SR, a zero-shot self-supervised learning (ZS-SSL) denoising framework for HREM. Within our framework, we… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 12 pages, 12 figures

  4. arXiv:2406.00780  [pdf, other

    cs.RO eess.SY

    Accelerate Hybrid Model Predictive Control using Generalized Benders Decomposition

    Authors: Xuan Lin

    Abstract: Hybrid model predictive control with both continuous and discrete variables is widely applicable to robotics tasks. Due to the combinatorial complexity, the solving speed of hybrid MPC can be insufficient for real-time applications. In this paper, we propose to accelerate hybrid MPC using Generalized Benders Decomposition (GBD). GBD enumerates cuts online and stores inside a finite buffer to provi… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2401.00917

  5. arXiv:2405.20219  [pdf, other

    eess.SY

    System Identification for Lithium-Ion Batteries with Nonlinear Coupled Electro-Thermal Dynamics via Bayesian Optimization

    Authors: Hao Tu, Xinfan Lin, Yebin Wang, Huazhen Fang

    Abstract: Essential to various practical applications of lithium-ion batteries is the availability of accurate equivalent circuit models. This paper presents a new coupled electro-thermal model for batteries and studies how to extract it from data. We consider the problem of maximum likelihood parameter estimation, which, however, is nontrivial to solve as the model is nonlinear in both its dynamics and mea… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 2024 American Control Conference(ACC)

  6. arXiv:2405.15519  [pdf

    physics.optics eess.IV

    Confocal structured illumination microscopy

    Authors: Weishuai Zhou, Manhong Yao, Xi Lin, Quan Yu, Junzheng Peng, Jingang Zhong

    Abstract: Confocal microscopy, a critical advancement in optical imaging, is widely applied because of its excellent anti-noise ability. However, it has low imaging efficiency and can cause phototoxicity. Optical-sectioning structured illumination microscopy (OS-SIM) can overcome the limitations of confocal microscopy but still face challenges in imaging depth and signal-to-noise ratio (SNR). We introduce t… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  7. arXiv:2405.09472  [pdf, other

    eess.IV cs.CV

    Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment

    Authors: Xinying Lin, Xuyang Liu, Hong Yang, Xiaohai He, Honggang Chen

    Abstract: With the advent of image super-resolution (SR) algorithms, how to evaluate the quality of generated SR images has become an urgent task. Although full-reference methods perform well in SR image quality assessment (SR-IQA), their reliance on high-resolution (HR) images limits their practical applicability. Leveraging available reconstruction information as much as possible for SR-IQA, such as low-r… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  8. arXiv:2404.19500  [pdf, other

    cs.CV cs.AI cs.MM eess.IV

    Towards Real-world Video Face Restoration: A New Benchmark

    Authors: Ziyan Chen, Jingwen He, Xinqi Lin, Yu Qiao, Chao Dong

    Abstract: Blind face restoration (BFR) on images has significantly progressed over the last several years, while real-world video face restoration (VFR), which is more challenging for more complex face motions such as moving gaze directions and facial orientations involved, remains unsolved. Typical BFR methods are evaluated on privately synthesized datasets or self-collected real-world low-quality face ima… ▽ More

    Submitted 4 May, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: Project page: https://ziyannchen.github.io/projects/VFRxBenchmark/

  9. arXiv:2404.17890  [pdf, other

    eess.IV cs.AI cs.CV

    DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction

    Authors: Chenhe Du, Xiyue Lin, Qing Wu, Xuanyu Tian, Ying Su, Zhe Luo, Hongjiang Wei, S. Kevin Zhou, Jingyi Yu, Yuyao Zhang

    Abstract: Limited-angle and sparse-view computed tomography (LACT and SVCT) are crucial for expanding the scope of X-ray CT applications. However, they face challenges due to incomplete data acquisition, resulting in diverse artifacts in the reconstructed CT images. Emerging implicit neural representation (INR) techniques, such as NeRF, NeAT, and NeRP, have shown promise in under-determined CT imaging recon… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: 15 pages, 10 figures

    ACM Class: I.2.10; I.4.5

  10. arXiv:2404.13372  [pdf, other

    eess.IV cs.CV

    HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression

    Authors: Lei Lu, Yanyue Xie, Wei Jiang, Wei Wang, Xue Lin, Yanzhi Wang

    Abstract: This paper investigates the challenging problem of learned image compression (LIC) with extreme low bitrates. Previous LIC methods based on transmitting quantized continuous features often yield blurry and noisy reconstruction due to the severe quantization loss. While previous LIC methods based on learned codebooks that discretize visual space usually give poor-fidelity reconstruction due to the… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  11. arXiv:2404.10343  [pdf, other

    cs.CV eess.IV

    The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang , et al. (109 additional authors not shown)

    Abstract: This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024

  12. arXiv:2403.19083  [pdf, other

    cs.LG cs.AI eess.IV

    Improving Cancer Imaging Diagnosis with Bayesian Networks and Deep Learning: A Bayesian Deep Learning Approach

    Authors: Pei Xi, Lin

    Abstract: With recent advancements in the development of artificial intelligence applications using theories and algorithms in machine learning, many accurate models can be created to train and predict on given datasets. With the realization of the importance of imaging interpretation in cancer diagnosis, this article aims to investigate the theory behind Deep Learning and Bayesian Network prediction models… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  13. arXiv:2403.18607  [pdf, other

    cs.CR cs.AI eess.SP

    Spikewhisper: Temporal Spike Backdoor Attacks on Federated Neuromorphic Learning over Low-power Devices

    Authors: Hanqing Fu, Gaolei Li, Jun Wu, Jianhua Li, Xi Lin, Kai Zhou, Yuchen Liu

    Abstract: Federated neuromorphic learning (FedNL) leverages event-driven spiking neural networks and federated learning frameworks to effectively execute intelligent analysis tasks over amounts of distributed low-power devices but also perform vulnerability to poisoning attacks. The threat of backdoor attacks on traditional deep neural networks typically comes from time-invariant data. However, in FedNL, un… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  14. arXiv:2403.16070  [pdf, ps, other

    eess.SY math.OC

    Towards a MATLAB Toolbox to compute backstepping kernels using the power series method

    Authors: Xin Lin, Rafael Vazquez, Miroslav Krstic

    Abstract: In this paper, we extend our previous work on the power series method for computing backstepping kernels. Our first contribution is the development of initial steps towards a MATLAB toolbox dedicated to backstepping kernel computation. This toolbox would exploit MATLAB's linear algebra and sparse matrix manipulation features for enhanced efficiency; our initial findings show considerable improveme… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Preprint submitted to CDC 2024

  15. arXiv:2403.14250  [pdf, other

    eess.IV cs.CR cs.CV

    Safeguarding Medical Image Segmentation Datasets against Unauthorized Training via Contour- and Texture-Aware Perturbations

    Authors: Xun Lin, Yi Yu, Song Xia, Jue Jiang, Haoran Wang, Zitong Yu, Yizhong Liu, Ying Fu, Shuai Wang, Wenzhong Tang, Alex Kot

    Abstract: The widespread availability of publicly accessible medical images has significantly propelled advancements in various research and clinical fields. Nonetheless, concerns regarding unauthorized training of AI systems for commercial purposes and the duties of patient privacy protection have led numerous institutions to hesitate to share their images. This is particularly true for medical image segme… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  16. arXiv:2402.11993  [pdf

    cs.NI eess.SP

    Towards Energy Efficient RAN: From Industry Standards to Trending Practice

    Authors: Lopamudra Kundu, Xingqin Lin, Rajesh Gadiyar

    Abstract: As 5G deployments continue throughout the world, concerns regarding its energy consumption have gained significant traction. This article focuses on radio access networks (RANs) which account for a major portion of the network energy use. Firstly, we introduce the state-of-the-art 3GPP and O-RAN standardization work on enhancing RAN energy efficiency. Then we highlight three unique ways for enabli… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 8 pages, 6 figures

  17. arXiv:2401.16879  [pdf, ps, other

    math.OC eess.SY

    Optimal Control of a Stochastic Power System -- Algorithms and Mathematical Analysis

    Authors: Zhen Wang, Kaihua Xi, Aijie Cheng, Hai Xiang Lin, Jan H. van Schuppen

    Abstract: The considered optimal control problem of a stochastic power system, is to select the set of power supply vectors which infimizes the probability that the phase-angle differences of any power flow of the network, endangers the transient stability of the power system by leaving a critical subset. The set of control laws is restricted to be a periodically recomputed set of fixed power supply vectors… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: 24 pages, 2 figures

    MSC Class: 93E20; 90C30; and 90C26

  18. arXiv:2312.15174  [pdf

    cs.NI eess.SP

    The Bridge Toward 6G: 5G-Advanced Evolution in 3GPP Release 19

    Authors: Xingqin Lin

    Abstract: The 3rd generation partnership project (3GPP) initiated 5G-Advanced in Release 18, laying a solid foundation for the further evolution of 5G-Advanced. Release 19-the next wave of 5G-Advanced-will primarily focus on commercial deployment needs while serving as a bridge toward 6G. In this article, we provide an in-depth overview of the 5G-Advanced evolution in 3GPP Release 19. We not only delve into… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

    Comments: 8 pages, 5 figures, submitted for publication

  19. arXiv:2312.05786  [pdf, other

    eess.SP cs.IT

    Deep Learning for Joint Design of Pilot, Channel Feedback, and Hybrid Beamforming in FDD Massive MIMO-OFDM Systems

    Authors: Junyi Yang, Weifeng Zhu, Shu Sun, Xiaofeng Li, Xingqin Lin, Meixia Tao

    Abstract: This letter considers the transceiver design in frequency division duplex (FDD) massive multiple-input multiple-output (MIMO) orthogonal frequency division multiplexing (OFDM) systems for high-quality data transmission. We propose a novel deep learning based framework where the procedures of pilot design, channel feedback, and hybrid beamforming are realized by carefully crafted deep neural networ… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 5 pages, 4 figures, acccpted by IEEE Communication Letters

  20. arXiv:2312.05382  [pdf, other

    eess.SY math.OC stat.ML

    Estimation Sample Complexity of a Class of Nonlinear Continuous-time Systems

    Authors: Simon Kuang, Xinfan Lin

    Abstract: We present a method of parameter estimation for large class of nonlinear systems, namely those in which the state consists of output derivatives and the flow is linear in the parameter. The method, which solves for the unknown parameter by directly inverting the dynamics using regularized linear regression, is based on new design and analysis ideas for differentiation filtering and regularized lea… ▽ More

    Submitted 22 April, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: Revised introduction and review; proofs moved to appendices; numerical example

  21. arXiv:2312.03227  [pdf, other

    cs.CV eess.IV

    Human Body Model based ID using Shape and Pose Parameters

    Authors: Aravind Sundaresan, Brian Burns, Indranil Sur, Yi Yao, Xiao Lin, Sujeong Kim

    Abstract: We present a Human Body model based IDentification system (HMID) system that is jointly trained for shape, pose and biometric identification. HMID is based on the Human Mesh Recovery (HMR) network and we propose additional losses to improve and stabilize shape estimation and biometric identification while maintaining the pose and shape output. We show that when our HMID network is trained using ad… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: to be published in IEEE International Joint Conference on Biometrics, Ljubljana, Slovenia 2023

  22. arXiv:2311.18188  [pdf, other

    eess.AS cs.LG

    Speech Understanding on Tiny Devices with A Learning Cache

    Authors: Afsara Benazir, Zhiming Xu, Felix Xiaozhu Lin

    Abstract: This paper addresses spoken language understanding (SLU) on microcontroller-like embedded devices, integrating on-device execution with cloud offloading in a novel fashion. We leverage temporal locality in the speech inputs to a device and reuse recent SLU inferences accordingly. Our idea is simple: let the device match incoming inputs against cached results, and only offload inputs not matched to… ▽ More

    Submitted 8 May, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: accepted at MobiSys'24

  23. arXiv:2311.17065  [pdf, other

    eess.AS cs.CL cs.LG

    Efficient Deep Speech Understanding at the Edge

    Authors: Rongxiang Wang, Felix Xiaozhu Lin

    Abstract: In contemporary speech understanding (SU), a sophisticated pipeline is employed, encompassing the ingestion of streaming voice input. The pipeline executes beam search iteratively, invoking a deep neural network to generate tentative outputs (referred to as hypotheses) in an autoregressive manner. Periodically, the pipeline assesses attention and Connectionist Temporal Classification (CTC) scores.… ▽ More

    Submitted 4 December, 2023; v1 submitted 22 November, 2023; originally announced November 2023.

  24. arXiv:2311.16282  [pdf, ps, other

    eess.SY

    Control of the Power Flows of a Stochastic Power System

    Authors: Zhen Wang, Kaihua Xi, Aijie Cheng, Hai Xiang Lin, Jan H. van Schuppen

    Abstract: How to determine the power supply of a power system to guarantee that the state remains during a short horizon in a critical subset of the state set? The critical subset is related to the power flows of all power lines of a power system and to transient stability. The control objective is to minimize a cost function. That function is defined as the maximal power flow over all power lines, includin… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: A supplement with 20 pages, 5 figures, 43 tables has been added to the original manuscript

  25. arXiv:2311.07134  [pdf, other

    cs.IT eess.SP

    Performance Analysis of Integrated Data and Energy Transfer Assisted by Fluid Antenna Systems

    Authors: Xiao Lin, Halvin Yang, Yizhe Zhao, Jie Hu, Kai-Kit Wong

    Abstract: Fluid antenna multiple access (FAMA) is capable of exploiting the high spatial diversity of wireless channels to mitigate multi-user interference via flexible port switching, which achieves a better performance than traditional multi-input-multi-output (MIMO) systems. Moreover, integrated data and energy transfer (IDET) is able to provide both the wireless data transfer (WDT) and wireless energy t… ▽ More

    Submitted 7 February, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Accepted by IEEE ICC 2024

  26. arXiv:2310.09071  [pdf, other

    cs.LG eess.SY

    Online Relocating and Matching of Ride-Hailing Services: A Model-Based Modular Approach

    Authors: Chang Gao, Xi Lin, Fang He, Xindi Tang

    Abstract: This study proposes an innovative model-based modular approach (MMA) to dynamically optimize order matching and vehicle relocation in a ride-hailing platform. MMA utilizes a two-layer and modular modeling structure. The upper layer determines the spatial transfer patterns of vehicle flow within the system to maximize the total revenue of the current and future stages. With the guidance provided by… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  27. arXiv:2310.03344  [pdf, other

    cs.RO eess.SY

    Generalized Benders Decomposition with Continual Learning for Hybrid Model Predictive Control in Dynamic Environment

    Authors: Xuan Lin

    Abstract: Hybrid model predictive control (MPC) with both continuous and discrete variables is widely applicable to robotic control tasks, especially those involving contact with the environment. Due to the combinatorial complexity, the solving speed of hybrid MPC can be insufficient for real-time applications. In this paper, we proposed a hybrid MPC solver based on Generalized Benders Decomposition (GBD) w… ▽ More

    Submitted 10 October, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: A correction of the author name in the metadata

  28. arXiv:2309.16161  [pdf, other

    eess.SY cs.AI cs.MA cs.RO math.OC

    Leveraging Untrustworthy Commands for Multi-Robot Coordination in Unpredictable Environments: A Bandit Submodular Maximization Approach

    Authors: Zirui Xu, Xiaofeng Lin, Vasileios Tzoumas

    Abstract: We study the problem of multi-agent coordination in unpredictable and partially-observable environments with untrustworthy external commands. The commands are actions suggested to the robots, and are untrustworthy in that their performance guarantees, if any, are unknown. Such commands may be generated by human operators or machine learning algorithms and, although untrustworthy, can often increas… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

  29. arXiv:2309.06803  [pdf, other

    eess.SY

    A Critical Escape Probability Formulation for Enhancing the Transient Stability of Power Systems with System Parameter Design

    Authors: Xian Wu, Kaihua Xi, Aijie Cheng, Chenghui Zhang, Hai Xiang Lin

    Abstract: For the enhancement of the transient stability of power systems, the key is to define a quantitative optimization formulation with system parameters as decision variables. In this paper, we model the disturbances by Gaussian noise and define a metric named Critical Escape Probability (CREP) based on the invariant probability measure of a linearised stochastic processes. CREP characterizes the prob… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: 15 pages, 4 figures, 2 tables

  30. arXiv:2309.05674  [pdf, other

    eess.IV cs.CV

    ConvFormer: Plug-and-Play CNN-Style Transformers for Improving Medical Image Segmentation

    Authors: Xian Lin, Zengqiang Yan, Xianbo Deng, Chuansheng Zheng, Li Yu

    Abstract: Transformers have been extensively studied in medical image segmentation to build pairwise long-range dependence. Yet, relatively limited well-annotated medical image data makes transformers struggle to extract diverse global features, resulting in attention collapse where attention maps become similar or even identical. Comparatively, convolutional neural networks (CNNs) have better convergence p… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: Accepted by MICCAI 2023

  31. Slimmed optical neural networks with multiplexed neuron sets and a corresponding backpropagation training algorithm

    Authors: Yi-Feng Liu, Rui-Yao Ren, Dai-Bao Hou, Hai-Zhong Weng, Bo-Wen Wang, Ke-Jie Huang, Xing Lin, Feng Liu, Chen-Hui Li, Chao-Yuan Jin

    Abstract: Due to their intrinsic capabilities on parallel signal processing, optical neural networks (ONNs) have attracted extensive interests recently as a potential alternative to electronic artificial neural networks (ANNs) with reduced power consumption and low latency. Preliminary confirmation of the parallelism in optical computing has been widely done by applying the technology of wavelength division… ▽ More

    Submitted 13 December, 2023; v1 submitted 27 August, 2023; originally announced August 2023.

    Journal ref: Liu YF, Ren RY, Hou DB, Weng HZ, Wang BW, Huang KJ, Lin X, Liu F, Li CH, Jin CY. Slimmed Optical Neural Networks with Multiplexed Neuron Sets and a Corresponding Backpropagation Training Algorithm. Intell. Comput. 2024;3:Article 0070

  32. arXiv:2308.06776  [pdf, other

    eess.IV cs.CV

    Unsupervised Image Denoising in Real-World Scenarios via Self-Collaboration Parallel Generative Adversarial Branches

    Authors: Xin Lin, Chao Ren, Xiao Liu, Jie Huang, Yinjie Lei

    Abstract: Deep learning methods have shown remarkable performance in image denoising, particularly when trained on large-scale paired datasets. However, acquiring such paired datasets for real-world scenarios poses a significant challenge. Although unsupervised approaches based on generative adversarial networks offer a promising solution for denoising without paired datasets, they are difficult in surpassi… ▽ More

    Submitted 13 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023

  33. arXiv:2308.05315  [pdf

    cs.NI eess.SP

    An Overview of the 3GPP Study on Artificial Intelligence for 5G New Radio

    Authors: Xingqin Lin

    Abstract: Air interface is a fundamental component within any wireless communication system. In Release 18, the 3rd Generation Partnership Project (3GPP) delves into the possibilities of leveraging artificial intelligence (AI)/machine learning (ML) to improve the performance of the fifth-generation (5G) New Radio (NR) air interface. This endeavor marks a pioneering stride within 3GPP's journey in shaping wi… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 7 pages, 5 figures, submitted for possible publication

  34. arXiv:2307.06742  [pdf, other

    eess.SY cs.AI cs.LG

    Vehicle Dispatching and Routing of On-Demand Intercity Ride-Pooling Services: A Multi-Agent Hierarchical Reinforcement Learning Approach

    Authors: Jinhua Si, Fang He, Xi Lin, Xindi Tang

    Abstract: The integrated development of city clusters has given rise to an increasing demand for intercity travel. Intercity ride-pooling service exhibits considerable potential in upgrading traditional intercity bus services by implementing demand-responsive enhancements. Nevertheless, its online operations suffer the inherent complexities due to the coupling of vehicle resource allocation among cities and… ▽ More

    Submitted 20 March, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

  35. arXiv:2306.07820  [pdf, other

    eess.AS cs.LG cs.SD

    Unsupervised speech enhancement with deep dynamical generative speech and noise models

    Authors: Xiaoyu Lin, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda

    Abstract: This work builds on a previous work on unsupervised speech enhancement using a dynamical variational autoencoder (DVAE) as the clean speech model and non-negative matrix factorization (NMF) as the noise model. We propose to replace the NMF noise model with a deep dynamical generative model (DDGM) depending either on the DVAE latent variables, or on the noisy observations, or on both. This DDGM can… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  36. arXiv:2305.12795  [pdf, other

    eess.SY cs.AI cs.MA cs.RO math.OC

    Bandit Submodular Maximization for Multi-Robot Coordination in Unpredictable and Partially Observable Environments

    Authors: Zirui Xu, Xiaofeng Lin, Vasileios Tzoumas

    Abstract: We study the problem of multi-agent coordination in unpredictable and partially observable environments, that is, environments whose future evolution is unknown a priori and that can only be partially observed. We are motivated by the future of autonomy that involves multiple robots coordinating actions in dynamic, unstructured, and partially observable environments to complete complex tasks such… ▽ More

    Submitted 26 May, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted to RSS 2023

  37. arXiv:2305.12708  [pdf, other

    eess.AS cs.SD

    ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer

    Authors: Huadai Liu, Rongjie Huang, Xuan Lin, Wenqiang Xu, Maozong Zheng, Hong Chen, Jinzheng He, Zhou Zhao

    Abstract: Text-to-speech(TTS) has undergone remarkable improvements in performance, particularly with the advent of Denoising Diffusion Probabilistic Models (DDPMs). However, the perceived quality of audio depends not solely on its content, pitch, rhythm, and energy, but also on the physical environment. In this work, we propose ViT-TTS, the first visual TTS model with scalable diffusion transformers. ViT-T… ▽ More

    Submitted 21 April, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted by EMNLP 2023

  38. arXiv:2305.09588  [pdf

    eess.SP cs.NI

    Hardware Acceleration for Open Radio Access Networks: A Contemporary Overview

    Authors: Lopamudra Kundu, Xingqin Lin, Elena Agostini, Vikrama Ditya

    Abstract: Radio access networks (RAN) are going through a paradigm shift towards interoperable, intelligent, software-defined, and cloud-native open RAN solutions. A key challenge towards the adoption and deployment of open RAN at scale is performance. Hence, it is critical to leverage the power of hardware acceleration to offload compute-heavy RAN workloads to specialized hardware devices to enable acceler… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 8 pages, 6 figures

  39. arXiv:2305.03997  [pdf, other

    eess.IV cs.CV

    Dual Degradation Representation for Joint Deraining and Low-Light Enhancement in the Dark

    Authors: Xin Lin, Jingtong Yue, Sixian Ding, Chao Ren, Lu Qi, Ming-Hsuan Yang

    Abstract: Rain in the dark poses a significant challenge to deploying real-world applications such as autonomous driving, surveillance systems, and night photography. Existing low-light enhancement or deraining methods struggle to brighten low-light conditions and remove rain simultaneously. Additionally, cascade approaches like ``deraining followed by low-light enhancement'' or the reverse often result in… ▽ More

    Submitted 17 June, 2024; v1 submitted 6 May, 2023; originally announced May 2023.

  40. arXiv:2304.09783  [pdf, other

    eess.IV cs.CV

    Application of attention-based Siamese composite neural network in medical image recognition

    Authors: Zihao Huang, Yue Wang, Weixing Xin, Xingtong Lin, Huizhen Li, Haowen Chen, Yizhen Lao, Xia Chen

    Abstract: Medical image recognition often faces the problem of insufficient data in practical applications. Image recognition and processing under few-shot conditions will produce overfitting, low recognition accuracy, low reliability and insufficient robustness. It is often the case that the difference of characteristics is subtle, and the recognition is affected by perspectives, background, occlusion and… ▽ More

    Submitted 15 March, 2024; v1 submitted 19 April, 2023; originally announced April 2023.

  41. arXiv:2303.11020  [pdf, other

    cs.SD cs.AI eess.AS

    DS-TDNN: Dual-stream Time-delay Neural Network with Global-aware Filter for Speaker Verification

    Authors: Yangfu Li, Jiapan Gan, Xiaodan Lin

    Abstract: Conventional time-delay neural networks (TDNNs) struggle to handle long-range context, their ability to represent speaker information is therefore limited in long utterances. Existing solutions either depend on increasing model complexity or try to balance between local features and global context to address this issue. To effectively leverage the long-term dependencies of audio signals and constr… ▽ More

    Submitted 1 August, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: 13 pages 4 figures

    MSC Class: 68 ACM Class: I.2.1

  42. arXiv:2303.09404  [pdf, other

    eess.AS cs.LG cs.SD

    Speech Modeling with a Hierarchical Transformer Dynamical VAE

    Authors: Xiaoyu Lin, Xiaoyu Bie, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda

    Abstract: The dynamical variational autoencoders (DVAEs) are a family of latent-variable deep generative models that extends the VAE to model a sequence of observed data and a corresponding sequence of latent vectors. In almost all the DVAEs of the literature, the temporal dependencies within each sequence and across the two sequences are modeled with recurrent neural networks. In this paper, we propose to… ▽ More

    Submitted 10 May, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

  43. arXiv:2303.07456  [pdf, other

    cs.IT cs.NI eess.SP

    5G-Advanced Towards 6G: Past, Present, and Future

    Authors: Wanshi Chen, Xingqin Lin, Juho Lee, Antti Toskala, Shu Sun, Carla Fabiana Chiasserini, Lingjia Liu

    Abstract: Since the start of 5G work in 3GPP in early 2016, tremendous progress has been made in both standardization and commercial deployments. 3GPP is now entering the second phase of 5G standardization, known as 5G-Advanced, built on the 5G baseline in 3GPP Releases 15, 16, and 17. 3GPP Release 18, the start of 5G-Advanced, includes a diverse set of features that cover both device and network evolutions… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: IEEE Journal on Selected Areas in Communications, Special Issue on 3GPP Technologies: 5G-Advanced and Beyond, Editorial/Tutorial Paper

  44. arXiv:2302.06326  [pdf, other

    eess.SY

    Explicit formulas for the Variance of the State of a Linearized Power System driven by Gaussian stochastic disturbances

    Authors: Xian Wu, Kaihua Xi, Aijie Cheng, Hai Xiang Lin, Jan H van Schuppen, Chenghui Zhang

    Abstract: We look into the fluctuations caused by disturbances in power systems. In the linearized system of the power systems, the disturbance is modeled by a Brownian motion process, and the fluctuations are described by the covariance matrix of the associated stochastic process at the invariant probability distribution. We derive explicit formulas for the covariance matrix for the system with a uniform d… ▽ More

    Submitted 16 March, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: 34 pages,6 figures

  45. arXiv:2301.10167  [pdf, other

    eess.SP cs.LG physics.optics

    EEG Opto-processor: epileptic seizure detection using diffractive photonic computing units

    Authors: Tao Yan, Maoqi Zhang, Sen Wan, Kaifeng Shang, Haiou Zhang, Xun Cao, Xing Lin, Qionghai Dai

    Abstract: Electroencephalography (EEG) analysis extracts critical information from brain signals, which has provided fundamental support for various applications, including brain-disease diagnosis and brain-computer interface. However, the real-time processing of large-scale EEG signals at high energy efficiency has placed great challenges for electronic processors on edge computing devices. Here, we propos… ▽ More

    Submitted 9 December, 2022; originally announced January 2023.

    Comments: 22 pages, 5 figures

  46. arXiv:2301.01703  [pdf, other

    cs.IT eess.SP

    Technology Trends for Massive MIMO towards 6G

    Authors: Yiming Huo, Xingqin Lin, Boya Di, Hongliang Zhang, Francisco Javier Lorca Hernando, Ahmet Serdar Tan, Shahid Mumtaz, Özlem Tuğfe Demir, Kun Chen-Hu

    Abstract: At the dawn of the next-generation wireless systems and networks, massive multiple-input multiple-output (MIMO) has been envisioned as one of the enabling technologies. With the continued success of being applied in the 5G and beyond, the massive MIMO technology has demonstrated its advantageousness, integrability, and extendibility. Moreover, several evolutionary features and revolutionizing tren… ▽ More

    Submitted 5 January, 2023; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: 7 pages, 5 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  47. arXiv:2301.01420  [pdf

    cs.MM eess.IV

    Improved CNN Prediction Based Reversible Data Hiding

    Authors: Yingqiang Qiu, Wanli Peng, Xiaodan Lin, Huanqiang Zeng, Zhenxing Qian

    Abstract: This letter proposes an improved CNN predictor (ICNNP) for reversible data hiding (RDH) in images, which consists of a feature extraction module, a pixel prediction module, and a complexity prediction module. Due to predicting the complexity of each pixel with the ICNNP during the embedding process, the proposed method can achieve superior performance than the CNN predictor-based method. Specifica… ▽ More

    Submitted 3 January, 2023; originally announced January 2023.

  48. arXiv:2211.10040  [pdf, ps, other

    eess.SP

    DASECount: Domain-Agnostic Sample-Efficient Wireless Indoor Crowd Counting via Few-shot Learning

    Authors: Huawei Hou, Suzhi Bi, Lili Zheng, Xiaohui Lin, Yuan Wu, Zhi Quan

    Abstract: Accurate indoor crowd counting (ICC) is a key enabler to many smart home/office applications. In this paper, we propose a Domain-Agnostic and Sample-Efficient wireless indoor crowd Counting (DASECount) framework that suffices to attain robust cross-domain detection accuracy given very limited data samples in new domains. DASECount leverages the wisdom of few-shot learning (FSL) paradigm consisting… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: 12 pages, 10 figures. The paper has been submitted for journal publication

  49. arXiv:2210.05741  [pdf

    eess.SY

    Road Slope Prediction and Vehicle Dynamics Control for Autonomous Vehicles

    Authors: Gautam Shetty, Sabir Hossain, Chuan Hu, Xianke Lin

    Abstract: Autonomous vehicles can enhance overall performance and implement safety measures in ways that are impossible with conventional automobiles. These functions are executed through vehicle control systems, which have been the subject of considerable research. Autonomous cars have a distinct advantage as they possess various perception sensors that can predict road surface conditions and other phenome… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: 16 pages, 15 figures

  50. arXiv:2210.02731   

    cs.SD cs.LG eess.AS

    PSVRF: Learning to restore Pitch-Shifted Voice without reference

    Authors: Yangfu Li, Xiaodan Lin, Jiaxin Yang

    Abstract: Pitch scaling algorithms have a significant impact on the security of Automatic Speaker Verification (ASV) systems. Although numerous anti-spoofing algorithms have been proposed to identify the pitch-shifted voice and even restore it to the original version, they either have poor performance or require the original voice as a reference, limiting the prospects of applications. In this paper, we pro… ▽ More

    Submitted 13 March, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: Have some errors