Search | arXiv e-print repository
Showing 1–50 of 80 results for author: Lu, W

Searching in archive eess. Search in all archives.
  1. arXiv:2407.08498  [pdf, other]

    cs.CV eess.IV

    ERD: Exponential Retinex decomposition based on weak space and hybrid nonconvex regularization and its denoising application

    Authors: Wenjing Lu, Liang Wu, Liming Tang, Zhuang Fang

    Abstract: The Retinex theory models the image as a product of illumination and reflection components, which has received extensive attention and is widely used in image enhancement, segmentation and color restoration. However, it has been rarely used in additive noise removal due to the inclusion of both multiplication and addition operations in the Retinex noisy image modeling. In this paper, we propose an…

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.05368  [pdf, other]

    cs.SD cs.AI cs.IR eess.AS

    Music Era Recognition Using Supervised Contrastive Learning and Artist Information

    Authors: Qiqi He, Xuchen Song, Weituo Hao, Ju-Chiang Wang, Wei-Tsung Lu, Wei Li

    Abstract: Does popular music from the 60s sound different than that of the 90s? Prior study has shown that there would exist some variations of patterns and regularities related to instrumentation changes and growing loudness across multi-decadal trends. This indicates that perceiving the era of a song from musical features such as audio and artist information is possible. Music era information can be an im…

    Submitted 7 July, 2024; originally announced July 2024.

  3. arXiv:2406.15222  [pdf]

    eess.IV cs.AI cs.CV

    Rapid and Accurate Diagnosis of Acute Aortic Syndrome using Non-contrast CT: A Large-scale, Retrospective, Multi-center and AI-based Study

    Authors: Yujian Hu, Yilang Xiang, Yan-Jie Zhou, Yangyan He, Shifeng Yang, Xiaolong Du, Chunlan Den, Youyao Xu, Gaofeng Wang, Zhengyao Ding, Jingyong Huang, Wenjun Zhao, Xuejun Wu, Donglin Li, Qianqian Zhu, Zhenjiang Li, Chenyang Qiu, Ziheng Wu, Yunjun He, Chen Tian, Yihui Qiu, Zuodong Lin, Xiaolong Zhang, Yuan He, Zhenpeng Yuan, et al. (15 additional authors not shown)

    Abstract: Chest pain symptoms are highly prevalent in emergency departments (EDs), where acute aortic syndrome (AAS) is a catastrophic cardiovascular emergency with a high fatality rate, especially when timely and accurate treatment is not administered. However, current triage practices in the ED can cause up to approximately half of patients with AAS to have an initially missed diagnosis or be misdiagnosed…

    Submitted 24 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: under peer review

  4. arXiv:2406.10956  [pdf, other]

    cs.SD cs.LG eess.AS

    Robust Channel Learning for Large-Scale Radio Speaker Verification

    Authors: Wenhao Yang, Jianguo Wei, Wenhuan Lu, Lei Li, Xugang Lu

    Abstract: Recent research in speaker verification has increasingly focused on achieving robust and reliable recognition under challenging channel conditions and noisy environments. Identifying speakers in radio communications is particularly difficult due to inherent limitations such as constrained bandwidth and pervasive noise interference. To address this issue, we present a Channel Robust Speaker Learnin…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 12 pages, 11 figures

  5. arXiv:2404.13298  [pdf, other]

    cs.IR eess.SY

    MARec: Metadata Alignment for cold-start Recommendation

    Authors: Julien Monteil, Volodymyr Vaskovych, Wentao Lu, Anirban Majumder, Anton van den Hengel

    Abstract: For many recommender systems the primary data source is a historical record of user clicks. The associated click matrix is often very sparse, however, as the number of users x products can be far larger than the number of clicks, and such sparsity is accentuated in cold-start settings. The sparsity of the click matrix is the reason matrix factorization and autoencoder techniques remain high…

    Submitted 20 April, 2024; originally announced April 2024.

  6. arXiv:2404.02291  [pdf, other]

    cs.CR eess.SY

    Towards a New Configurable and Practical Remote Automotive Security Testing Platform

    Authors: Sekar Kulandaivel, Wenjuan Lu, Brandon Barry, Jorge Guajardo

    Abstract: In the automotive security sector, the absence of a testing platform that is configurable, practical, and user-friendly presents considerable challenges. These difficulties are compounded by the intricate design of vehicle systems, the rapid evolution of attack vectors, and the absence of standardized testing methodologies. We propose a next-generation testing platform that addresses several chall…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 7 pages, 2 figures

  7. arXiv:2402.16371  [pdf, other]

    eess.IV

    Adaptive Online Learning of Separable Path Graph Transforms for Intra-prediction

    Authors: Wen-Yang Lu, Eduardo Pavez, Antonio Ortega, Xin Zhao, Shan Liu

    Abstract: Current video coding standards, including H.264/AVC, HEVC, and VVC, employ discrete cosine transform (DCT), discrete sine transform (DST), and secondary Karhunen-Loeve transforms (KLTs) to decorrelate the intra-prediction residuals. However, the efficiency of these transforms in decorrelation can be limited when the signal has a non-smooth and non-periodic structure, such as those occurring in tex…

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 5 pages, 4 figures

  8. arXiv:2402.09463  [pdf]

    eess.IV

    Multi-Center Fetal Brain Tissue Annotation (FeTA) Challenge 2022 Results

    Authors: Kelly Payette, Céline Steger, Roxane Licandro, Priscille de Dumast, Hongwei Bran Li, Matthew Barkovich, Liu Li, Maik Dannecker, Chen Chen, Cheng Ouyang, Niccolò McConnell, Alina Miron, Yongmin Li, Alena Uus, Irina Grigorescu, Paula Ramirez Gilliland, Md Mahfuzur Rahman Siddiquee, Daguang Xu, Andriy Myronenko, Haoyu Wang, Ziyan Huang, Jin Ye, Mireia Alenyà, Valentin Comte, Oscar Camara, et al. (42 additional authors not shown)

    Abstract: Segmentation is a critical step in analyzing the developing human fetal brain. There have been vast improvements in automatic segmentation methods in the past several years, and the Fetal Brain Tissue Annotation (FeTA) Challenge 2021 helped to establish an excellent standard of fetal brain segmentation. However, FeTA 2021 was a single center study, and the generalizability of algorithms across dif…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Results from FeTA Challenge 2022, held at MICCAI; Manuscript submitted. Supplementary Info (including submission methods descriptions) available here: https://zenodo.org/records/10628648

  9. arXiv:2402.03585  [pdf, other]

    cs.CV eess.IV

    Decoder-Only Image Registration

    Authors: Xi Jia, Wenqi Lu, Xinxing Cheng, Jinming Duan

    Abstract: In unsupervised medical image registration, the predominant approaches involve the utilization of an encoder-decoder network architecture, allowing for precise prediction of dense, full-resolution displacement fields from given paired images. Despite its widespread use in the literature, we argue for the necessity of making both the encoder and decoder learnable in such an architecture. For this, w…

    Submitted 5 February, 2024; originally announced February 2024.

  10. arXiv:2310.01809  [pdf, other]

    cs.SD eess.AS

    Mel-Band RoFormer for Music Source Separation

    Authors: Ju-Chiang Wang, Wei-Tsung Lu, Minz Won

    Abstract: Recently, multi-band spectrogram-based approaches such as Band-Split RNN (BSRNN) have demonstrated promising results for music source separation. In our recent work, we introduce the BS-RoFormer model which inherits the idea of band-split scheme in BSRNN at the front-end, and then uses the hierarchical Transformer with Rotary Position Embedding (RoPE) to model the inner-band and inter-band sequenc…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: submitted as an ISMIR 2023 late-breaking and demo paper

  11. arXiv:2309.02612  [pdf, other]

    cs.SD eess.AS

    Music Source Separation with Band-Split RoPE Transformer

    Authors: Wei-Tsung Lu, Ju-Chiang Wang, Qiuqiang Kong, Yun-Ning Hung

    Abstract: Music source separation (MSS) aims to separate a music recording into multiple musically distinct stems, such as vocals, bass, drums, and more. Recently, deep learning approaches such as convolutional neural networks (CNNs) and recurrent neural networks (RNNs) have been used, but the improvement is still limited. In this paper, we propose a novel frequency-domain approach based on a Band-Split RoP…

    Submitted 9 September, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: This paper explains the SAMI-ByteDance MSS system submitted to Sound Demixing Challenge (SDX23) Music Separation Track. Version 2 of paper fixed some typos

  12. arXiv:2308.04922  [pdf]

    eess.IV physics.med-ph physics.optics

    HSD-PAM: High Speed Super Resolution Deep Penetration Photoacoustic Microscopy Imaging Boosted by Dual Branch Fusion Network

    Authors: Zhengyuan Zhang, Haoran Jin, Zesheng Zheng, Wenwen Zhang, Wenhao Lu, Feng Qin, Arunima Sharma, Manojit Pramanik, Yuanjin Zheng

    Abstract: Photoacoustic microscopy (PAM) is a novel implementation of photoacoustic imaging (PAI) for visualizing the 3D bio-structure, which is realized by raster scanning of the tissue. However, the three critical imaging parameters involved (imaging speed, lateral resolution, and penetration depth) affect one another. The improvement of one parameter results in the degradation of other two…

    Submitted 9 August, 2023; originally announced August 2023.

  13. arXiv:2308.02282  [pdf, other]

    cs.LG cs.AI eess.SP

    DIVERSIFY: A General Framework for Time Series Out-of-distribution Detection and Generalization

    Authors: Wang Lu, Jindong Wang, Xinwei Sun, Yiqiang Chen, Xiangyang Ji, Qiang Yang, Xing Xie

    Abstract: Time series remains one of the most challenging modalities in machine learning research. The out-of-distribution (OOD) detection and generalization on time series tend to suffer due to its non-stationary property, i.e., the distribution changes over time. The dynamic distributions inside time series pose great challenges to existing algorithms to identify invariant distributions since they mainly…

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: Journal version of arXiv:2209.07027; 17 pages

  14. arXiv:2307.02997  [pdf, other]

    eess.IV cs.CV

    Fourier-Net+: Leveraging Band-Limited Representation for Efficient 3D Medical Image Registration

    Authors: Xi Jia, Alexander Thorley, Alberto Gomez, Wenqi Lu, Dipak Kotecha, Jinming Duan

    Abstract: U-Net style networks are commonly utilized in unsupervised image registration to predict dense displacement fields, which for high-resolution volumetric image data is a resource-intensive and time-consuming task. To tackle this challenge, we first propose Fourier-Net, which replaces the costly U-Net style expansive path with a parameter-free model-driven decoder. Instead of directly predicting a f…

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: Under review. arXiv admin note: text overlap with arXiv:2211.16342

  15. arXiv:2306.10785  [pdf, other]

    cs.SD cs.LG eess.AS

    Multitrack Music Transcription with a Time-Frequency Perceiver

    Authors: Wei-Tsung Lu, Ju-Chiang Wang, Yun-Ning Hung

    Abstract: Multitrack music transcription aims to transcribe a music audio input into the musical notes of multiple instruments simultaneously. It is a very challenging task that typically requires a more complex model to achieve satisfactory result. In addition, prior works mostly focus on transcriptions of regular instruments, however, neglecting vocals, which are usually the most important signal source i…

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: ICASSP 2023

  16. arXiv:2305.08712  [pdf, ps, other]

    math.OC eess.SY

    Model Predictive Control with Reach-avoid Analysis

    Authors: Dejin Ren, Wanli Lu, Jidong Lv, Lijun Zhang, Bai Xue

    Abstract: In this paper we investigate the optimal controller synthesis problem, so that the system under the controller can reach a specified target set while satisfying given constraints. Existing model predictive control (MPC) methods learn from a set of discrete states visited by previous (sub-)optimized trajectories and thus result in computationally expensive mixed-integer nonlinear optimization. In t…

    Submitted 21 June, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

  17. arXiv:2305.05548  [pdf, ps, other]

    eess.SP cs.LG

    CIT-EmotionNet: CNN Interactive Transformer Network for EEG Emotion Recognition

    Authors: Wei Lu, Hua Ma, Tien-Ping Tan

    Abstract: Emotion recognition using Electroencephalogram (EEG) signals has emerged as a significant research challenge in affective computing and intelligent interaction. However, effectively combining global and local features of EEG signals to improve performance in emotion recognition is still a difficult task. In this study, we propose a novel CNN Interactive Transformer Network for EEG Emotion Recognit…

    Submitted 7 May, 2023; originally announced May 2023.

    Comments: 10 pages, 3 tables

  18. arXiv:2303.10770  [pdf, other]

    cs.CV cs.AI eess.IV

    RN-Net: Reservoir Nodes-Enabled Neuromorphic Vision Sensing Network

    Authors: Sangmin Yoo, Eric Yeu-Jer Lee, Ziyu Wang, Xinxin Wang, Wei D. Lu

    Abstract: Event-based cameras are inspired by the sparse and asynchronous spike representation of the biological visual system. However, processing the event data requires either using expensive feature descriptors to transform spikes into frames, or using spiking neural networks that are expensive to train. In this work, we propose a neural network architecture, Reservoir Nodes-enabled neuromorphic vision…

    Submitted 24 May, 2024; v1 submitted 19 March, 2023; originally announced March 2023.

    Comments: 12 pages, 5 figures, 4 tables

  19. arXiv:2302.08715  [pdf, other]

    cs.CV eess.IV

    EEP-3DQA: Efficient and Effective Projection-based 3D Model Quality Assessment

    Authors: Zicheng Zhang, Wei Sun, Yingjie Zhou, Wei Lu, Yucheng Zhu, Xiongkuo Min, Guangtao Zhai

    Abstract: Currently, great numbers of efforts have been put into improving the effectiveness of 3D model quality assessment (3DQA) methods. However, little attention has been paid to the computational costs and inference time, which is also important for practical applications. Unlike 2D media, 3D models are represented by more complicated and irregular digital formats, such as point cloud and mesh. Thus it…

    Submitted 27 August, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

  20. arXiv:2212.10901  [pdf, other]

    cs.SD cs.CL cs.IR cs.MM eess.AS

    ALCAP: Alignment-Augmented Music Captioner

    Authors: Zihao He, Weituo Hao, Wei-Tsung Lu, Changyou Chen, Kristina Lerman, Xuchen Song

    Abstract: Music captioning has gained significant attention in the wake of the rising prominence of streaming media platforms. Traditional approaches often prioritize either the audio or lyrics aspect of the music, inadvertently ignoring the intricate interplay between the two. However, a comprehensive understanding of music necessitates the integration of both these elements. In this study, we delve into t…

    Submitted 21 October, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

  21. arXiv:2210.08868  [pdf, other]

    eess.IV cs.CV

    Cerebrovascular Segmentation via Vessel Oriented Filtering Network

    Authors: Zhanqiang Guo, Yao Luan, Jianjiang Feng, Wangsheng Lu, Yin Yin, Guangming Yang, Jie Zhou

    Abstract: Accurate cerebrovascular segmentation from Magnetic Resonance Angiography (MRA) and Computed Tomography Angiography (CTA) is of great significance in diagnosis and treatment of cerebrovascular pathology. Due to the complexity and topology variability of blood vessels, complete and accurate segmentation of vascular network is still a challenge. In this paper, we proposed a Vessel Oriented Filtering…

    Submitted 17 October, 2022; originally announced October 2022.

  22. arXiv:2208.04939  [pdf, other]

    eess.IV cs.CV

    U-Net vs Transformer: Is U-Net Outdated in Medical Image Registration?

    Authors: Xi Jia, Joseph Bartlett, Tianyang Zhang, Wenqi Lu, Zhaowen Qiu, Jinming Duan

    Abstract: Due to their extreme long-range modeling capability, vision transformer-based networks have become increasingly popular in deformable image registration. We believe, however, that the receptive field of a 5-layer convolutional U-Net is sufficient to capture accurate deformations without needing long-range dependencies. The purpose of this study is therefore to investigate whether U-Net-based metho…

    Submitted 13 August, 2022; v1 submitted 7 August, 2022; originally announced August 2022.

    Comments: Accepted to MICCAI-MLMI 2022

  23. arXiv:2207.12027  [pdf, other]

    eess.SY

    Non-cascaded Control Barrier Functions for the Safe Control of Quadrotors

    Authors: Weifeng Zeng, Huanhui Cao, Wenjie Lu, Hao Xiong

    Abstract: Researchers have developed various cascaded controllers and non-cascaded controllers for the navigation and control of quadrotors in recent years. It is vital to ensure the safety of a quadrotor both in normal state and in abnormal state if a controller tends to make the quadrotor unsafe. To this end, this paper proposes a non-cascaded Control Barrier Function (CBF) for a quadrotor controlled by e…

    Submitted 25 July, 2022; originally announced July 2022.

  24. arXiv:2207.00842  [pdf, other]

    eess.SY

    Safe Reinforcement Learning for a Robot Being Pursued but with Objectives Covering More Than Capture-avoidance

    Authors: Huanhui Cao, Zhiyuan Cai, Hairuo Wei, Wenjie Lu, Lin Zhang, Hao Xiong

    Abstract: Reinforcement Learning (RL) algorithms show amazing performance in recent years, but placing RL in real-world applications such as self-driven vehicles may suffer safety problems. A self-driven vehicle moving to a target position following a learned policy may suffer a vehicle with unpredictable aggressive behaviors or even being pursued by a vehicle following a Nash strategy. To address the safet…

    Submitted 2 July, 2022; originally announced July 2022.

  25. arXiv:2206.05054  [pdf, other]

    eess.IV cs.CV

    A No-reference Quality Assessment Metric for Point Cloud Based on Captured Video Sequences

    Authors: Yu Fan, Zicheng Zhang, Wei Sun, Xiongkuo Min, Wei Lu, Tao Wang, Ning Liu, Guangtao Zhai

    Abstract: Point cloud is one of the most widely used digital formats of 3D models, the visual quality of which is quite sensitive to distortions such as downsampling, noise, and compression. To tackle the challenge of point cloud quality assessment (PCQA) in scenarios where reference is not available, we propose a no-reference quality assessment metric for colored point cloud based on captured video sequenc…

    Submitted 20 September, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: Accepted to IEEE 24th International Workshop on Multimedia Signal Processing, 2022

  26. arXiv:2206.04289  [pdf, other]

    eess.IV cs.CV

    A No-Reference Deep Learning Quality Assessment Method for Super-resolution Images Based on Frequency Maps

    Authors: Zicheng Zhang, Wei Sun, Xiongkuo Min, Wenhan Zhu, Tao Wang, Wei Lu, Guangtao Zhai

    Abstract: To support the application scenarios where high-resolution (HR) images are urgently needed, various single image super-resolution (SISR) algorithms are developed. However, SISR is an ill-posed inverse problem, which may bring artifacts like texture shift, blur, etc. to the reconstructed images, thus it is necessary to evaluate the quality of super-resolution images (SRIs). Note that most existing…

    Submitted 9 June, 2022; originally announced June 2022.

  27. arXiv:2205.14701  [pdf, other]

    cs.SD eess.AS

    Modeling Beats and Downbeats with a Time-Frequency Transformer

    Authors: Yun-Ning Hung, Ju-Chiang Wang, Xuchen Song, Wei-Tsung Lu, Minz Won

    Abstract: Transformer is a successful deep neural network (DNN) architecture that has shown its versatility not only in natural language processing but also in music information retrieval (MIR). In this paper, we present a novel Transformer-based approach to tackle beat and downbeat tracking. This approach employs SpecTNT (Spectral-Temporal Transformer in Transformer), a variant of Transformer that models b…

    Submitted 29 May, 2022; originally announced May 2022.

    Comments: This paper is accepted for publication at ICASSP 2022

  28. arXiv:2205.09107  [pdf]

    eess.IV cs.AI cs.LG

    Leveraging Global Binary Masks for Structure Segmentation in Medical Images

    Authors: Mahdieh Kazemimoghadam, Zi Yang, Lin Ma, Mingli Chen, Weiguo Lu, Xuejun Gu

    Abstract: Deep learning (DL) models for medical image segmentation are highly influenced by intensity variations of input images and lack generalization due to primarily utilizing pixels' intensity information for inference. Acquiring sufficient training data is another challenge limiting models' applications. We proposed to leverage the consistency of organs' anatomical shape and position information in me…

    Submitted 24 August, 2023; v1 submitted 13 May, 2022; originally announced May 2022.

  29. Self-supervised Assisted Active Learning for Skin Lesion Segmentation

    Authors: Ziyuan Zhao, Wenjing Lu, Zeng Zeng, Kaixin Xu, Bharadwaj Veeravalli, Cuntai Guan

    Abstract: Label scarcity has been a long-standing issue for biomedical image segmentation, due to high annotation costs and professional requirements. Recently, active learning (AL) strategies strive to reduce annotation costs by querying a small portion of data for annotation, receiving much traction in the field of medical imaging. However, most of the existing AL methods have to initialize models with so…

    Submitted 14 May, 2022; originally announced May 2022.

    Comments: Accepted by the 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2022)

    Journal ref: 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)

  30. arXiv:2204.14047  [pdf, other]

    cs.CV cs.MM eess.IV

    A Deep Learning based No-reference Quality Assessment Model for UGC Videos

    Authors: Wei Sun, Xiongkuo Min, Wei Lu, Guangtao Zhai

    Abstract: Quality assessment for User Generated Content (UGC) videos plays an important role in ensuring the viewing experience of end-users. Previous UGC video quality assessment (VQA) studies either use the image recognition model or the image quality assessment (IQA) models to extract frame-level features of UGC videos for quality regression, which are regarded as the sub-optimal solutions because of the…

    Submitted 20 October, 2022; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: Accepted by ACM MM 2022

    Journal ref: Proceedings of the 30th ACM International Conference on Multimedia (2022) 856-865

  31. arXiv:2203.09098  [pdf, other]

    cs.SD cs.LG eess.AS

    TMS: A Temporal Multi-scale Backbone Design for Speaker Embedding

    Authors: Ruiteng Zhang, Jianguo Wei, Xugang Lu, Wenhuan Lu, Di Jin, Junhai Xu, Lin Zhang, Yantao Ji, Jianwu Dang

    Abstract: Speaker embedding is an important front-end module to explore discriminative speaker features for many speech applications where speaker information is needed. Current SOTA backbone networks for speaker embedding are designed to aggregate multi-scale features from an utterance with multi-branch network architectures for speaker representation. However, naively adding many branches of multi-scale f…

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: Due to the limitation "The abstract field cannot be longer than 1,920 characters", the abstract here is shorter than that in the PDF file

  32. arXiv:2203.05208  [pdf, other]

    cs.CV eess.IV

    Transferring Dual Stochastic Graph Convolutional Network for Facial Micro-expression Recognition

    Authors: Hui Tang, Li Chai, Wanli Lu

    Abstract: Micro-expression recognition has drawn increasing attention due to its wide application in lie detection, criminal detection and psychological consultation. To improve the recognition performance of the small micro-expression data, this paper presents a transferring dual stochastic Graph Convolutional Network (TDSGCN) model. We propose a stochastic graph construction method and dual graph convolut…

    Submitted 10 March, 2022; originally announced March 2022.

  33. Probabilistic load flow calculation of AC/DC hybrid system based on cumulant method

    Authors: Yinfeng Sun, Dapeng Xia, Zichun Gao, Zhenhao Wang, Guoqing Li, Weihua Lu, Xueguang Wu, Yang Li

    Abstract: The operating conditions of the power system have become more complex and changeable. This paper proposes a probabilistic load flow based on the cumulant method (PLF-CM) for the voltage sourced converter high voltage direct current (VSC-HVDC) hybrid system containing photovoltaic grid-connected systems. Firstly, the corresponding control mode is set for the converter, including droop control and m…

    Submitted 15 February, 2022; v1 submitted 29 January, 2022; originally announced January 2022.

    Journal ref: International Journal of Electrical Power & Energy Systems 139 (2022) 107998

  34. arXiv:2110.13465  [pdf, other]

    cs.SD cs.LG eess.AS

    CS-Rep: Making Speaker Verification Networks Embracing Re-parameterization

    Authors: Ruiteng Zhang, Jianguo Wei, Wenhuan Lu, Lin Zhang, Yantao Ji, Junhai Xu, Xugang Lu

    Abstract: Automatic speaker verification (ASV) systems, which determine whether two speeches are from the same speaker, mainly focus on verification accuracy while ignoring inference speed. However, in real applications, both inference speed and verification accuracy are essential. This study proposes cross-sequential re-parameterization (CS-Rep), a novel topology re-parameterization strategy for multi-type…

    Submitted 3 April, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted by ICASSP 2022

  35. arXiv:2110.12855  [pdf, other]

    cs.SD cs.HC cs.LG cs.MM eess.AS

    Actions Speak Louder than Listening: Evaluating Music Style Transfer based on Editing Experience

    Authors: Wei-Tsung Lu, Meng-Hsuan Wu, Yuh-Ming Chiu, Li Su

    Abstract: The subjective evaluation of music generation techniques has been mostly done with questionnaire-based listening tests while ignoring the perspectives from music composition, arrangement, and soundtrack editing. In this paper, we propose an editing test to evaluate users' editing experience of music generation models in a systematic way. To do this, we design a new music style transfer model combi…

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: 9 pages, Proceedings of the 29th ACM International Conference on Multimedia

  36. arXiv:2110.09127  [pdf, other]

    cs.SD cs.LG eess.AS

    SpecTNT: a Time-Frequency Transformer for Music Audio

    Authors: Wei-Tsung Lu, Ju-Chiang Wang, Minz Won, Keunwoo Choi, Xuchen Song

    Abstract: Transformers have drawn attention in the MIR field for their remarkable performance shown in natural language processing and computer vision. However, prior works in the audio processing domain mostly use Transformer as a temporal feature aggregator that acts similar to RNNs. In this paper, we propose SpecTNT, a Transformer-based architecture to model both spectral and temporal sequences of an inp…

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: 6 pages

    Journal ref: International Society for Music Information Retrieval (ISMIR) 2021

  37. arXiv:2110.09000  [pdf, other]

    eess.AS cs.SD

    Supervised Metric Learning for Music Structure Features

    Authors: Ju-Chiang Wang, Jordan B. L. Smith, Wei-Tsung Lu, Xuchen Song

    Abstract: Music structure analysis (MSA) methods traditionally search for musically meaningful patterns in audio: homogeneity, repetition, novelty, and segment-length regularity. Hand-crafted audio features such as MFCCs or chromagrams are often used to elicit these patterns. However, with more annotations of section labels (e.g., verse, chorus, and bridge) becoming available, one can use supervised feature…

    Submitted 29 April, 2022; v1 submitted 17 October, 2021; originally announced October 2021.

    Comments: This paper was accepted and presented at ISMIR 2021

  38. arXiv:2109.10863  [pdf]

    physics.soc-ph eess.SY

    A Transportation Digital-Twin Approach for Adaptive Traffic Control Systems

    Authors: Sagar Dasgupta, Mizanur Rahman, Abhay D. Lidbe, Weike Lu, Steven Jones

    Abstract: A transportation digital twin represents a digital version of a transportation physical object or process, such as a traffic signal controller, and thereby a two-way real-time data exchange between the physical twin and digital twin. This paper introduces a digital twin approach for adaptive traffic signal control (ATSC) to improve a traveler's driving experience by reducing and redistributing wai…

    Submitted 1 July, 2023; v1 submitted 19 August, 2021; originally announced September 2021.

  39. arXiv:2108.08731  [pdf]

    physics.med-ph eess.IV

    Registration-Guided Deep Learning Image Segmentation for Cone Beam CT-based Online Adaptive Radiotherapy

    Authors: Lin Ma, Weicheng Chi, Howard E. Morgan, Mu-Han Lin, Mingli Chen, David Sher, Dominic Moon, Dat T. Vo, Vladimir Avkshtol, Weiguo Lu, Xuejun Gu

    Abstract: Adaptive radiotherapy (ART), especially online ART, effectively accounts for positioning errors and anatomical changes. One key component of online ART is accurately and efficiently delineating organs at risk (OARs) and targets on online images, such as CBCT, to meet the online demands of plan evaluation and adaptation. Deep learning (DL)-based automatic segmentation has gained great success in se…

    Submitted 19 August, 2021; originally announced August 2021.

    Comments: 16 pages, 6 figures

  40. arXiv:2107.02041  [pdf, other]

    cs.CV cs.GR eess.IV

    No-Reference Quality Assessment for 3D Colored Point Cloud and Mesh Models

    Authors: Zicheng Zhang, Wei Sun, Xiongkuo Min, Tao Wang, Wei Lu, Guangtao Zhai

    Abstract: To improve the viewer's Quality of Experience (QoE) and optimize computer graphics applications, 3D model quality assessment (3D-QA) has become an important task in the multimedia area. Point cloud and mesh are the two most widely used digital representation formats of 3D models, the visual quality of which is quite sensitive to lossy operations like simplification and compression. Therefore, many…

    Submitted 2 May, 2022; v1 submitted 5 July, 2021; originally announced July 2021.

  41. arXiv:2106.13689  [pdf]

    eess.IV cs.CV cs.LG

    Semantic annotation for computational pathology: Multidisciplinary experience and best practice recommendations

    Authors: Noorul Wahab, Islam M Miligy, Katherine Dodd, Harvir Sahota, Michael Toss, Wenqi Lu, Mostafa Jahanifar, Mohsin Bilal, Simon Graham, Young Park, Giorgos Hadjigeorghiou, Abhir Bhalerao, Ayat Lashen, Asmaa Ibrahim, Ayaka Katayama, Henry O Ebili, Matthew Parkin, Tom Sorell, Shan E Ahmed Raza, Emily Hero, Hesham Eldaly, Yee Wah Tsang, Kishore Gopalakrishnan, David Snead, Emad Rakha, et al. (2 additional authors not shown)

    Abstract: Recent advances in whole slide imaging (WSI) technology have led to the development of a myriad of computer vision and artificial intelligence (AI) based diagnostic, prognostic, and predictive algorithms. Computational Pathology (CPath) offers an integrated solution to utilize information embedded in pathology WSIs beyond what we obtain through visual assessment. For automated analysis of WSIs and…

    Submitted 25 June, 2021; originally announced June 2021.

  42. arXiv:2105.09558  [pdf]

    eess.SY

    An Accelerated Stackelberg Game Approach for Distributed Energy Resource Aggregator participating in Energy and Reserve Markets Considering Security Check

    Authors: Zhijun Shen, Mingbo Liu, Lixin Xu, Wentian Lu

    Abstract: With increasing distributed energy resources (DERs) integration, the strategic behavior of a DER aggregator in electricity markets will significantly affect the secure operation of the distribution system. In this paper, the interactions among the DER aggregator, energy and reserve markets, and distribution system are investigated through a single-leader-multi-follower Stackelberg game model with t…

    Submitted 26 July, 2021; v1 submitted 20 May, 2021; originally announced May 2021.

    Comments: 20 pages, 14 figures. This work has been submitted to Renewable Energy for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  43. arXiv:2105.03877  [pdf]

    eess.SY

    Non-iterative Optimization Algorithm for Active Distribution Grids Considering Uncertainty of Feeder Parameters

    Authors: J. Wu, M. Liu, W. Lu, K. Xie, M. Xie

    Abstract: To cope with fast-fluctuating distributed energy resources (DERs) and uncontrolled loads, this paper formulates a time-varying optimization problem for distribution grids with DERs and develops a novel non-iterative algorithm to track the optimal solutions. Different from existing methods, the proposed approach does not require iterations during the sampling interval. It only needs to perform a si…

    Submitted 9 May, 2021; originally announced May 2021.

    Comments: 9 pages, 10 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  44. Saliency-Guided Deep Learning Network for Automatic Tumor Bed Volume Delineation in Post-operative Breast Irradiation

    Authors: Mahdieh Kazemimoghadam, Weicheng Chi, Asal Rahimi, Nathan Kim, Prasanna Alluri, Chika Nwachukwu, Weiguo Lu, Xuejun Gu

    Abstract: Efficient, reliable and reproducible target volume delineation is a key step in the effective planning of breast radiotherapy. However, post-operative breast target delineation is challenging as the contrast between the tumor bed volume (TBV) and normal breast tissue is relatively low in CT images. In this study, we propose to mimic the marker-guidance procedure in manual target delineation. We de…

    Submitted 26 July, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: https://iopscience.iop.org/article/10.1088/1361-6560/ac176d

    Journal ref: Physics in Medicine & Biology 2021

  45. arXiv:2104.13339  [pdf, other]

    cs.CR eess.SY math.DS math.OC

    An Event-based Parameter Switching Method for Controlling Cybersecurity Dynamics

    Authors: Zhaofeng Liu, Wenlian Lu, Yingying Lang

    Abstract: This paper proposes a new event-based parameter switching method for the control tasks of cybersecurity in the context of preventive and reactive cyber defense dynamics. Our parameter switching method helps avoid excessive control costs and guarantees that the dynamics converge at our desired speed. Meanwhile, it can be proved that this approach is Zeno-free. A new estimation method with adap…

    Submitted 27 April, 2021; originally announced April 2021.

    Comments: 21 pages, 10 figures, 1 algorithm. may be submitted to SciSec Conference

    MSC Class: 37N35 ACM Class: C.2.0

  46. arXiv:2104.01445  [pdf]

    eess.SY cs.MA

    A Dynamics Perspective of Pursuit-Evasion Games of Intelligent Agents with the Ability to Learn

    Authors: Hao Xiong, Huanhui Cao, Lin Zhang, Wenjie Lu

    Abstract: Pursuit-evasion games are ubiquitous in nature and in an artificial world. In nature, pursuer(s) and evader(s) are intelligent agents that can learn from experience, and dynamics (i.e., Newtonian or Lagrangian) is vital for the pursuer and the evader in some scenarios. To this end, this paper addresses the pursuit-evasion game of intelligent agents from the perspective of dynamics. A bio-inspired…

    Submitted 3 April, 2021; originally announced April 2021.

  47. arXiv:2103.04026  [pdf]

    cs.CV cs.LG eess.IV

    Morphological Operation Residual Blocks: Enhancing 3D Morphological Feature Representation in Convolutional Neural Networks for Semantic Segmentation of Medical Images

    Authors: Chentian Li, Chi Ma, William W. Lu

    Abstract: The shapes and morphology of the organs and tissues are important prior knowledge in medical imaging recognition and segmentation. The morphological operation is a well-known method for morphological feature extraction. As the morphological operation performs well in hand-crafted image segmentation techniques, it is also promising to design an approach to approximate morphological operation in…

    Submitted 5 March, 2021; originally announced March 2021.

  48. Cross-domain Activity Recognition via Substructural Optimal Transport

    Authors: Wang Lu, Yiqiang Chen, Jindong Wang, Xin Qin

    Abstract: It is expensive and time-consuming to collect sufficient labeled data for human activity recognition (HAR). Domain adaptation is a promising approach for cross-domain activity recognition. Existing methods mainly focus on adapting cross-domain representations via domain-level, class-level, or sample-level distribution matching. However, they might fail to capture the fine-grained locality informat…

    Submitted 16 September, 2021; v1 submitted 29 January, 2021; originally announced February 2021.

    Comments: Accepted by Neurocomputing; 17 pages; Code: https://github.com/jindongwang/transferlearning/tree/master/code/traditional/sot

    Journal ref: Neurocomputing, Volume 454, 2021

  49. arXiv:2012.13539  [pdf, ps, other]

    cs.IT eess.SP

    A GCICA Grant-Free Random Access Scheme for M2M Communications in Crowded Massive MIMO Systems

    Authors: Huimei Han, Lushun Fang, Weidang Lu, Wenchao Zhai, Ying Li, Jun Zhao

    Abstract: A grant-free random access scheme with a high success rate is proposed to support massive access for machine-to-machine communications in massive multiple-input multiple-output systems. This scheme allows active user equipments (UEs) to transmit their modulated uplink messages along with super pilots consisting of multiple sub-pilots to a base station (BS). Then, the BS performs channel state informati…

    Submitted 25 December, 2020; originally announced December 2020.

  50. arXiv:2012.10239  [pdf]

    eess.IV physics.optics q-bio.QM

    Computational interference microscopy enabled by deep learning

    Authors: Yuheng Jiao, Yuchen R. He, Mikhail E. Kandel, Xiaojun Liu, Wenlong Lu, Gabriel Popescu

    Abstract: Quantitative phase imaging (QPI) has been widely applied in characterizing cells and tissues. Spatial light interference microscopy (SLIM) is a highly sensitive QPI method, due to its partially coherent illumination and common path interferometry geometry. However, its acquisition rate is limited because of the four-frame phase-shifting scheme. On the other hand, off-axis methods like diffraction…

    Submitted 17 December, 2020; originally announced December 2020.