-
MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing
Authors:
Yu-Fen Huang,
Nikki Moran,
Simon Coleman,
Jon Kelly,
Shun-Hwa Wei,
Po-Yin Chen,
Yun-Hsin Huang,
Tsung-Ping Chen,
Yu-Chia Kuo,
Yu-Chi Wei,
Chih-Hsuan Li,
Da-Yu Huang,
Hsuan-Kai Kao,
Ting-Wei Lin,
Li Su
Abstract:
In cross-modal music processing, translation between visual, auditory, and semantic content opens up new possibilities as well as challenges. The construction of such a transformative scheme depends upon a benchmark corpus with a comprehensive data infrastructure. In particular, the assembly of a large-scale cross-modal dataset presents major challenges. In this paper, we present the MOSA (Music m…
▽ More
In cross-modal music processing, translation between visual, auditory, and semantic content opens up new possibilities as well as challenges. The construction of such a transformative scheme depends upon a benchmark corpus with a comprehensive data infrastructure. In particular, the assembly of a large-scale cross-modal dataset presents major challenges. In this paper, we present the MOSA (Music mOtion with Semantic Annotation) dataset, which contains high quality 3-D motion capture data, aligned audio recordings, and note-by-note semantic annotations of pitch, beat, phrase, dynamic, articulation, and harmony for 742 professional music performances by 23 professional musicians, comprising more than 30 hours and 570 K notes of data. To our knowledge, this is the largest cross-modal music dataset with note-level annotations to date. To demonstrate the usage of the MOSA dataset, we present several innovative cross-modal music information retrieval (MIR) and musical content generation tasks, including the detection of beats, downbeats, phrase, and expressive contents from audio, video and motion data, and the generation of musicians' body motion from given music audio. The dataset and codes are available alongside this publication (https://github.com/yufenhuang/MOSA-Music-mOtion-and-Semantic-Annotation-dataset).
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Array SAR 3D Sparse Imaging Based on Regularization by Denoising Under Few Observed Data
Authors:
Yangyang Wang,
Xu Zhan,
Jing Gao,
Jinjie Yao,
Shunjun Wei,
JianSheng Bai
Abstract:
Array synthetic aperture radar (SAR) three-dimensional (3D) imaging can obtain 3D information of the target region, which is widely used in environmental monitoring and scattering information measurement. In recent years, with the development of compressed sensing (CS) theory, sparse signal processing is used in array SAR 3D imaging. Compared with matched filter (MF), sparse SAR imaging can effect…
▽ More
Array synthetic aperture radar (SAR) three-dimensional (3D) imaging can obtain 3D information of the target region, which is widely used in environmental monitoring and scattering information measurement. In recent years, with the development of compressed sensing (CS) theory, sparse signal processing is used in array SAR 3D imaging. Compared with matched filter (MF), sparse SAR imaging can effectively improve image quality. However, sparse imaging based on handcrafted regularization functions suffers from target information loss in few observed SAR data. Therefore, in this article, a general 3D sparse imaging framework based on Regulation by Denoising (RED) and proximal gradient descent type method for array SAR is presented. Firstly, we construct explicit prior terms via state-of-the-art denoising operators instead of regularization functions, which can improve the accuracy of sparse reconstruction and preserve the structure information of the target. Then, different proximal gradient descent type methods are presented, including a generalized alternating projection (GAP) and an alternating direction method of multiplier (ADMM), which is suitable for high-dimensional data processing. Additionally, the proposed method has robust convergence, which can achieve sparse reconstruction of 3D SAR in few observed SAR data. Extensive simulations and real data experiments are conducted to analyze the performance of the proposed method. The experimental results show that the proposed method has superior sparse reconstruction performance.
△ Less
Submitted 26 May, 2024; v1 submitted 9 May, 2024;
originally announced May 2024.
-
Research on OPF control of three-phase four-wire low-voltage distribution network considering uncertainty
Authors:
Rui Wang,
Xiaoqing Bai,
Shengquan Huang,
Shoupu Wei
Abstract:
As power systems become more complex and uncertain, low-voltage distribution networks face numerous challenges, including three-phase imbalances caused by asymmetrical loads and distributed energy resources. We propose a robust stochastic optimization (RSO) based optimal power flow (OPF) control method for three-phase, four-wire low-voltage distribution networks that consider uncertainty to addres…
▽ More
As power systems become more complex and uncertain, low-voltage distribution networks face numerous challenges, including three-phase imbalances caused by asymmetrical loads and distributed energy resources. We propose a robust stochastic optimization (RSO) based optimal power flow (OPF) control method for three-phase, four-wire low-voltage distribution networks that consider uncertainty to address these issues. Using historical data and deep learning classification methods, the proposed method simulates optimal system behaviour without requiring communication infrastructure. The simulation results verify that the proposed method effectively controls the voltage and current amplitude while minimizing the operational cost and three-phase imbalance within acceptable limits. The proposed method shows promise for managing uncertainties and optimizing performance in low-voltage distribution networks.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
Confidence-Aware Safe and Stable Control of Control-Affine Systems
Authors:
Shiqing Wei,
Prashanth Krishnamurthy,
Farshad Khorrami
Abstract:
Designing control inputs that satisfy safety requirements is crucial in safety-critical nonlinear control, and this task becomes particularly challenging when full-state measurements are unavailable. In this work, we address the problem of synthesizing safe and stable control for control-affine systems via output feedback (using an observer) while reducing the estimation error of the observer. To…
▽ More
Designing control inputs that satisfy safety requirements is crucial in safety-critical nonlinear control, and this task becomes particularly challenging when full-state measurements are unavailable. In this work, we address the problem of synthesizing safe and stable control for control-affine systems via output feedback (using an observer) while reducing the estimation error of the observer. To achieve this, we adapt control Lyapunov function (CLF) and control barrier function (CBF) techniques to the output feedback setting. Building upon the existing CLF-CBF-QP (Quadratic Program) and CBF-QP frameworks, we formulate two confidence-aware optimization problems and establish the Lipschitz continuity of the obtained solutions. To validate our approach, we conduct simulation studies on two illustrative examples. The simulation studies indicate both improvements in the observer's estimation accuracy and the fulfillment of safety and control requirements.
△ Less
Submitted 13 March, 2024;
originally announced March 2024.
-
From Registration Uncertainty to Segmentation Uncertainty
Authors:
Junyu Chen,
Yihao Liu,
Shuwen Wei,
Zhangxing Bian,
Aaron Carass,
Yong Du
Abstract:
Understanding the uncertainty inherent in deep learning-based image registration models has been an ongoing area of research. Existing methods have been developed to quantify both transformation and appearance uncertainties related to the registration process, elucidating areas where the model may exhibit ambiguity regarding the generated deformation. However, our study reveals that neither uncert…
▽ More
Understanding the uncertainty inherent in deep learning-based image registration models has been an ongoing area of research. Existing methods have been developed to quantify both transformation and appearance uncertainties related to the registration process, elucidating areas where the model may exhibit ambiguity regarding the generated deformation. However, our study reveals that neither uncertainty effectively estimates the potential errors when the registration model is used for label propagation. Here, we propose a novel framework to concurrently estimate both the epistemic and aleatoric segmentation uncertainties for image registration. To this end, we implement a compact deep neural network (DNN) designed to transform the appearance discrepancy in the warping into aleatoric segmentation uncertainty by minimizing a negative log-likelihood loss function. Furthermore, we present epistemic segmentation uncertainty within the label propagation process as the entropy of the propagated labels. By introducing segmentation uncertainty along with existing methods for estimating registration uncertainty, we offer vital insights into the potential uncertainties at different stages of image registration. We validated our proposed framework using publicly available datasets, and the results prove that the segmentation uncertainties estimated with the proposed method correlate well with errors in label propagation, all while achieving superior registration performance.
△ Less
Submitted 8 March, 2024;
originally announced March 2024.
-
Is Registering Raw Tagged-MR Enough for Strain Estimation in the Era of Deep Learning?
Authors:
Zhangxing Bian,
Ahmed Alshareef,
Shuwen Wei,
Junyu Chen,
Yuli Wang,
Jonghye Woo,
Dzung L. Pham,
Jiachen Zhuo,
Aaron Carass,
Jerry L. Prince
Abstract:
Magnetic Resonance Imaging with tagging (tMRI) has long been utilized for quantifying tissue motion and strain during deformation. However, a phenomenon known as tag fading, a gradual decrease in tag visibility over time, often complicates post-processing. The first contribution of this study is to model tag fading by considering the interplay between $T_1$ relaxation and the repeated application…
▽ More
Magnetic Resonance Imaging with tagging (tMRI) has long been utilized for quantifying tissue motion and strain during deformation. However, a phenomenon known as tag fading, a gradual decrease in tag visibility over time, often complicates post-processing. The first contribution of this study is to model tag fading by considering the interplay between $T_1$ relaxation and the repeated application of radio frequency (RF) pulses during serial imaging sequences. This is a factor that has been overlooked in prior research on tMRI post-processing. Further, we have observed an emerging trend of utilizing raw tagged MRI within a deep learning-based (DL) registration framework for motion estimation. In this work, we evaluate and analyze the impact of commonly used image similarity objectives in training DL registrations on raw tMRI. This is then compared with the Harmonic Phase-based approach, a traditional approach which is claimed to be robust to tag fading. Our findings, derived from both simulated images and an actual phantom scan, reveal the limitations of various similarity losses in raw tMRI and emphasize caution in registration tasks where image intensity changes over time.
△ Less
Submitted 30 January, 2024;
originally announced January 2024.
-
A Learning-Based Framework for Safe Human-Robot Collaboration with Multiple Backup Control Barrier Functions
Authors:
Neil C. Janwani,
Ersin Daş,
Thomas Touma,
Skylar X. Wei,
Tamas G. Molnar,
Joel W. Burdick
Abstract:
Ensuring robot safety in complex environments is a difficult task due to actuation limits, such as torque bounds. This paper presents a safety-critical control framework that leverages learning-based switching between multiple backup controllers to formally guarantee safety under bounded control inputs while satisfying driver intention. By leveraging backup controllers designed to uphold safety an…
▽ More
Ensuring robot safety in complex environments is a difficult task due to actuation limits, such as torque bounds. This paper presents a safety-critical control framework that leverages learning-based switching between multiple backup controllers to formally guarantee safety under bounded control inputs while satisfying driver intention. By leveraging backup controllers designed to uphold safety and input constraints, backup control barrier functions (BCBFs) construct implicitly defined control invariance sets via a feasible quadratic program (QP). However, BCBF performance largely depends on the design and conservativeness of the chosen backup controller, especially in our setting of human-driven vehicles in complex, e.g, off-road, conditions. While conservativeness can be reduced by using multiple backup controllers, determining when to switch is an open problem. Consequently, we develop a broadcast scheme that estimates driver intention and integrates BCBFs with multiple backup strategies for human-robot interaction. An LSTM classifier uses data inputs from the robot, human, and safety algorithms to continually choose a backup controller in real-time. We demonstrate our method's efficacy on a dual-track robot in obstacle avoidance scenarios. Our framework guarantees robot safety while adhering to driver intention.
△ Less
Submitted 7 March, 2024; v1 submitted 9 October, 2023;
originally announced October 2023.
-
WindMill: A Parameterized and Pluggable CGRA Implemented by DIAG Design Flow
Authors:
Haojia Hui,
Jiangyuan Gu,
Xunbo Hu,
Yang Hu,
Leibo Liu,
Shaojun Wei,
Shouyi Yin
Abstract:
With the cross-fertilization of applications and the ever-increasing scale of models, the efficiency and productivity of hardware computing architectures have become inadequate. This inadequacy further exacerbates issues in design flexibility, design complexity, development cycle, and development costs (4-d problems) in divergent scenarios. To address these challenges, this paper proposed a flexib…
▽ More
With the cross-fertilization of applications and the ever-increasing scale of models, the efficiency and productivity of hardware computing architectures have become inadequate. This inadequacy further exacerbates issues in design flexibility, design complexity, development cycle, and development costs (4-d problems) in divergent scenarios. To address these challenges, this paper proposed a flexible design flow called DIAG based on plugin techniques. The proposed flow guides hardware development through four layers: definition(D), implementation(I), application(A), and generation(G). Furthermore, a versatile CGRA generator called WindMill is implemented, allowing for agile generation of customized hardware accelerators based on specific application demands. Applications and algorithm tasks from three aspects is experimented. In the case of reinforcement learning algorithm, a significant performance improvement of $2.3\times$ compared to GPU is achieved.
△ Less
Submitted 3 September, 2023;
originally announced September 2023.
-
MomentaMorph: Unsupervised Spatial-Temporal Registration with Momenta, Shooting, and Correction
Authors:
Zhangxing Bian,
Shuwen Wei,
Yihao Liu,
Junyu Chen,
Jiachen Zhuo,
Fangxu Xing,
Jonghye Woo,
Aaron Carass,
Jerry L. Prince
Abstract:
Tagged magnetic resonance imaging (tMRI) has been employed for decades to measure the motion of tissue undergoing deformation. However, registration-based motion estimation from tMRI is difficult due to the periodic patterns in these images, particularly when the motion is large. With a larger motion the registration approach gets trapped in a local optima, leading to motion estimation errors. We…
▽ More
Tagged magnetic resonance imaging (tMRI) has been employed for decades to measure the motion of tissue undergoing deformation. However, registration-based motion estimation from tMRI is difficult due to the periodic patterns in these images, particularly when the motion is large. With a larger motion the registration approach gets trapped in a local optima, leading to motion estimation errors. We introduce a novel "momenta, shooting, and correction" framework for Lagrangian motion estimation in the presence of repetitive patterns and large motion. This framework, grounded in Lie algebra and Lie group principles, accumulates momenta in the tangent vector space and employs exponential mapping in the diffeomorphic space for rapid approximation towards true optima, circumventing local optima. A subsequent correction step ensures convergence to true optima. The results on a 2D synthetic dataset and a real 3D tMRI dataset demonstrate our method's efficiency in estimating accurate, dense, and diffeomorphic 2D/3D motion fields amidst large motion and repetitive patterns.
△ Less
Submitted 5 August, 2023;
originally announced August 2023.
-
A survey on deep learning in medical image registration: new technologies, uncertainty, evaluation metrics, and beyond
Authors:
Junyu Chen,
Yihao Liu,
Shuwen Wei,
Zhangxing Bian,
Shalini Subramanian,
Aaron Carass,
Jerry L. Prince,
Yong Du
Abstract:
Deep learning technologies have dramatically reshaped the field of medical image registration over the past decade. The initial developments, such as regression-based and U-Net-based networks, established the foundation for deep learning in image registration. Subsequent progress has been made in various aspects of deep learning-based registration, including similarity measures, deformation regula…
▽ More
Deep learning technologies have dramatically reshaped the field of medical image registration over the past decade. The initial developments, such as regression-based and U-Net-based networks, established the foundation for deep learning in image registration. Subsequent progress has been made in various aspects of deep learning-based registration, including similarity measures, deformation regularizations, network architectures, and uncertainty estimation. These advancements have not only enriched the field of image registration but have also facilitated its application in a wide range of tasks, including atlas construction, multi-atlas segmentation, motion estimation, and 2D-3D registration. In this paper, we present a comprehensive overview of the most recent advancements in deep learning-based image registration. We begin with a concise introduction to the core concepts of deep learning-based image registration. Then, we delve into innovative network architectures, loss functions specific to registration, and methods for estimating registration uncertainty. Additionally, this paper explores appropriate evaluation metrics for assessing the performance of deep learning models in registration tasks. Finally, we highlight the practical applications of these novel techniques in medical imaging and discuss the future prospects of deep learning-based image registration.
△ Less
Submitted 30 April, 2024; v1 submitted 28 July, 2023;
originally announced July 2023.
-
Technical outlier detection via convolutional variational autoencoder for the ADMANI breast mammogram dataset
Authors:
Hui Li,
Carlos A. Pena Solorzano,
Susan Wei,
Davis J. McCarthy
Abstract:
The ADMANI datasets (annotated digital mammograms and associated non-image datasets) from the Transforming Breast Cancer Screening with AI programme (BRAIx) run by BreastScreen Victoria in Australia are multi-centre, large scale, clinically curated, real-world databases. The datasets are expected to aid in the development of clinically relevant Artificial Intelligence (AI) algorithms for breast ca…
▽ More
The ADMANI datasets (annotated digital mammograms and associated non-image datasets) from the Transforming Breast Cancer Screening with AI programme (BRAIx) run by BreastScreen Victoria in Australia are multi-centre, large scale, clinically curated, real-world databases. The datasets are expected to aid in the development of clinically relevant Artificial Intelligence (AI) algorithms for breast cancer detection, early diagnosis, and other applications. To ensure high data quality, technical outliers must be removed before any downstream algorithm development. As a first step, we randomly select 30,000 individual mammograms and use Convolutional Variational Autoencoder (CVAE), a deep generative neural network, to detect outliers. CVAE is expected to detect all sorts of outliers, although its detection performance differs among different types of outliers. Traditional image processing techniques such as erosion and pectoral muscle analysis can compensate for the poor performance of CVAE in certain outlier types. We identify seven types of technical outliers: implant, pacemaker, cardiac loop recorder, improper radiography, atypical lesion/calcification, incorrect exposure parameter and improper placement. The outlier recall rate for the test set is 61% if CVAE, erosion and pectoral muscle analysis each select the top 1% images ranked in ascending or descending order according to image outlier score under each detection method, and 83% if each selects the top 5% images. This study offers an overview of technical outliers in the ADMANI dataset and suggests future directions to improve outlier detection effectiveness.
△ Less
Submitted 19 May, 2023;
originally announced May 2023.
-
Robust Control Barrier Functions with Uncertainty Estimation
Authors:
Ersin Daş,
Skylar X. Wei,
Joel W. Burdick
Abstract:
This paper proposes a safety controller for control-affine nonlinear systems with unmodelled dynamics and disturbances to improve closed-loop robustness. Uncertainty estimation-based control barrier functions (CBFs) are utilized to ensure robust safety in the presence of model uncertainties, which may depend on control input and states. We present a new uncertainty/disturbance estimator with theor…
▽ More
This paper proposes a safety controller for control-affine nonlinear systems with unmodelled dynamics and disturbances to improve closed-loop robustness. Uncertainty estimation-based control barrier functions (CBFs) are utilized to ensure robust safety in the presence of model uncertainties, which may depend on control input and states. We present a new uncertainty/disturbance estimator with theoretical upper bounds on estimation error and estimated outputs, which are used to ensure robust safety by formulating a convex optimization problem using a high-order CBF. The possibly unsafe nominal feedback controller is augmented with the proposed estimator in two frameworks (1) an uncertainty compensator and (2) a robustifying reformulation of CBF constraint with respect to the estimator outputs. The former scheme ensures safety with performance improvement by adaptively rejecting the matched uncertainty. The second method uses uncertainty estimation to robustify higher-order CBFs for safety-critical control. The proposed methods are demonstrated in simulations of an uncertain adaptive cruise control problem and a multirotor obstacle avoidance situation.
△ Less
Submitted 17 April, 2023;
originally announced April 2023.
-
Neural Lyapunov Control for Nonlinear Systems with Unstructured Uncertainties
Authors:
Shiqing Wei,
Prashanth Krishnamurthy,
Farshad Khorrami
Abstract:
Stabilizing controller design and region of attraction (RoA) estimation are essential in nonlinear control. Moreover, it is challenging to implement a control Lyapunov function (CLF) in practice when only partial knowledge of the system is available. We propose a learning framework that can synthesize state-feedback controllers and a CLF for control-affine nonlinear systems with unstructured uncer…
▽ More
Stabilizing controller design and region of attraction (RoA) estimation are essential in nonlinear control. Moreover, it is challenging to implement a control Lyapunov function (CLF) in practice when only partial knowledge of the system is available. We propose a learning framework that can synthesize state-feedback controllers and a CLF for control-affine nonlinear systems with unstructured uncertainties. Based on a regularity condition on these uncertainties, we model them as bounded disturbances and prove that a CLF for the nominal system (estimate of the true system) is an input-to-state stable control Lyapunov function (ISS-CLF) for the true system when the CLF's gradient is bounded. We integrate the robust Lyapunov analysis with the learning of both the control law and CLF. We demonstrate the effectiveness of our learning framework on several examples, such as an inverted pendulum system, a strict-feedback system, and a cart-pole system.
△ Less
Submitted 16 March, 2023;
originally announced March 2023.
-
Learning Disturbances Online for Risk-Aware Control: Risk-Aware Flight with Less Than One Minute of Data
Authors:
Prithvi Akella,
Skylar X. Wei,
Joel W. Burdick,
Aaron D. Ames
Abstract:
Recent advances in safety-critical risk-aware control are predicated on apriori knowledge of the disturbances a system might face. This paper proposes a method to efficiently learn these disturbances online, in a risk-aware context. First, we introduce the concept of a Surface-at-Risk, a risk measure for stochastic processes that extends Value-at-Risk -- a commonly utilized risk measure in the ris…
▽ More
Recent advances in safety-critical risk-aware control are predicated on apriori knowledge of the disturbances a system might face. This paper proposes a method to efficiently learn these disturbances online, in a risk-aware context. First, we introduce the concept of a Surface-at-Risk, a risk measure for stochastic processes that extends Value-at-Risk -- a commonly utilized risk measure in the risk-aware controls community. Second, we model the norm of the state discrepancy between the model and the true system evolution as a scalar-valued stochastic process and determine an upper bound to its Surface-at-Risk via Gaussian Process Regression. Third, we provide theoretical results on the accuracy of our fitted surface subject to mild assumptions that are verifiable with respect to the data sets collected during system operation. Finally, we experimentally verify our procedure by augmenting a drone's controller and highlight performance increases achieved via our risk-aware approach after collecting less than a minute of operating data.
△ Less
Submitted 12 December, 2022;
originally announced December 2022.
-
On Finite Difference Jacobian Computation in Deformable Image Registration
Authors:
Yihao Liu,
Junyu Chen,
Shuwen Wei,
Aaron Carass,
Jerry Prince
Abstract:
Producing spatial transformations that are diffeomorphic is a key goal in deformable image registration. As a diffeomorphic transformation should have positive Jacobian determinant |J| everywhere, the number of voxels with |J|<0 has been used to test for diffeomorphism and also to measure the irregularity of the transformation. For digital transformations, |J| is commonly approximated using a cent…
▽ More
Producing spatial transformations that are diffeomorphic is a key goal in deformable image registration. As a diffeomorphic transformation should have positive Jacobian determinant |J| everywhere, the number of voxels with |J|<0 has been used to test for diffeomorphism and also to measure the irregularity of the transformation. For digital transformations, |J| is commonly approximated using a central difference, but this strategy can yield positive |J|'s for transformations that are clearly not diffeomorphic -- even at the voxel resolution level. To show this, we first investigate the geometric meaning of different finite difference approximations of |J|. We show that to determine if a deformation is diffeomorphic for digital images, the use of any individual finite difference approximation of |J| is insufficient. We further demonstrate that for a 2D transformation, four unique finite difference approximations of |J|'s must be positive to ensure that the entire domain is invertible and free of folding at the pixel level. For a 3D transformation, ten unique finite differences approximations of |J|'s are required to be positive. Our proposed digital diffeomorphism criteria solves several errors inherent in the central difference approximation of |J| and accurately detects non-diffeomorphic digital transformations. The source code of this work is available at https://github.com/yihao6/digital_diffeomorphism.
△ Less
Submitted 28 May, 2023; v1 submitted 12 December, 2022;
originally announced December 2022.
-
Adaptive Conformal Prediction for Motion Planning among Dynamic Agents
Authors:
Anushri Dixit,
Lars Lindemann,
Skylar Wei,
Matthew Cleaveland,
George J. Pappas,
Joel W. Burdick
Abstract:
This paper proposes an algorithm for motion planning among dynamic agents using adaptive conformal prediction. We consider a deterministic control system and use trajectory predictors to predict the dynamic agents' future motion, which is assumed to follow an unknown distribution. We then leverage ideas from adaptive conformal prediction to dynamically quantify prediction uncertainty from an onlin…
▽ More
This paper proposes an algorithm for motion planning among dynamic agents using adaptive conformal prediction. We consider a deterministic control system and use trajectory predictors to predict the dynamic agents' future motion, which is assumed to follow an unknown distribution. We then leverage ideas from adaptive conformal prediction to dynamically quantify prediction uncertainty from an online data stream. Particularly, we provide an online algorithm uses delayed agent observations to obtain uncertainty sets for multistep-ahead predictions with probabilistic coverage. These uncertainty sets are used within a model predictive controller to safely navigate among dynamic agents. While most existing data-driven prediction approached quantify prediction uncertainty heuristically, we quantify the true prediction uncertainty in a distribution-free, adaptive manner that even allows to capture changes in prediction quality and the agents' motion. We empirically evaluate of our algorithm on a simulation case studies where a drone avoids a flying frisbee.
△ Less
Submitted 30 November, 2022;
originally announced December 2022.
-
Shadow-Oriented Tracking Method for Multi-Target Tracking in Video-SAR
Authors:
Xiaochuan Ni,
Xiaoling Zhang,
Xu Zhan,
Zhenyu Yang,
Jun Shi,
Shunjun Wei,
Tianjiao Zeng
Abstract:
This work focuses on multi-target tracking in Video synthetic aperture radar. Specifically, we refer to tracking based on targets' shadows. Current methods have limited accuracy as they fail to consider shadows' characteristics and surroundings fully. Shades are low-scattering and varied, resulting in missed tracking. Surroundings can cause interferences, resulting in false tracking. To solve thes…
▽ More
This work focuses on multi-target tracking in Video synthetic aperture radar. Specifically, we refer to tracking based on targets' shadows. Current methods have limited accuracy as they fail to consider shadows' characteristics and surroundings fully. Shades are low-scattering and varied, resulting in missed tracking. Surroundings can cause interferences, resulting in false tracking. To solve these, we propose a shadow-oriented multi-target tracking method (SOTrack). To avoid false tracking, a pre-processing module is proposed to enhance shadows from surroundings, thus reducing their interferences. To avoid missed tracking, a detection method based on deep learning is designed to thoroughly learn shadows' features, thus increasing the accurate estimation. And further, a recall module is designed to recall missed shadows. We conduct experiments on measured data. Results demonstrate that, compared with other methods, SOTrack achieves much higher performance in tracking accuracy-18.4%. And ablation study confirms the effectiveness of the proposed modules.
△ Less
Submitted 29 November, 2022;
originally announced November 2022.
-
A Model-data-driven Network Embedding Multidimensional Features for Tomographic SAR Imaging
Authors:
Yu Ren,
Xiaoling Zhang,
Xu Zhan,
Jun Shi,
Shunjun Wei,
Tianjiao Zeng
Abstract:
Deep learning (DL)-based tomographic SAR imaging algorithms are gradually being studied. Typically, they use an unfolding network to mimic the iterative calculation of the classical compressive sensing (CS)-based methods and process each range-azimuth unit individually. However, only one-dimensional features are effectively utilized in this way. The correlation between adjacent resolution units is…
▽ More
Deep learning (DL)-based tomographic SAR imaging algorithms are gradually being studied. Typically, they use an unfolding network to mimic the iterative calculation of the classical compressive sensing (CS)-based methods and process each range-azimuth unit individually. However, only one-dimensional features are effectively utilized in this way. The correlation between adjacent resolution units is ignored directly. To address that, we propose a new model-data-driven network to achieve tomoSAR imaging based on multi-dimensional features. Guided by the deep unfolding methodology, a two-dimensional deep unfolding imaging network is constructed. On the basis of it, we add two 2D processing modules, both convolutional encoder-decoder structures, to enhance multi-dimensional features of the imaging scene effectively. Meanwhile, to train the proposed multifeature-based imaging network, we construct a tomoSAR simulation dataset consisting entirely of simulation data of buildings. Experiments verify the effectiveness of the model. Compared with the conventional CS-based FISTA method and DL-based gamma-Net method, the result of our proposed method has better performance on completeness while having decent imaging accuracy.
△ Less
Submitted 27 November, 2022;
originally announced November 2022.
-
Near-filed SAR Image Restoration with Deep Learning Inverse Technique: A Preliminary Study
Authors:
Xu Zhan,
Xiaoling Zhang,
Wensi Zhang,
Jun Shi,
Shunjun Wei,
Tianjiao Zeng
Abstract:
Benefiting from a relatively larger aperture's angle, and in combination with a wide transmitting bandwidth, near-field synthetic aperture radar (SAR) provides a high-resolution image of a target's scattering distribution-hot spots. Meanwhile, imaging result suffers inevitable degradation from sidelobes, clutters, and noises, hindering the information retrieval of the target. To restore the image,…
▽ More
Benefiting from a relatively larger aperture's angle, and in combination with a wide transmitting bandwidth, near-field synthetic aperture radar (SAR) provides a high-resolution image of a target's scattering distribution-hot spots. Meanwhile, imaging result suffers inevitable degradation from sidelobes, clutters, and noises, hindering the information retrieval of the target. To restore the image, current methods make simplified assumptions; for example, the point spread function (PSF) is spatially consistent, the target consists of sparse point scatters, etc. Thus, they achieve limited restoration performance in terms of the target's shape, especially for complex targets. To address these issues, a preliminary study is conducted on restoration with the recent promising deep learning inverse technique in this work. We reformulate the degradation model into a spatially variable complex-convolution model, where the near-field SAR's system response is considered. Adhering to it, a model-based deep learning network is designed to restore the image. A simulated degraded image dataset from multiple complex target models is constructed to validate the network. All the images are formulated using the electromagnetic simulation tool. Experiments on the dataset reveal their effectiveness. Compared with current methods, superior performance is achieved regarding the target's shape and energy estimation.
△ Less
Submitted 27 November, 2022;
originally announced November 2022.
-
Solving 3D Radar Imaging Inverse Problems with a Multi-cognition Task-oriented Framework
Authors:
Xu Zhan,
Xiaoling Zhang,
Mou Wang,
Jun Shi,
Shunjun Wei,
Tianjiao Zeng
Abstract:
This work focuses on 3D Radar imaging inverse problems. Current methods obtain undifferentiated results that suffer task-depended information retrieval loss and thus don't meet the task's specific demands well. For example, biased scattering energy may be acceptable for screen imaging but not for scattering diagnosis. To address this issue, we propose a new task-oriented imaging framework. The ima…
▽ More
This work focuses on 3D Radar imaging inverse problems. Current methods obtain undifferentiated results that suffer task-depended information retrieval loss and thus don't meet the task's specific demands well. For example, biased scattering energy may be acceptable for screen imaging but not for scattering diagnosis. To address this issue, we propose a new task-oriented imaging framework. The imaging principle is task-oriented through an analysis phase to obtain task's demands. The imaging model is multi-cognition regularized to embed and fulfill demands. The imaging method is designed to be general-ized, where couplings between cognitions are decoupled and solved individually with approximation and variable-splitting techniques. Tasks include scattering diagnosis, person screen imaging, and parcel screening imaging are given as examples. Experiments on data from two systems indicate that the pro-posed framework outperforms the current ones in task-depended information retrieval.
△ Less
Submitted 27 November, 2022;
originally announced November 2022.
-
The Trajectory PHD Filter for Coexisting Point and Extended Target Tracking
Authors:
Shaoxiu Wei,
Ángel F. García-Fernández,
Wei Yi
Abstract:
This paper develops a general trajectory probability hypothesis density (TPHD) filter, which uses a general density for target-generated measurements and is able to estimate trajectories of coexisting point and extended targets. First, we provide a derivation of this general TPHD filter based on finding the best Poisson posterior approximation by minimizing the Kullback-Leibler divergence, without…
▽ More
This paper develops a general trajectory probability hypothesis density (TPHD) filter, which uses a general density for target-generated measurements and is able to estimate trajectories of coexisting point and extended targets. First, we provide a derivation of this general TPHD filter based on finding the best Poisson posterior approximation by minimizing the Kullback-Leibler divergence, without using probability generating functionals. Second, we adopt an efficient implementation of this filter, where Gaussian densities correspond to point targets and Gamma Gaussian Inverse Wishart densities for extended targets. The L-scan approximation is also proposed as a simplified version to mitigate the huge computational cost. Simulation and experimental results show that the proposed filter is able to classify targets correctly and obtain accurate trajectory estimation.
△ Less
Submitted 7 October, 2022;
originally announced October 2022.
-
Constant-Time-Delay Interferences In Near-Field SAR: Analysis And Suppression In Image Domain
Authors:
Xu Zhan,
Xiaoling Zhang,
Jun Shi,
Shunjun Wei
Abstract:
Inevitable interferences exist for the SAR system, adversely affecting the imaging quality. However, current analysis and suppression methods mainly focus on the far-field situation. Due to different sources and characteristics of interferences, they are not applicable in the near field. To bridge this gap, in the first time, analysis and the suppression method of interferences in near-field SAR a…
▽ More
Inevitable interferences exist for the SAR system, adversely affecting the imaging quality. However, current analysis and suppression methods mainly focus on the far-field situation. Due to different sources and characteristics of interferences, they are not applicable in the near field. To bridge this gap, in the first time, analysis and the suppression method of interferences in near-field SAR are presented in this work. We find that echoes from both the nadir points and the antenna coupling are the main causes, which have the constant-time-delay feature. To characterize this, we further establish an analytical model. It reveals that their patterns in 1D, 2D and 3D imaging results are all comb-like, while those of targets are point-like. Utilizing these features, a suppression method in image domain is proposed based on low-rank reconstruction. Measured data are used to validate the correctness of our analysis and the effectiveness of the suppression method.
△ Less
Submitted 21 September, 2022;
originally announced September 2022.
-
3D Super-Resolution Imaging Method for Distributed Millimeter-wave Automotive Radar System
Authors:
Yanqin Xu,
Xiaoling Zhang,
Shunjun Wei,
Jun Shi,
Xu Zhan,
Tianwen Zhang
Abstract:
Millimeter-wave (mmW) radar is widely applied to advanced autopilot assistance systems. However, its small antenna aperture causes a low imaging resolution. In this paper, a new distributed mmW radar system is designed to solve this problem. It forms a large sparse virtual planar array to enlarge the aperture, using multiple-input and multiple-output (MIMO) processing. However, in this system, tra…
▽ More
Millimeter-wave (mmW) radar is widely applied to advanced autopilot assistance systems. However, its small antenna aperture causes a low imaging resolution. In this paper, a new distributed mmW radar system is designed to solve this problem. It forms a large sparse virtual planar array to enlarge the aperture, using multiple-input and multiple-output (MIMO) processing. However, in this system, traditional imaging methods cannot apply to the sparse array. Therefore, we also propose a 3D super-resolution imaging method specifically for this system in this paper. The proposed method consists of three steps: (1) using range FFT to get range imaging, (2) using 2D adaptive diagonal loading iterative adaptive approach (ADL-IAA) to acquire 2D super-resolution imaging, which can satisfy this sparsity under single-measurement, (3) using constant false alarm (CFAR) processing to gain final 3D super-resolution imaging. The simulation results show the proposed method can significantly improve imaging resolution under the sparse array and single-measurement.
△ Less
Submitted 21 September, 2022;
originally announced September 2022.
-
Near-Field SAR Image Restoration Based On Two Dimensional Spatial-Variant Deconvolution
Authors:
Wensi Zhang,
Xiaoling Zhang,
Xu Zhan,
Yuetonghui Xu,
Jun Shi,
Shunjun Wei
Abstract:
Images of near-field SAR contains spatial-variant sidelobes and clutter, subduing the image quality. Current image restoration methods are only suitable for small observation angle, due to their assumption of 2D spatial-invariant degradation operation. This limits its potential for large-scale objects imaging, like the aircraft. To ease this restriction, in this work an image restoration method ba…
▽ More
Images of near-field SAR contains spatial-variant sidelobes and clutter, subduing the image quality. Current image restoration methods are only suitable for small observation angle, due to their assumption of 2D spatial-invariant degradation operation. This limits its potential for large-scale objects imaging, like the aircraft. To ease this restriction, in this work an image restoration method based on the 2D spatial-variant deconvolution is proposed. First, the image degradation is seen as a complex convolution process with 2D spatial-variant operations. Then, to restore the image, the process of deconvolution is performed by cyclic coordinate descent algorithm. Experiments on simulation and measured data validate the effectiveness and superiority of the proposed method. Compared with current methods, higher precision estimation of the targets' amplitude and position is obtained.
△ Less
Submitted 21 September, 2022;
originally announced September 2022.
-
Two Dimensional Sparse-Regularization-Based InSAR Imaging with Back-Projection Embedding
Authors:
Xu Zhan,
Xiaoling Zhang,
Shunjun Wei,
Jun Shi
Abstract:
Interferometric Synthetic Aperture Radar (InSAR) Imaging methods are usually based on algorithms of match-filtering type, without considering the scene's characteristic, which causes limited imaging quality. Besides, post-processing steps are inevitable, like image registration, flat-earth phase removing and phase noise filtering. To solve these problems, we propose a new InSAR imaging method. Fir…
▽ More
Interferometric Synthetic Aperture Radar (InSAR) Imaging methods are usually based on algorithms of match-filtering type, without considering the scene's characteristic, which causes limited imaging quality. Besides, post-processing steps are inevitable, like image registration, flat-earth phase removing and phase noise filtering. To solve these problems, we propose a new InSAR imaging method. First, to enhance the imaging quality, we propose a new imaging framework base on 2D sparse regularization, where the characteristic of scene is embedded. Second, to avoid the post processing steps, we establish a new forward observation process, where the back-projection imaging method is embedded. Third, a forward and backward iterative solution method is proposed based on proximal gradient descent algorithm. Experiments on simulated and measured data reveal the effectiveness of the proposed method. Compared with the conventional method, higher quality interferogram can be obtained directly from raw echoes without post-processing. Besides, in the under-sampling situation, it's also applicable.
△ Less
Submitted 21 September, 2022;
originally announced September 2022.
-
Deep Multi-Scale Representation Learning with Attention for Automatic Modulation Classification
Authors:
Xiaowei Wu,
Shengyun Wei,
Yan Zhou
Abstract:
Currently, deep learning methods with stacking small size convolutional filters are widely used for automatic modulation classification (AMC). In this report, we find some experienced improvements by using large kernel size for convolutional deep convolution neural network based AMC, which is more efficient in extracting multi-scale features of the raw signal I/Q sequence data. Also, Squeeze-and-E…
▽ More
Currently, deep learning methods with stacking small size convolutional filters are widely used for automatic modulation classification (AMC). In this report, we find some experienced improvements by using large kernel size for convolutional deep convolution neural network based AMC, which is more efficient in extracting multi-scale features of the raw signal I/Q sequence data. Also, Squeeze-and-Excitation (SE) mechanisms can significantly help AMC networks to focus on the more important features of the signal. As a result, we propose a multi-scale feature network with large kernel size and SE mechanism (SE-MSFN) in this paper. SE-MSFN achieves state-of-the-art classification performance on the public well-known RADIOML 2018.01A dataset, with average classification accuracy of 64.50%, surpassing CLDNN by 1.42%, maximum classification accuracy of 98.5%, and an average classification accuracy of 85.53% in the lower SNR range 0dB to 10dB, surpassing CLDNN by 2.85%. In addition, we also verified that ensemble learning can help further improve classification performance. We hope this report can provide some references for developers and researchers in practical scenes.
△ Less
Submitted 31 August, 2022;
originally announced September 2022.
-
Learning long-term music representations via hierarchical contextual constraints
Authors:
Shiqi Wei,
Gus Xia
Abstract:
Learning symbolic music representations, especially disentangled representations with probabilistic interpretations, has been shown to benefit both music understanding and generation. However, most models are only applicable to short-term music, while learning long-term music representations remains a challenging task. We have seen several studies attempting to learn hierarchical representations d…
▽ More
Learning symbolic music representations, especially disentangled representations with probabilistic interpretations, has been shown to benefit both music understanding and generation. However, most models are only applicable to short-term music, while learning long-term music representations remains a challenging task. We have seen several studies attempting to learn hierarchical representations directly in an end-to-end manner, but these models have not been able to achieve the desired results and the training process is not stable. In this paper, we propose a novel approach to learn long-term symbolic music representations through contextual constraints. First, we use contrastive learning to pre-train a long-term representation by constraining its difference from the short-term representation (extracted by an off-the-shelf model). Then, we fine-tune the long-term representation by a hierarchical prediction model such that a good long-term representation (e.g., an 8-bar representation) can reconstruct the corresponding short-term ones (e.g., the 2-bar representations within the 8-bar range). Experiments show that our method stabilizes the training and the fine-tuning steps. In addition, the designed contextual constraints benefit both reconstruction and disentanglement, significantly outperforming the baselines.
△ Less
Submitted 12 February, 2022;
originally announced February 2022.
-
Trajectory PHD Filter with Unknown Detection Profile and Clutter Rate
Authors:
Shaoxiu Wei,
Boxiang Zhang,
Wei Yi
Abstract:
In this paper, we derive the robust TPHD (R-TPHD) filter, which can adaptively learn the unknown detection profile history and clutter rate. The R-TPHD filter is derived by obtaining the best Poisson posterior density approximation over trajectories on hybrid and augmented state space by minimizing the Kullback-Leibler divergence (KLD). Because of the huge computational burden and the short-term s…
▽ More
In this paper, we derive the robust TPHD (R-TPHD) filter, which can adaptively learn the unknown detection profile history and clutter rate. The R-TPHD filter is derived by obtaining the best Poisson posterior density approximation over trajectories on hybrid and augmented state space by minimizing the Kullback-Leibler divergence (KLD). Because of the huge computational burden and the short-term stability of the detection profile, we also propose the R-TPHD filter with unknown detection profile only at current time as an approximation. The Beta-Gaussian mixture model is proposed for the implementation, which is referred to as the BG-R-TPHD filter and we also propose a L-scan approximation for the BG-R-TPHD filter, which possesses lower computational burden.
△ Less
Submitted 6 November, 2021;
originally announced November 2021.
-
Multi-target Joint Tracking and Classification Using the Trajectory PHD Filter
Authors:
Shaoxiu Wei,
Boxiang Zhang,
Wei Yi
Abstract:
To account for joint tracking and classification (JTC) of multiple targets from observation sets in presence of detection uncertainty, noise and clutter, this paper develops a new trajectory probability hypothesis density (TPHD) filter, which is referred to as the JTC-TPHD filter. The JTC-TPHD filter classifies different targets based on their motion models and each target is assigned with multipl…
▽ More
To account for joint tracking and classification (JTC) of multiple targets from observation sets in presence of detection uncertainty, noise and clutter, this paper develops a new trajectory probability hypothesis density (TPHD) filter, which is referred to as the JTC-TPHD filter. The JTC-TPHD filter classifies different targets based on their motion models and each target is assigned with multiple class hypotheses. By using this strategy, we can not only obtain the category information of the targets, but also a more accurate trajectory estimation than the traditional TPHD filter. The JTC-TPHD filter is derived by finding the best Poisson posterior approximation over trajectories on an augmented state space using the Kullback-Leibler divergence (KLD) minimization. The Gaussian mixture is adopted for the implementation, which is referred to as the GM-JTC-TPHD filter. The L-scan approximation is also presented for the GM-JTC-TPHD filter, which possesses lower computational burden. Simulation results show that the GM-JTC-TPHD filter can classify targets correctly and obtain accurate trajectory estimation.
△ Less
Submitted 6 November, 2021;
originally announced November 2021.
-
Trajectory PHD and CPHD Filters with Unknown Detection Profile
Authors:
Shaoxiu Wei,
Boxiang Zhang,
Wei Yi
Abstract:
Compared to the probability hypothesis density (PHD) and cardinalized PHD (CPHD) filters, the trajectory PHD (TPHD) and trajectory CPHD (TCPHD) filters are for sets of trajectories, and thus are able to produce trajectory estimates with better estimation performance. In this paper, we develop the TPHD and TCPHD filters which can adaptively learn the history of the unknown target detection probabil…
▽ More
Compared to the probability hypothesis density (PHD) and cardinalized PHD (CPHD) filters, the trajectory PHD (TPHD) and trajectory CPHD (TCPHD) filters are for sets of trajectories, and thus are able to produce trajectory estimates with better estimation performance. In this paper, we develop the TPHD and TCPHD filters which can adaptively learn the history of the unknown target detection probability, and therefore they can perform more robustly in scenarios where targets are with unknown and time-varying detection probabilities. These filters are referred to as the unknown TPHD (U-TPHD) and unknown TCPHD (U-TCPHD) filters.By minimizing the Kullback-Leibler divergence (KLD), the U-TPHD and U-TCPHD filters can obtain, respectively, the best Poisson and independent identically distributed (IID) density approximations over the augmented sets of trajectories. For computational efficiency, we also propose the U-TPHD and U-TCPHD filters that only consider the unknown detection profile at the current time. Specifically, the Beta-Gaussian mixture method is adopted for the implementation of proposed filters, which are referred to as the BG-U-TPHD and BG-U-TCPHD filters. The L-scan approximations of these filters with much lower computational burden are also presented. Finally, various simulation results demonstrate that the BG-U-TPHD and BG-U-TCPHD filters can achieve robust tracking performance to adapt to unknown detection profile. Besides, it also shows that usually a small value of the L-scan approximation can achieve almost full efficiency of both filters but with a much lower computational costs.
△ Less
Submitted 6 November, 2021;
originally announced November 2021.
-
Nonlinear Controllability Assessment of Aerial Manipulator Systems using Lagrangian Reduction
Authors:
Skylar X. Wei,
Matthew R. Burkhardt,
Joel Burdick
Abstract:
This paper analyzes the nonlinear Small-Time Local Controllability (STLC) of a class of underatuated aerial manipulator robots. We apply methods of Lagrangian reduction to obtain their lowest dimensional equations of motion (EOM). The symmetry-breaking potential energy terms are resolved using advected parameters, allowing full $SE(3)$ reduction at the cost of additional advection equations. The r…
▽ More
This paper analyzes the nonlinear Small-Time Local Controllability (STLC) of a class of underatuated aerial manipulator robots. We apply methods of Lagrangian reduction to obtain their lowest dimensional equations of motion (EOM). The symmetry-breaking potential energy terms are resolved using advected parameters, allowing full $SE(3)$ reduction at the cost of additional advection equations. The reduced EOM highlights the shifting center of gravity due to manipulation and is readily in control-affine form, simplifying the nonlinear controllability analysis. Using Sussmann's sufficient condition, we conclude that the aerial manipulator robots are STLC near equilibrium condition, requiring Lie bracket motions up to degree three.
△ Less
Submitted 14 August, 2021;
originally announced August 2021.
-
Motivations and Preliminary Design for Mid-Air Deployment of a Science Rotorcraft on Mars
Authors:
Jeff Delaune,
Jacob Izraelevitz,
Larry A. Young,
William Rapin,
Evgeniy Sklyanskiy,
Wayne Johnson,
Aaron Schutte,
Abigail Fraeman,
Valerie Scott,
Carl Leake,
Erik Ballesteros,
Shannah Withrow,
Raghav Bhagwat,
Haley Cummings,
Kim Aaron,
Marcel Veismann,
Skylar Wei,
Regina Lee,
Luis Pabon Madrid,
Morteza Gharib,
Joel Burdick
Abstract:
Mid-Air Deployment (MAD) of a rotorcraft during Entry, Descent and Landing (EDL) on Mars eliminates the need to carry a propulsion or airbag landing system. This reduces the total mass inside the aeroshell by more than 100 kg and simplifies the aeroshell architecture. MAD's lighter and simpler design is likely to bring the risk and cost associated with the mission down. Moreover, the lighter entry…
▽ More
Mid-Air Deployment (MAD) of a rotorcraft during Entry, Descent and Landing (EDL) on Mars eliminates the need to carry a propulsion or airbag landing system. This reduces the total mass inside the aeroshell by more than 100 kg and simplifies the aeroshell architecture. MAD's lighter and simpler design is likely to bring the risk and cost associated with the mission down. Moreover, the lighter entry mass enables landing in the Martian highlands, at elevations inaccessible to current EDL technologies. This paper proposes a novel MAD concept for a Mars helicopter. We suggest a minimum science payload package to perform relevant science in the highlands. A variant of the Ingenuity helicopter is proposed to provide increased deceleration during MAD, and enough lift to fly the science payload in the highlands. We show in simulation that the lighter aeroshell results in a lower terminal velocity (30 m/s) at the end of the parachute phase of the EDL, and at higher altitudes than other approaches. After discussing the aerodynamics, controls, guidance, and mechanical challenges associated with deploying at such speed, we propose a backshell architecture that addresses them to release the helicopter in the safest conditions. Finally, we implemented the helicopter model and aerodynamic descent perturbations in the JPL Dynamics and Real-Time Simulation (DARTS)framework. Preliminary performance evaluation indicates landing and helicopter operation scan be achieved up to 5 km MOLA (Mars Orbiter Laser Altimeter reference).
△ Less
Submitted 13 October, 2020;
originally announced October 2020.
-
Balance Scene Learning Mechanism for Offshore and Inshore Ship Detection in SAR Images
Authors:
Tianwen Zhang,
Xiaoling Zhang,
Jun Shi,
Shunjun Wei,
Jianguo Wang,
Jianwei Li,
Hao Su,
Yue Zhou
Abstract:
Huge imbalance of different scenes' sample numbers seriously reduces Synthetic Aperture Radar (SAR) ship detection accuracy. Thus, to solve this problem, this letter proposes a Balance Scene Learning Mechanism (BSLM) for offshore and inshore ship detection in SAR images.
Huge imbalance of different scenes' sample numbers seriously reduces Synthetic Aperture Radar (SAR) ship detection accuracy. Thus, to solve this problem, this letter proposes a Balance Scene Learning Mechanism (BSLM) for offshore and inshore ship detection in SAR images.
△ Less
Submitted 21 September, 2020; v1 submitted 21 July, 2020;
originally announced July 2020.
-
Power Adaptive Network Coding for a Non-Orthogonal Multiple-Access Relay Channel
Authors:
Sha Wei,
Jun Li,
Wen Chen,
Hang Su,
Zihuai Lin,
Branka Vucetic
Abstract:
In this paper we propose a novel power adapted network coding (PANC) for a non-orthogonal multiple-access relay channel (MARC), where two sources transmit their information simultaneously to the destination with the help of a relay. Different from the conventional XOR-based network coding (CXNC), the relay in our PANC generates network coded bits by considering the coefficients of the source-to-re…
▽ More
In this paper we propose a novel power adapted network coding (PANC) for a non-orthogonal multiple-access relay channel (MARC), where two sources transmit their information simultaneously to the destination with the help of a relay. Different from the conventional XOR-based network coding (CXNC), the relay in our PANC generates network coded bits by considering the coefficients of the source-to-relay channels, and forwards each bit with a pre-optimized power level. Specifically, by defining a symbol pair as two symbols from the two sources, we first derive the exact symbol pair error rate (SPER) of the system. Noting that the generations of the exact SPER are complicated due to the irregularity of the decision regions caused by random channel coefficients, we propose a coordinate transform (CT) method to simplify the derivations of the SPER. Next, we prove that with a power scaling factor at relay, our PANC scheme can achieve full diversity gain, i.e., two-order diversity gain, of the system, while the CXNC can only achieve one-order diversity gain due to multi-user interference. In addition, we optimize the power levels at the relay to equivalently minimize the SPER at the destination concerning the relationship between SPER and minimum Euclidean distance of the received constellation. Simulation results show that (1) the SPER derived based on our CT method can well approximate the exact SPER with a much lower complexity; (2) the PANC scheme with power level optimizations and power scaling factor design can achieve full diversity, and obtain a much higher coding gain than the PANC scheme with randomly chosen power levels.
△ Less
Submitted 1 March, 2020;
originally announced March 2020.
-
Efficient, Effective and Well Justified Estimation of Active Nodes within a Cluster
Authors:
Md Mahmudul Hasan,
Shuangqing Wei,
Ramachandran Vaidyanathan
Abstract:
Reliable and efficient estimation of the size of a dynamically changing cluster in an IoT network is critical in its nominal operation. Most previous estimation schemes worked with relatively smaller frame size and large number of rounds. Here we propose a new estimator named \textquotedblleft Gaussian Estimator of Active Nodes,\textquotedblright (GEAN), that works with large enough frame size und…
▽ More
Reliable and efficient estimation of the size of a dynamically changing cluster in an IoT network is critical in its nominal operation. Most previous estimation schemes worked with relatively smaller frame size and large number of rounds. Here we propose a new estimator named \textquotedblleft Gaussian Estimator of Active Nodes,\textquotedblright (GEAN), that works with large enough frame size under which testing statistics is well approximated as a Gaussian variable, thereby requiring less number of frames, and thus less total number of channel slots to attain a desired accuracy in estimation. More specifically, the selection of the frame size is done according to Triangular Array Central Limit Theorem which also enables us to quantify the approximation error. Larger frame size helps the statistical average to converge faster to the ensemble mean of the estimator and the quantification of the approximation error helps to determine the number of rounds to keep up with the accuracy requirements. We present the analysis of our scheme under two different channel models i.e. $ \{0,1 \} $ and $ \{0,1,e \} $, whereas all previous schemes worked only under $ \{0,1 \} $ channel model. The overall performance of GEAN is better than the previously proposed schemes considering the number of slots required for estimation to achieve a given level of estimation accuracy.
△ Less
Submitted 26 January, 2020;
originally announced January 2020.
-
Data-driven Method for 3D Axis-symmetric Object Reconstruction from Single Cone-beam Projection Data
Authors:
Shousheng Luo,
Ruyue Meng,
Suhua Wei,
Jianfeng Cai,
Xuecheng Tai,
Yang Wang
Abstract:
In this paper we consider 3D axis-symmetric (AS) object reconstruction from single cone-beam x-ray projection data. Traditional x-ray CT fails to capture fleeting state of material due to the long time for data acquisition at all angles. Therefore, AS object is devised to investigate the instant deformation of material under pulse change of environment because single projection data is enough to r…
▽ More
In this paper we consider 3D axis-symmetric (AS) object reconstruction from single cone-beam x-ray projection data. Traditional x-ray CT fails to capture fleeting state of material due to the long time for data acquisition at all angles. Therefore, AS object is devised to investigate the instant deformation of material under pulse change of environment because single projection data is enough to reconstruct its inner structure. Previous reconstruction methods are layer by layer, and ignore the longitudinal tilt of x-ray paths. We propose a regularization method using adaptive tight frame to reconstruct the 3D AS object structure simultaneously. Alternating direction method is adopted to solve the proposed model. More importantly, a numerical algorithm is developed to compute imaging matrix. Experiments on simulation data verify the effectiveness of our method
△ Less
Submitted 16 December, 2019;
originally announced December 2019.
-
Small-footprint Keyword Spotting with Graph Convolutional Network
Authors:
Xi Chen,
Shouyi Yin,
Dandan Song,
Peng Ouyang,
Leibo Liu,
Shaojun Wei
Abstract:
Despite the recent successes of deep neural networks, it remains challenging to achieve high precision keyword spotting task (KWS) on resource-constrained devices. In this study, we propose a novel context-aware and compact architecture for keyword spotting task. Based on residual connection and bottleneck structure, we design a compact and efficient network for KWS task. To leverage the long rang…
▽ More
Despite the recent successes of deep neural networks, it remains challenging to achieve high precision keyword spotting task (KWS) on resource-constrained devices. In this study, we propose a novel context-aware and compact architecture for keyword spotting task. Based on residual connection and bottleneck structure, we design a compact and efficient network for KWS task. To leverage the long range dependencies and global context of the convolutional feature maps, the graph convolutional network is introduced to encode the non-local relations. By evaluated on the Google Speech Command Dataset, the proposed method achieves state-of-the-art performance and outperforms the prior works by a large margin with lower computational cost.
△ Less
Submitted 11 December, 2019;
originally announced December 2019.
-
Sample Mixed-Based Data Augmentation for Domestic Audio Tagging
Authors:
Shengyun Wei,
Kele Xu,
Dezhi Wang,
Feifan Liao,
Huaimin Wang,
Qiuqiang Kong
Abstract:
Audio tagging has attracted increasing attention since last decade and has various potential applications in many fields. The objective of audio tagging is to predict the labels of an audio clip. Recently deep learning methods have been applied to audio tagging and have achieved state-of-the-art performance, which provides a poor generalization ability on new data. However due to the limited size…
▽ More
Audio tagging has attracted increasing attention since last decade and has various potential applications in many fields. The objective of audio tagging is to predict the labels of an audio clip. Recently deep learning methods have been applied to audio tagging and have achieved state-of-the-art performance, which provides a poor generalization ability on new data. However due to the limited size of audio tagging data such as DCASE data, the trained models tend to result in overfitting of the network. Previous data augmentation methods such as pitch shifting, time stretching and adding background noise do not show much improvement in audio tagging. In this paper, we explore the sample mixed data augmentation for the domestic audio tagging task, including mixup, SamplePairing and extrapolation. We apply a convolutional recurrent neural network (CRNN) with attention module with log-scaled mel spectrum as a baseline system. In our experiments, we achieve an state-of-the-art of equal error rate (EER) of 0.10 on DCASE 2016 task4 dataset with mixup approach, outperforming the baseline system without data augmentation.
△ Less
Submitted 11 August, 2018;
originally announced August 2018.
-
Model-Free MLE Estimation for Online Rotor Angle Stability Assessment with PMU Data
Authors:
Shaopan Wei,
Ming Yang,
Junjian Qi,
Jianhui Wang,
Shiying Ma,
Xueshan Han
Abstract:
Recent research has demonstrated that the rotor angle stability can be assessed by identifying the sign of the system maximal Lyapunov exponent (MLE). A positive (negative) MLE implies unstable (stable) rotor angle dynamics. However, because the MLE may fluctuate between positive and negative values for a long time after a severe disturbance, it is difficult to determine the system stability when…
▽ More
Recent research has demonstrated that the rotor angle stability can be assessed by identifying the sign of the system maximal Lyapunov exponent (MLE). A positive (negative) MLE implies unstable (stable) rotor angle dynamics. However, because the MLE may fluctuate between positive and negative values for a long time after a severe disturbance, it is difficult to determine the system stability when observing a positive or negative MLE without knowing its further fluctuation trend. In this paper, a new approach for online rotor angle stability assessment is proposed to address this problem. The MLE is estimated by a recursive least square (RLS) based method based on real-time rotor angle measurements, and two critical parameters, the Theiler window and the MLE estimation initial time step, are carefully chosen to make sure the calculated MLE curves present distinct features for different stability conditions. By using the proposed stability assessment criteria, the developed approach can provide timely and reliable assessment of the rotor angle stability. Extensive tests on the New-England 39-bus system and the Northeast Power Coordinating Council 140-bus system verify the effectiveness of the proposed approach.
△ Less
Submitted 12 February, 2017;
originally announced February 2017.