Search | arXiv e-print repository

arXiv:2406.05652 [pdf, other]

Distributed Combinatorial Optimization of Downlink User Assignment in mmWave Cell-free Massive MIMO Using Graph Neural Networks

Authors: Bile Peng, Bihan Guo, Karl-Ludwig Besser, Luca Kunz, Ramprasad Raghunath, Anke Schmeink, Eduard A Jorswieck, Giuseppe Caire, H. Vincent Poor

Abstract: Millimeter wave (mmWave) cell-free massive MIMO (CF mMIMO) is a promising solution for future wireless communications. However, its optimization is non-trivial due to the challenging channel characteristics. We show that mmWave CF mMIMO optimization is largely an assignment problem between access points (APs) and users due to the high path loss of mmWave channels, the limited output power of the a… ▽ More Millimeter wave (mmWave) cell-free massive MIMO (CF mMIMO) is a promising solution for future wireless communications. However, its optimization is non-trivial due to the challenging channel characteristics. We show that mmWave CF mMIMO optimization is largely an assignment problem between access points (APs) and users due to the high path loss of mmWave channels, the limited output power of the amplifier, and the almost orthogonal channels between users given a large number of AP antennas. The combinatorial nature of the assignment problem, the requirement for scalability, and the distributed implementation of CF mMIMO make this problem difficult. In this work, we propose an unsupervised machine learning (ML) enabled solution. In particular, a graph neural network (GNN) customized for scalability and distributed implementation is introduced. Moreover, the customized GNN architecture is hierarchically permutation-equivariant (HPE), i.e., if the APs or users of an AP are permuted, the output assignment is automatically permuted in the same way. To address the combinatorial problem, we relax it to a continuous problem, and introduce an information entropy-inspired penalty term. The training objective is then formulated using the augmented Lagrangian method (ALM). The test results show that the realized sum-rate outperforms that of the generalized serial dictatorship (GSD) algorithm and is very close to the upper bound in a small network scenario, while the upper bound is impossible to obtain in a large network scenario. △ Less

Submitted 9 June, 2024; originally announced June 2024.

arXiv:2401.12974 [pdf, other]

SegmentAnyBone: A Universal Model that Segments Any Bone at Any Location on MRI

Authors: Hanxue Gu, Roy Colglazier, Haoyu Dong, Jikai Zhang, Yaqian Chen, Zafer Yildiz, Yuwen Chen, Lin Li, Jichen Yang, Jay Willhite, Alex M. Meyer, Brian Guo, Yashvi Atul Shah, Emily Luo, Shipra Rajput, Sally Kuehn, Clark Bulleit, Kevin A. Wu, Jisoo Lee, Brandon Ramirez, Darui Lu, Jay M. Levin, Maciej A. Mazurowski

Abstract: Magnetic Resonance Imaging (MRI) is pivotal in radiology, offering non-invasive and high-quality insights into the human body. Precise segmentation of MRIs into different organs and tissues would be highly beneficial since it would allow for a higher level of understanding of the image content and enable important measurements, which are essential for accurate diagnosis and effective treatment pla… ▽ More Magnetic Resonance Imaging (MRI) is pivotal in radiology, offering non-invasive and high-quality insights into the human body. Precise segmentation of MRIs into different organs and tissues would be highly beneficial since it would allow for a higher level of understanding of the image content and enable important measurements, which are essential for accurate diagnosis and effective treatment planning. Specifically, segmenting bones in MRI would allow for more quantitative assessments of musculoskeletal conditions, while such assessments are largely absent in current radiological practice. The difficulty of bone MRI segmentation is illustrated by the fact that limited algorithms are publicly available for use, and those contained in the literature typically address a specific anatomic area. In our study, we propose a versatile, publicly available deep-learning model for bone segmentation in MRI across multiple standard MRI locations. The proposed model can operate in two modes: fully automated segmentation and prompt-based segmentation. Our contributions include (1) collecting and annotating a new MRI dataset across various MRI protocols, encompassing over 300 annotated volumes and 8485 annotated slices across diverse anatomic regions; (2) investigating several standard network architectures and strategies for automated segmentation; (3) introducing SegmentAnyBone, an innovative foundational model-based approach that extends Segment Anything Model (SAM); (4) comparative analysis of our algorithm and previous approaches; and (5) generalization analysis of our algorithm across different anatomical locations and MRI sequences, as well as an external dataset. We publicly release our model at https://github.com/mazurowski-lab/SegmentAnyBone. △ Less

Submitted 23 January, 2024; originally announced January 2024.

Comments: 15 pages, 15 figures

arXiv:2312.15821 [pdf, other]

Audiobox: Unified Audio Generation with Natural Language Prompts

Authors: Apoorv Vyas, Bowen Shi, Matthew Le, Andros Tjandra, Yi-Chiao Wu, Baishan Guo, Jiemin Zhang, Xinyue Zhang, Robert Adkins, William Ngan, Jeff Wang, Ivan Cruz, Bapi Akula, Akinniyi Akinyemi, Brian Ellis, Rashel Moritz, Yael Yungster, Alice Rakotoarison, Liang Tan, Chris Summers, Carleigh Wood, Joshua Lane, Mary Williamson, Wei-Ning Hsu

Abstract: Audio is an essential part of our life, but creating it often requires expertise and is time-consuming. Research communities have made great progress over the past year advancing the performance of large scale audio generative models for a single modality (speech, sound, or music) through adopting more powerful generative models and scaling data. However, these models lack controllability in sever… ▽ More Audio is an essential part of our life, but creating it often requires expertise and is time-consuming. Research communities have made great progress over the past year advancing the performance of large scale audio generative models for a single modality (speech, sound, or music) through adopting more powerful generative models and scaling data. However, these models lack controllability in several aspects: speech generation models cannot synthesize novel styles based on text description and are limited on domain coverage such as outdoor environments; sound generation models only provide coarse-grained control based on descriptions like "a person speaking" and would only generate mumbling human voices. This paper presents Audiobox, a unified model based on flow-matching that is capable of generating various audio modalities. We design description-based and example-based prompting to enhance controllability and unify speech and sound generation paradigms. We allow transcript, vocal, and other audio styles to be controlled independently when generating speech. To improve model generalization with limited labels, we adapt a self-supervised infilling objective to pre-train on large quantities of unlabeled audio. Audiobox sets new benchmarks on speech and sound generation (0.745 similarity on Librispeech for zero-shot TTS; 0.77 FAD on AudioCaps for text-to-sound) and unlocks new methods for generating audio with novel vocal and acoustic styles. We further integrate Bespoke Solvers, which speeds up generation by over 25 times compared to the default ODE solver for flow-matching, without loss of performance on several tasks. Our demo is available at https://audiobox.metademolab.com/ △ Less

Submitted 25 December, 2023; originally announced December 2023.

arXiv:2312.03374 [pdf, other]

doi 10.1109/MNET.2023.3332893

Implementing Digital Twin in Field-Deployed Optical Networks: Uncertain Factors, Operational Guidance, and Field-Trial Demonstration

Authors: Yuchen Song, Min Zhang, Yao Zhang, Yan Shi, Shikui Shen, Bingli Guo, Shanguo Huang, Danshi Wang

Abstract: Digital twin has revolutionized optical communication networks by enabling their full life-cycle management, including design, troubleshooting, optimization, upgrade, and prediction. While extensive literature exists on frameworks, standards, and applications of digital twin, there is a pressing need in implementing digital twin in field-deployed optical networks operating in real-world environmen… ▽ More Digital twin has revolutionized optical communication networks by enabling their full life-cycle management, including design, troubleshooting, optimization, upgrade, and prediction. While extensive literature exists on frameworks, standards, and applications of digital twin, there is a pressing need in implementing digital twin in field-deployed optical networks operating in real-world environments, as opposed to controlled laboratory settings. This paper addresses this challenge by examining the uncertain factors behind the inaccuracy of digital twin in field-deployed optical networks from three main challenges and proposing operational guidance for implementing accurate digital twin in field-deployed optical networks. Through the proposed guidance, we demonstrate the effective implementation of digital twin in a field-trial C+L-band optical transmission link, showcasing its capabilities in performance recovery in a fiber cut scenario. △ Less

Submitted 6 December, 2023; originally announced December 2023.

Comments: 10 pages, 5 figures Accepted by IEEE Network Magazine, early access

arXiv:2311.03756 [pdf, other]

Learning Decentralized Traffic Signal Controllers with Multi-Agent Graph Reinforcement Learning

Authors: Yao Zhang, Zhiwen Yu, Jun Zhang, Liang Wang, Tom H. Luan, Bin Guo, Chau Yuen

Abstract: This paper considers optimal traffic signal control in smart cities, which has been taken as a complex networked system control problem. Given the interacting dynamics among traffic lights and road networks, attaining controller adaptivity and scalability stands out as a primary challenge. Capturing the spatial-temporal correlation among traffic lights under the framework of Multi-Agent Reinforcem… ▽ More This paper considers optimal traffic signal control in smart cities, which has been taken as a complex networked system control problem. Given the interacting dynamics among traffic lights and road networks, attaining controller adaptivity and scalability stands out as a primary challenge. Capturing the spatial-temporal correlation among traffic lights under the framework of Multi-Agent Reinforcement Learning (MARL) is a promising solution. Nevertheless, existing MARL algorithms ignore effective information aggregation which is fundamental for improving the learning capacity of decentralized agents. In this paper, we design a new decentralized control architecture with improved environmental observability to capture the spatial-temporal correlation. Specifically, we first develop a topology-aware information aggregation strategy to extract correlation-related information from unstructured data gathered in the road network. Particularly, we transfer the road network topology into a graph shift operator by forming a diffusion process on the topology, which subsequently facilitates the construction of graph signals. A diffusion convolution module is developed, forming a new MARL algorithm, which endows agents with the capabilities of graph learning. Extensive experiments based on both synthetic and real-world datasets verify that our proposal outperforms existing decentralized algorithms. △ Less

Submitted 7 November, 2023; originally announced November 2023.

arXiv:2308.14178 [pdf, other]

Data-Driven Robust Control Using Prediction Error Bounds Based on Perturbation Analysis

Authors: Baiwei Guo, Yuning Jiang, Colin N. Jones, Giancarlo Ferrari-Trecate

Abstract: For linear systems, many data-driven control methods rely on the behavioral framework, using historical data of the system to predict the future trajectories. However, measurement noise introduces errors in predictions. When the noise is bounded, we propose a method for designing historical experiments that enable the computation of an upper bound on the prediction error. This approach allows us t… ▽ More For linear systems, many data-driven control methods rely on the behavioral framework, using historical data of the system to predict the future trajectories. However, measurement noise introduces errors in predictions. When the noise is bounded, we propose a method for designing historical experiments that enable the computation of an upper bound on the prediction error. This approach allows us to formulate a minimax control problem where robust constraint satisfaction is enforced. We derive an upper bound on the suboptimality gap of the resulting control input sequence compared to optimal control utilizing accurate measurements. As demonstrated in numerical experiments, the solution derived by our method can achieve constraint satisfaction and a small suboptimality gap despite the measurement noise. △ Less

Submitted 27 August, 2023; originally announced August 2023.

arXiv:2305.01968 [pdf]

DPSeq: A Novel and Efficient Digital Pathology Classifier for Predicting Cancer Biomarkers using Sequencer Architecture

Authors: Min Cen, Xingyu Li, Bangwei Guo, Jitendra Jonnagaddala, Hong Zhang, Xu Steven Xu

Abstract: In digital pathology tasks, transformers have achieved state-of-the-art results, surpassing convolutional neural networks (CNNs). However, transformers are usually complex and resource intensive. In this study, we developed a novel and efficient digital pathology classifier called DPSeq, to predict cancer biomarkers through fine-tuning a sequencer architecture integrating horizon and vertical bidi… ▽ More In digital pathology tasks, transformers have achieved state-of-the-art results, surpassing convolutional neural networks (CNNs). However, transformers are usually complex and resource intensive. In this study, we developed a novel and efficient digital pathology classifier called DPSeq, to predict cancer biomarkers through fine-tuning a sequencer architecture integrating horizon and vertical bidirectional long short-term memory (BiLSTM) networks. Using hematoxylin and eosin (H&E)-stained histopathological images of colorectal cancer (CRC) from two international datasets: The Cancer Genome Atlas (TCGA) and Molecular and Cellular Oncology (MCO), the predictive performance of DPSeq was evaluated in series of experiments. DPSeq demonstrated exceptional performance for predicting key biomarkers in CRC (MSI status, Hypermutation, CIMP status, BRAF mutation, TP53 mutation and chromosomal instability [CING]), outperforming most published state-of-the-art classifiers in a within-cohort internal validation and a cross-cohort external validation. Additionally, under the same experimental conditions using the same set of training and testing datasets, DPSeq surpassed 4 CNN (ResNet18, ResNet50, MobileNetV2, and EfficientNet) and 2 transformer (ViT and Swin-T) models, achieving the highest AUROC and AUPRC values in predicting MSI status, BRAF mutation, and CIMP status. Furthermore, DPSeq required less time for both training and prediction due to its simple architecture. Therefore, DPSeq appears to be the preferred choice over transformer and CNN models for predicting cancer biomarkers. △ Less

Submitted 3 May, 2023; originally announced May 2023.

arXiv:2304.01797 [pdf, ps, other]

Safe Zeroth-Order Optimization Using Linear Programs

Authors: Baiwei Guo, Yang Wang, Yuning Jiang, Maryam Kamgarpour, Giancarlo Ferrari-Trecate

Abstract: To solve unmodeled optimization problems with hard constraints, this paper proposes a novel zeroth-order approach called Safe Zeroth-order Optimization using Linear Programs (SZO-LP). The SZO-LP method solves a linear program in each iteration to find a descent direction, followed by a step length determination. We prove that, under mild conditions, the iterates of SZO-LP have an accumulation poin… ▽ More To solve unmodeled optimization problems with hard constraints, this paper proposes a novel zeroth-order approach called Safe Zeroth-order Optimization using Linear Programs (SZO-LP). The SZO-LP method solves a linear program in each iteration to find a descent direction, followed by a step length determination. We prove that, under mild conditions, the iterates of SZO-LP have an accumulation point that is also the primal of a KKT pair. We then apply SZO-LP to solve an Optimal Power Flow (OPF) problem on the IEEE 30-bus system. The results demonstrate that SZO-LP requires less computation time and samples compared to state-of-the-art approaches. △ Less

Submitted 4 April, 2023; originally announced April 2023.

arXiv:2303.16659 [pdf, other]

Safe Zeroth-Order Optimization Using Quadratic Local Approximations

Authors: Baiwei Guo, Yuning Jiang, Giancarlo Ferrari-Trecate, Maryam Kamgarpour

Abstract: This paper addresses black-box smooth optimization problems, where the objective and constraint functions are not explicitly known but can be queried. The main goal of this work is to generate a sequence of feasible points converging towards a KKT primal-dual pair. Assuming to have prior knowledge on the smoothness of the unknown objective and constraints, we propose a novel zeroth-order method th… ▽ More This paper addresses black-box smooth optimization problems, where the objective and constraint functions are not explicitly known but can be queried. The main goal of this work is to generate a sequence of feasible points converging towards a KKT primal-dual pair. Assuming to have prior knowledge on the smoothness of the unknown objective and constraints, we propose a novel zeroth-order method that iteratively computes quadratic approximations of the constraint functions, constructs local feasible sets and optimizes over them. Under some mild assumptions, we prove that this method returns an $ηいーた$-KKT pair (a property reflecting how close a primal-dual pair is to the exact KKT condition) within $O({1}/{ηいーた^{2}})$ iterations. Moreover, we numerically show that our method can achieve faster convergence compared with some state-of-the-art zeroth-order approaches. The effectiveness of the proposed approach is also illustrated by applying it to nonconvex optimization problems in optimal control and power system operation. △ Less

Submitted 24 April, 2024; v1 submitted 29 March, 2023; originally announced March 2023.

Comments: arXiv admin note: text overlap with arXiv:2211.02645

arXiv:2302.10406 [pdf]

Time to Embrace Natural Language Processing (NLP)-based Digital Pathology: Benchmarking NLP- and Convolutional Neural Network-based Deep Learning Pipelines

Authors: Min Cen, Xingyu Li, Bangwei Guo, Jitendra Jonnagaddala, Hong Zhang, Xu Steven Xu

Abstract: NLP-based computer vision models, particularly vision transformers, have been shown to outperform CNN models in many imaging tasks. However, most digital pathology artificial-intelligence models are based on CNN architectures, probably owing to a lack of data regarding NLP models for pathology images. In this study, we developed digital pathology pipelines to benchmark the five most recently propo… ▽ More NLP-based computer vision models, particularly vision transformers, have been shown to outperform CNN models in many imaging tasks. However, most digital pathology artificial-intelligence models are based on CNN architectures, probably owing to a lack of data regarding NLP models for pathology images. In this study, we developed digital pathology pipelines to benchmark the five most recently proposed NLP models (vision transformer (ViT), Swin Transformer, MobileViT, CMT, and Sequencer2D) and four popular CNN models (ResNet18, ResNet50, MobileNetV2, and EfficientNet) to predict biomarkers in colorectal cancer (microsatellite instability, CpG island methylator phenotype, and BRAF mutation). Hematoxylin and eosin-stained whole-slide images from Molecular and Cellular Oncology and The Cancer Genome Atlas were used as training and external validation datasets, respectively. Cross-study external validations revealed that the NLP-based models significantly outperformed the CNN-based models in biomarker prediction tasks, improving the overall prediction and precision up to approximately 10% and 26%, respectively. Notably, compared with existing models in the current literature using large training datasets, our NLP models achieved state-of-the-art predictions for all three biomarkers using a relatively small training dataset, suggesting that large training datasets are not a prerequisite for NLP models or transformers, and NLP may be more suitable for clinical studies in which small training datasets are commonly collected. The superior performance of Sequencer2D suggests that further research and innovation on both transformer and bidirectional long short-term memory architectures are warranted in the field of digital pathology. NLP models can replace classic CNN architectures and become the new workhorse backbone in the field of digital pathology. △ Less

Submitted 20 February, 2023; originally announced February 2023.

arXiv:2302.05298 [pdf]

Optical Switching Data Center Networks: Understanding Techniques and Challenges

Authors: Xuwei Xue, Shaojuan Zhang, Bingli Guo, Wei Ji, Rui Yin, Bin Chen, Shanguo Huang

Abstract: Relying on the flexible-access interconnects to the scalable storage and compute resources, data centers deliver critical communications connectivity among numerous servers to support the housed applications and services. To provide the high-speeds and long-distance communications, the data centers have turned to fiber interconnections. With the stringently increased traffic volume, the data cente… ▽ More Relying on the flexible-access interconnects to the scalable storage and compute resources, data centers deliver critical communications connectivity among numerous servers to support the housed applications and services. To provide the high-speeds and long-distance communications, the data centers have turned to fiber interconnections. With the stringently increased traffic volume, the data centers are then expected to further deploy the optical switches into the systems infrastructure to implement the full optical switching. This paper first summarizes the topologies and traffic characteristics in data centers and analyzes the reasons and importance of moving to optical switching. Recent techniques related to the optical switching, and main challenges limiting the practical deployments of optical switches in data centers are also summarized and reported. △ Less

Submitted 13 January, 2023; originally announced February 2023.

arXiv:2301.11831 [pdf, other]

Data Volume-aware Computation Task Scheduling for Smart Grid Data Analytic Applications

Authors: Binquan Guo, Hongyan Li, Ye Yan, Zhou Zhang, Peng Wang

Abstract: Emerging smart grid applications analyze large amounts of data collected from millions of meters and systems to facilitate distributed monitoring and real-time control tasks. However, current parallel data processing systems are designed for common applications, unaware of the massive volume of the collected data, causing long data transfer delay during the computation and slow response time of sm… ▽ More Emerging smart grid applications analyze large amounts of data collected from millions of meters and systems to facilitate distributed monitoring and real-time control tasks. However, current parallel data processing systems are designed for common applications, unaware of the massive volume of the collected data, causing long data transfer delay during the computation and slow response time of smart grid systems. A promising direction to reduce delay is to jointly schedule computation tasks and data transfers. We identify that the smart grid data analytic jobs require the intermediate data among different computation stages to be transmitted orderly to avoid network congestion. This new feature prevents current scheduling algorithms from being efficient. In this work, an integrated computing and communication task scheduling scheme is proposed. The mathematical formulation of smart grid data analytic jobs scheduling problem is given, which is unsolvable by existing optimization methods due to the strongly coupled constraints. Several techniques are combined to linearize it for adapting the Branch and Cut method. Based on the topological information in the job graph, the Topology Aware Branch and Cut method is further proposed to speed up searching for optimal solutions. Numerical results demonstrate the effectiveness of the proposed method. △ Less

Submitted 2 February, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

Comments: Accepted to appear in IEEE ICC 2023. The source code is available at Github: https://github.com/wilixx/ICCTS

arXiv:2301.02357 [pdf]

doi 10.23880/ppej-16000269

Well Cement Degradation and Wellbore Integrity in Geological CO2 Storages: A Literature Review

Authors: Vu Nguyen, Olatunji Olayiwola, Boyun Guo, Ning Liu

Abstract: Carbon capture and storage (CCS) has emerged as the most effective method to curb the CO2 concentration in the atmosphere. It can store up to 5 billion tons of CO2 per year. To guarantee a safe and economical geological storage, the well cement degradation and wellbore integrity need to be studied thoroughly. This review paper is designed to provide a fundamental background of well cement degradat… ▽ More Carbon capture and storage (CCS) has emerged as the most effective method to curb the CO2 concentration in the atmosphere. It can store up to 5 billion tons of CO2 per year. To guarantee a safe and economical geological storage, the well cement degradation and wellbore integrity need to be studied thoroughly. This review paper is designed to provide a fundamental background of well cement degradation and wellbore integrity in geological CO2 storages to support the researchers in further investigation. The review mainly focuses on mechanical, thermal, chemical property changes and corrosion time for cement in experiments and simulation during geological CO2 storage. However, the debonding interface between casing/cement or cement/formation has not been addressed profoundly. A further investigation should inspect how pressure, temperature, and chemical reaction affect the micro-annuli of casing/cement or cement/formation. Also, a mathe-matical model should be established to predict the corrosion rate in geological CO2 storage. △ Less

Submitted 10 January, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

Comments: 17 pages

Journal ref: Petroleum & Petrochemical Engineering 5 (2021)

arXiv:2211.06161 [pdf, other]

Spatial Temporal Graph Convolution with Graph Structure Self-learning for Early MCI Detection

Authors: Yunpeng Zhao, Fugen Zhou, Bin Guo, Bo Liu

Abstract: Graph neural networks (GNNs) have been successfully applied to early mild cognitive impairment (EMCI) detection, with the usage of elaborately designed features constructed from blood oxygen level-dependent (BOLD) time series. However, few works explored the feasibility of using BOLD signals directly as features. Meanwhile, existing GNN-based methods primarily rely on hand-crafted explicit brain t… ▽ More Graph neural networks (GNNs) have been successfully applied to early mild cognitive impairment (EMCI) detection, with the usage of elaborately designed features constructed from blood oxygen level-dependent (BOLD) time series. However, few works explored the feasibility of using BOLD signals directly as features. Meanwhile, existing GNN-based methods primarily rely on hand-crafted explicit brain topology as the adjacency matrix, which is not optimal and ignores the implicit topological organization of the brain. In this paper, we propose a spatial temporal graph convolutional network with a novel graph structure self-learning mechanism for EMCI detection. The proposed spatial temporal graph convolution block directly exploits BOLD time series as input features, which provides an interesting view for rsfMRI-based preclinical AD diagnosis. Moreover, our model can adaptively learn the optimal topological structure and refine edge weights with the graph structure self-learning mechanism. Results on the Alzheimer's Disease Neuroimaging Initiative (ADNI) database show that our method outperforms state-of-the-art approaches. Biomarkers consistent with previous studies can be extracted from the model, proving the reliable interpretability of our method. △ Less

Submitted 11 November, 2022; originally announced November 2022.

Comments: 5 pages, 3 figures

arXiv:2211.02645 [pdf, ps, other]

Safe Zeroth-Order Convex Optimization Using Quadratic Local Approximations

Authors: Baiwei Guo, Yuning Jiang, Maryam Kamgarpour, Giancarlo Ferrari-Trecate

Abstract: We address black-box convex optimization problems, where the objective and constraint functions are not explicitly known but can be sampled within the feasible set. The challenge is thus to generate a sequence of feasible points converging towards an optimal solution. By leveraging the knowledge of the smoothness properties of the objective and constraint functions, we propose a novel zeroth-order… ▽ More We address black-box convex optimization problems, where the objective and constraint functions are not explicitly known but can be sampled within the feasible set. The challenge is thus to generate a sequence of feasible points converging towards an optimal solution. By leveraging the knowledge of the smoothness properties of the objective and constraint functions, we propose a novel zeroth-order method, SZO-QQ, that iteratively computes quadratic approximations of the constraint functions, constructs local feasible sets and optimizes over them. We prove convergence of the sequence of the objective values generated at each iteration to the minimum. Through experiments, we show that our method can achieve faster convergence compared with state-of-the-art zeroth-order approaches to convex optimization. △ Less

Submitted 7 November, 2022; v1 submitted 4 November, 2022; originally announced November 2022.

arXiv:2209.11485 [pdf, other]

Optimal Job Scheduling and Bandwidth Augmentation in Hybrid Data Center Networks

Authors: Binquan Guo, Zhou Zhang, Ye Yan, Hongyan Li

Abstract: Optimizing data transfers is critical for improving job performance in data-parallel frameworks. In the hybrid data center with both wired and wireless links, reconfigurable wireless links can provide additional bandwidth to speed up job execution. However, it requires the scheduler and transceivers to make joint decisions under coupled constraints. In this work, we identify that the joint job sch… ▽ More Optimizing data transfers is critical for improving job performance in data-parallel frameworks. In the hybrid data center with both wired and wireless links, reconfigurable wireless links can provide additional bandwidth to speed up job execution. However, it requires the scheduler and transceivers to make joint decisions under coupled constraints. In this work, we identify that the joint job scheduling and bandwidth augmentation problem is a complex mixed integer nonlinear problem, which is not solvable by existing optimization methods. To address this bottleneck, we transform it into an equivalent problem based on the coupling of its heuristic bounds, the revised data transfer representation and non-linear constraints decoupling and reformulation, such that the optimal solution can be efficiently acquired by the Branch and Bound method. Based on the proposed method, the performance of job scheduling with and without bandwidth augmentation is studied. Experiments show that the performance gain depends on multiple factors, especially the data size. Compared with existing solutions, our method can averagely reduce the job completion time by up to 10% under the setting of production scenario. △ Less

Submitted 23 September, 2022; originally announced September 2022.

Comments: Accepted to appear in IEEE GLOBECOM 2022

arXiv:2208.10495 [pdf]

Predicting microsatellite instability and key biomarkers in colorectal cancer from H&E-stained images: Achieving SOTA predictive performance with fewer data using Swin Transformer

Authors: Bangwei Guo, Xingyu Li, Jitendra Jonnagaddala, Hong Zhang, Xu Steven Xu

Abstract: Artificial intelligence (AI) models have been developed for predicting clinically relevant biomarkers, including microsatellite instability (MSI), for colorectal cancers (CRC). However, the current deep-learning networks are data-hungry and require large training datasets, which are often lacking in the medical domain. In this study, based on the latest Hierarchical Vision Transformer using Shifte… ▽ More Artificial intelligence (AI) models have been developed for predicting clinically relevant biomarkers, including microsatellite instability (MSI), for colorectal cancers (CRC). However, the current deep-learning networks are data-hungry and require large training datasets, which are often lacking in the medical domain. In this study, based on the latest Hierarchical Vision Transformer using Shifted Windows (Swin-T), we developed an efficient workflow for biomarkers in CRC (MSI, hypermutation, chromosomal instability, CpG island methylator phenotype, BRAF, and TP53 mutation) that only required relatively small datasets, but achieved the state-of-the-art (SOTA) predictive performance. Our Swin-T workflow not only substantially outperformed published models in an intra-study cross-validation experiment using TCGA-CRC-DX dataset (N = 462), but also showed excellent generalizability in cross-study external validation and delivered a SOTA AUROC of 0.90 for MSI using the MCO dataset for training (N = 1065) and the same TCGA-CRC-DX for testing. Similar performance (AUROC=0.91) was achieved by Echle and colleagues using approximately 8000 training samples (ResNet18) on the same testing dataset. Swin-T was extremely efficient using small training datasets and exhibits robust predictive performance with only 200-500 training samples. These data indicate that Swin-T may be 5-10 times more efficient than the current state-of-the-art algorithms for MSI based on ResNet18 and ShuffleNet. Furthermore, the Swin-T models showed promise as pre-screening tests for MSI status and BRAF mutation status, which could exclude and reduce the samples before the subsequent standard testing in a cascading diagnostic workflow to allow turnaround time reduction and cost saving. △ Less

Submitted 11 September, 2022; v1 submitted 21 August, 2022; originally announced August 2022.

arXiv:2206.02308 [pdf, other]

Reconfigurable intelligent surfaces: Channel characterization and modeling

Authors: Jie Huang, Cheng-Xiang Wang, Yingzhuo Sun, Rui Feng, Jialing Huang, Bolun Guo, Zhimeng Zhong, Tie Jun Cui

Abstract: Reconfigurable intelligent surfaces (RISs) are two dimensional (2D) metasurfaces which can intelligently manipulate electromagnetic waves by low-cost near passive reflecting elements. RIS is viewed as a potential key technology for the sixth generation (6G) wireless communication systems mainly due to its advantages in tuning wireless signals, thus smartly controlling propagation environments. In… ▽ More Reconfigurable intelligent surfaces (RISs) are two dimensional (2D) metasurfaces which can intelligently manipulate electromagnetic waves by low-cost near passive reflecting elements. RIS is viewed as a potential key technology for the sixth generation (6G) wireless communication systems mainly due to its advantages in tuning wireless signals, thus smartly controlling propagation environments. In this paper, we aim at addressing channel characterization and modeling issues of RIS-assisted wireless communication systems. At first, the concept, principle, and potential applications of RIS are given. An overview of RIS based channel measurements and experiments is presented by classifying frequency bands, scenarios, system configurations, RIS constructions, experiment purposes, and channel observations. Then, RIS based channel characteristics are studied, including reflection and transmission, Doppler effect and multipath fading mitigation, channel reciprocity, channel hardening, rank improvement, far field and near field, etc. RIS based channel modeling works are investigated, including largescale path loss models and small-scale multipath fading models. At last, future research directions related to RIS-assisted channels are also discussed. △ Less

Submitted 5 June, 2022; originally announced June 2022.

arXiv:2105.10280 [pdf, other]

doi 10.1109/TAC.2022.3180692

Near-Optimal Design of Safe Output Feedback Controllers from Noisy Data

Authors: Luca Furieri, Baiwei Guo, Andrea Martin, Giancarlo Ferrari-Trecate

Abstract: As we transition towards the deployment of data-driven controllers for black-box cyberphysical systems, complying with hard safety constraints becomes a primary concern. Two key aspects should be addressed when input-output data are corrupted by noise: how much uncertainty can one tolerate without compromising safety, and to what extent is the control performance affected? By focusing on finite-ho… ▽ More As we transition towards the deployment of data-driven controllers for black-box cyberphysical systems, complying with hard safety constraints becomes a primary concern. Two key aspects should be addressed when input-output data are corrupted by noise: how much uncertainty can one tolerate without compromising safety, and to what extent is the control performance affected? By focusing on finite-horizon constrained linear-quadratic problems, we provide an answer to these questions in terms of the model mismatch incurred during a preliminary identification phase. We propose a control design procedure based on a quasiconvex relaxation of the original robust problem and we prove that, if the uncertainty is sufficiently small, the synthesized controller is safe and near-optimal, in the sense that the suboptimality gap increases linearly with the model mismatch level. Since the proposed method is independent of the specific identification procedure, our analysis holds in combination with state-of-the-art behavioral estimators beyond standard least-squares. The main theoretical results are validated by numerical experiments. △ Less

Submitted 12 September, 2022; v1 submitted 21 May, 2021; originally announced May 2021.

Comments: Baiwei Guo and Andrea Martin contributed equally to this work. in IEEE Transactions on Automatic Control, 2022

arXiv:2104.01850 [pdf, other]

Actuator Placement for Structural Controllability beyond Strong Connectivity and towards Robustness

Authors: Baiwei Guo, Orcun Karaca, Sepide Azhdari, Maryam Kamgarpour, Giancarlo Ferrari-Trecate

Abstract: Actuator placement is a fundamental problem in control design for large-scale networks. In this paper, we study the problem of finding a set of actuator positions by minimizing a given metric, while satisfying a structural controllability requirement and a constraint on the number of actuators. We first extend the classical forward greedy algorithm for applications to graphs that are not necessari… ▽ More Actuator placement is a fundamental problem in control design for large-scale networks. In this paper, we study the problem of finding a set of actuator positions by minimizing a given metric, while satisfying a structural controllability requirement and a constraint on the number of actuators. We first extend the classical forward greedy algorithm for applications to graphs that are not necessarily strongly connected. We then improve this greedy algorithm by extending its horizon. This is done by evaluating the actuator position set expansions at the further steps of the classical greedy algorithm. We prove that this new method attains a better performance, when this evaluation considers the final actuator position set. Moreover, we study the problem of minimal backup placements. The goal is to ensure that the system stays structurally controllable even when any of the selected actuators goes offline, with minimum number of backup actuators. We show that this problem is equivalent to the well-studied NP-hard hitting set problem. Our results are verified by a numerical case study. △ Less

Submitted 10 April, 2021; v1 submitted 5 April, 2021; originally announced April 2021.

arXiv:2103.02553 [pdf, other]

Finite-sample-based Spectral Radius Estimation and Stabilizability Test for Networked Control Systems

Authors: Liang Xu, Baiwei Guo, Giancarlo Ferrari-Trecate

Abstract: In the analysis and control of discrete-time linear time-invariant systems, the spectral radius of the system state matrix plays an essential role. Usually, it is assumed that system matrices are known, from which the spectral radius can be directly computed. Instead, we consider the setting where the system is affected by process noise, and one has only finitely many samples of system input and s… ▽ More In the analysis and control of discrete-time linear time-invariant systems, the spectral radius of the system state matrix plays an essential role. Usually, it is assumed that system matrices are known, from which the spectral radius can be directly computed. Instead, we consider the setting where the system is affected by process noise, and one has only finitely many samples of system input and state measurements. We provide two methods for estimating the spectral radius and derive error bounds that hold with high probability. Moreover, we show how to use the derived results to test stabilizability for networked control systems (NCSs) with lossy channels when only finitely many samples of the system input, state, and packet drop sequence are available. △ Less

Submitted 23 March, 2022; v1 submitted 3 March, 2021; originally announced March 2021.

arXiv:2102.13338 [pdf, other]

A Behavioral Input-Output Parametrization of Control Policies with Suboptimality Guarantees

Authors: Luca Furieri, Baiwei Guo, Andrea Martin, Giancarlo Ferrari-Trecate

Abstract: Recent work in data-driven control has revived behavioral theory to perform a variety of complex control tasks, by directly plugging libraries of past input-output trajectories into optimal control problems. Despite recent advances, a key aspect remains unclear: how and to what extent do noise-corrupted data impact control performance? In this work, we provide a quantitative answer to this questio… ▽ More Recent work in data-driven control has revived behavioral theory to perform a variety of complex control tasks, by directly plugging libraries of past input-output trajectories into optimal control problems. Despite recent advances, a key aspect remains unclear: how and to what extent do noise-corrupted data impact control performance? In this work, we provide a quantitative answer to this question. We formulate a Behavioral version of the Input-Output Parametrization (BIOP) for the optimal predictive control of unknown systems using output-feedback dynamic control policies. The main advantages of the proposed framework are that 1) the state-space parameters and the initial state need not be specified for controller synthesis, 2) it can be used in combination with state-of-the-art impulse response estimators, and 3) it allows to recover suboptimality results on learning the Linear Quadratic Gaussian (LQG) controller, therefore revealing, in a quantitative way, how the level of noise in the data affects the performance of behavioral methods. Specifically, it is shown that the performance degrades linearly with the prediction error of the behavioral model. We conclude the paper with numerical experiments to validate our results. △ Less

Submitted 24 March, 2021; v1 submitted 26 February, 2021; originally announced February 2021.

Comments: Baiwei Guo and Andrea Martin contributed equally to this work

arXiv:2101.00488 [pdf, other]

Non-conservative Design of Robust Tracking Controllers Based on Input-output Data

Authors: Liang Xu, Mustafa Sahin Turan, Baiwei Guo, Giancarlo Ferrari-Trecate

Abstract: This paper studies worst-case robust optimal tracking using noisy input-output data. We utilize behavioral system theory to represent system trajectories, while avoiding explicit system identification. We assume that the recent output data used in the data-dependent representation are noisy and we provide a non-conservative design procedure for robust control based on optimization with a linear co… ▽ More This paper studies worst-case robust optimal tracking using noisy input-output data. We utilize behavioral system theory to represent system trajectories, while avoiding explicit system identification. We assume that the recent output data used in the data-dependent representation are noisy and we provide a non-conservative design procedure for robust control based on optimization with a linear cost and LMI constraints. Our methods rely on the parameterization of noise sequences compatible with the data-dependent system representation and on a suitable reformulation of the performance specification, which further enable the application of the S-lemma to derive an LMI optimization problem. The performance of the new controller is discussed through simulations. △ Less

Submitted 24 June, 2021; v1 submitted 2 January, 2021; originally announced January 2021.

arXiv:2011.06383 [pdf, ps, other]

Extended Dynamics Observer for Linear Systems with Disturbance

Authors: Hongyinping Feng, Bao-Zhu Guo

Abstract: This is the last part of four series papers, aiming at stabilization for signal-input-signaloutput (SISO) linear finite-dimensional systems corrupted by general input disturbances. A new observer, referred to as Extended Dynamics Observer (EDO), is proposed to estimate both the state and disturbance simultaneously. The working mechanism of EDO consists of two parts: The disturbance with known dyna… ▽ More This is the last part of four series papers, aiming at stabilization for signal-input-signaloutput (SISO) linear finite-dimensional systems corrupted by general input disturbances. A new observer, referred to as Extended Dynamics Observer (EDO), is proposed to estimate both the state and disturbance simultaneously. The working mechanism of EDO consists of two parts: The disturbance with known dynamics is canceled completely by its dynamics and the disturbance with unknown dynamics is absorbed by high-gain. It is found that the high-gain is always working as long as the control plant with unknown input disturbance is observable which is the only assumption for the observer design. When the disturbance dynamics are completely unknown except some boundedness, the EDO is reduced to an extension of the well-known extended state observer or high-gain observer. The main advantage of the developed method is that the prior information about both the control plant and the disturbance can be utilized as much as possible. The more the prior information we have, the better performance the observer would be. An EDO based stabilizing output feedback is also developed in the spirit of estimation/cancellation strategy. The stability of the resulting closed-loop system is established and some of the theoretical results are validated by numerical simulations. △ Less

Submitted 12 November, 2020; originally announced November 2020.

Comments: 31 pages, 4 figures, this is the last part of the series works: dynamics compensation for linear systems

MSC Class: 93C05; 93B52; 47A62

arXiv:2009.01643 [pdf, ps, other]

Dynamics Compensation in Observation of Abstract Linear Systems

Authors: Hongyinping Feng, Xiao-Hui Wu, Bao-Zhu Guo

Abstract: This is the second part of four series papers, aiming at the problem of sensor dynamics compensation for abstract linear systems. Two major issues are addressed. The first one is about the sensor dynamics compensation in system observation and the second one is on the disturbance dynamics compensation in output regulation for linear system. Both of them can be described by the problem of state obs… ▽ More This is the second part of four series papers, aiming at the problem of sensor dynamics compensation for abstract linear systems. Two major issues are addressed. The first one is about the sensor dynamics compensation in system observation and the second one is on the disturbance dynamics compensation in output regulation for linear system. Both of them can be described by the problem of state observation for an abstract cascade system. We consider these two apparently different problems from the same abstract linear system point of view. A new scheme of the observer design for the abstract cascade system is developed and the exponential convergence of the observation error is established. It is shown that the error based observer design in the problem of output regulation can be converted into a sensor dynamics compensation problem by the well known regulator equations. As a result, a tracking error based observer for output regulation problem is designed by exploiting the developed method. As applications, the ordinary differential equations (ODEs) with output time-delay and an unstable heat equation with ODE sensor dynamics are fully investigated to validate the theoretical results. The numerical simulations for the unstable heat system are carried out to validate the proposed method visually. △ Less

Submitted 3 September, 2020; originally announced September 2020.

Comments: 29 pages, 2 figures, this is the second part of the series works: dynamics compensation for linear systems

MSC Class: 93C05; 93B52; 47A62

arXiv:2008.11333 [pdf, ps, other]

Actuator Dynamics Compensation in Stabilization of Abstract Linear Systems

Authors: Hongyinping Feng, Xiao-Hui Wu, Bao-Zhu Guo

Abstract: This is the first part of four series papers, aiming at the problem of actuator dynamics compensation for linear systems. We consider the stabilization of a type of cascade abstract linear systems which model the actuator dynamics compensation for linear systems where both the control plant and its actuator dynamics can be infinite-dimensional. We develop a systematic way to stabilize the cascade… ▽ More This is the first part of four series papers, aiming at the problem of actuator dynamics compensation for linear systems. We consider the stabilization of a type of cascade abstract linear systems which model the actuator dynamics compensation for linear systems where both the control plant and its actuator dynamics can be infinite-dimensional. We develop a systematic way to stabilize the cascade systems by a full state feedback. Both the well-posedness and the exponential stability of the resulting closed-loop system are established in the abstract framework. A sufficient condition of the existence of compensator for ordinary differential equation (ODE) with partial differential equation (PDE) actuator dynamics is obtained. The feedback design is based on a novelly constructed upper-block-triangle transform and the Lyapunov function design is not needed in the stability analysis. As applications, an ODE with input delay and an unstable heat equation with ODE actuator dynamics are investigated to validate the theoretical results. The numerical simulations for the unstable heat system are carried out to validate the proposed approach visually. △ Less

Submitted 25 August, 2020; originally announced August 2020.

Comments: 27 pages, 5 figures, this is the first part of the series works: dynamics compensation for linear systems

MSC Class: 93C05; 93B52; 47A62

arXiv:2006.11566 [pdf]

doi 10.1109/ICCSNT47585.2019.8962465

A Computer Vision Based Beamforming Scheme for Millimeter Wave Communication in LOS Scenarios

Authors: Tianqi Xiang, Yaxin Wang, Huiwen Li, Boren Guo, Xin Zhang

Abstract: A novel location-aware beamforming scheme for millimeter wave communication is proposed for line of sight (LOS) and low mobility scenarios, in which computer vision is introduced to derive the required position or spatial angular information from the image or video captured by camera(s) co-located with mmWave antenna array at base stations. A wireless coverage model is built to investigate the cov… ▽ More A novel location-aware beamforming scheme for millimeter wave communication is proposed for line of sight (LOS) and low mobility scenarios, in which computer vision is introduced to derive the required position or spatial angular information from the image or video captured by camera(s) co-located with mmWave antenna array at base stations. A wireless coverage model is built to investigate the coverage performance and influence of positioning accuracy achieved by convolutional neural network (CNN) for image processing. In addition, videos could be intentionally blurred, or even low-resolution videos could be directly applied, to protect users' privacy with acceptable positioning precision, lower computation complexity and lower camera cost. It is proved by simulations that the beamforming scheme is practicable and the mainstream CNN we employed is sufficient in both aspects of beam directivity accuracy and processing speed in frame per second. △ Less

Submitted 20 June, 2020; originally announced June 2020.

Comments: 7 pages, 10 figures

MSC Class: 94A40

Journal ref: 2019 IEEE 7th International Conference on Computer Science and Network Technology (ICCSNT), Dalian, China, 2019, pp. 401-407

arXiv:2006.07833 [pdf]

A Computer Vision Aided Beamforming Scheme with EM Exposure Control in Outdoor LOS Scenarios

Authors: Tianqi Xiang, Huiwen Li, Boren Guo, Xin Zhang

Abstract: Without any radiation control measures, a large-scale mmWave antenna array at close range may lead to a large amount of electromagnetic exposure of human. In this paper, with the aid of pose detection in computer vision, a beamforming scheme using a novel exposure avoidance method is proposed in outdoor line of sight scenarios. Instead of reducing transmitted power, the proposed method can protect… ▽ More Without any radiation control measures, a large-scale mmWave antenna array at close range may lead to a large amount of electromagnetic exposure of human. In this paper, with the aid of pose detection in computer vision, a beamforming scheme using a novel exposure avoidance method is proposed in outdoor line of sight scenarios. Instead of reducing transmitted power, the proposed method can protect the vulnerable parts of human body from electromagnetic exposure during transmission by deviating the transmission beams from vulnerable parts. Besides, a finer beam management granularity is adopted to better balance the trade-off between exposure reduction and communication quality loss, because finer beams can provide more adjustability for finding the beam that reduces exposure without excessively reducing the link quality. The proposed exposure avoidance method is validated in simulations, and the results show that the finer beam management granularity can guarantee communication quality while reducing the electromagnetic exposure. △ Less

Submitted 28 July, 2020; v1 submitted 14 June, 2020; originally announced June 2020.

Comments: 6 pages, 10 figures

MSC Class: 94A40

arXiv:1905.08948 [pdf, other]

Multi-agent Attentional Activity Recognition

Authors: Kaixuan Chen, Lina Yao, Dalin Zhang, Bin Guo, Zhiwen Yu

Abstract: Multi-modality is an important feature of sensor based activity recognition. In this work, we consider two inherent characteristics of human activities, the spatially-temporally varying salience of features and the relations between activities and corresponding body part motions. Based on these, we propose a multi-agent spatial-temporal attention model. The spatial-temporal attention mechanism hel… ▽ More Multi-modality is an important feature of sensor based activity recognition. In this work, we consider two inherent characteristics of human activities, the spatially-temporally varying salience of features and the relations between activities and corresponding body part motions. Based on these, we propose a multi-agent spatial-temporal attention model. The spatial-temporal attention mechanism helps intelligently select informative modalities and their active periods. And the multiple agents in the proposed model represent activities with collective motions across body parts by independently selecting modalities associated with single motions. With a joint recognition goal, the agents share gained information and coordinate their selection policies to learn the optimal recognition model. The experimental results on four real-world datasets demonstrate that the proposed model outperforms the state-of-the-art methods. △ Less

Submitted 22 May, 2019; originally announced May 2019.

Comments: Accepted by IJCAI 2019

arXiv:1903.08120 [pdf, other]

doi 10.1109/CDC40024.2019.9030204

Actuator Placement for Optimizing Network Performance under Controllability Constraints

Authors: Baiwei Guo, Orcun Karaca, Tyler Summers, Maryam Kamgarpour

Abstract: With the rising importance of large-scale network control, the problem of actuator placement has received increasing attention. Our goal in this paper is to find a set of actuators minimizing the metric that measures the average energy consumption of the control inputs while ensuring structural controllability of the network. As this problem is intractable, greedy algorithm can be used to obtain a… ▽ More With the rising importance of large-scale network control, the problem of actuator placement has received increasing attention. Our goal in this paper is to find a set of actuators minimizing the metric that measures the average energy consumption of the control inputs while ensuring structural controllability of the network. As this problem is intractable, greedy algorithm can be used to obtain an approximate solution. To provide a performance guarantee for this approach, we first define the submodularity ratio for the metric under consideration and then reformulate the structural controllability constraint as a matroid constraint. This shows that the problem under study can be characterized by a matroid optimization involving a weakly submodular objective function. Then, we derive a novel performance guarantee for the greedy algorithm applied to this class of optimization problems. Finally, we show that the matroid feasibility check for the greedy algorithm can be cast as a maximum matching problem in a certain auxiliary bipartite graph related to the network graph. △ Less

Submitted 19 March, 2019; originally announced March 2019.

Journal ref: IEEE 58th Conference on Decision and Control (CDC), 2019

Showing 1–30 of 30 results for author: Guo, B