Search | arXiv e-print repository

Retrieval-Augmented Generation for Generative Artificial Intelligence in Medicine

Authors: Rui Yang, Yilin Ning, Emilia Keppo, Mingxuan Liu, Chuan Hong, Danielle S Bitterman, Jasmine Chiat Ling Ong, Daniel Shu Wei Ting, Nan Liu

Abstract: Generative artificial intelligence (AI) has brought revolutionary innovations in various fields, including medicine. However, it also exhibits limitations. In response, retrieval-augmented generation (RAG) provides a potential solution, enabling models to generate more accurate contents by leveraging the retrieval of external knowledge. With the rapid advancement of generative AI, RAG can pave the… ▽ More Generative artificial intelligence (AI) has brought revolutionary innovations in various fields, including medicine. However, it also exhibits limitations. In response, retrieval-augmented generation (RAG) provides a potential solution, enabling models to generate more accurate contents by leveraging the retrieval of external knowledge. With the rapid advancement of generative AI, RAG can pave the way for connecting this transformative technology with medical applications and is expected to bring innovations in equity, reliability, and personalization to health care. △ Less

Submitted 18 June, 2024; originally announced June 2024.

arXiv:2406.10492 [pdf, other]

Large Language Models as Event Forecasters

Authors: Libo Zhang, Yue Ning

Abstract: Key elements of human events are extracted as quadruples that consist of subject, relation, object, and timestamp. This representation can be extended to a quintuple by adding a fifth element: a textual summary that briefly describes the event. These quadruples or quintuples, when organized within a specific domain, form a temporal knowledge graph (TKG). Current learning frameworks focus on a few… ▽ More Key elements of human events are extracted as quadruples that consist of subject, relation, object, and timestamp. This representation can be extended to a quintuple by adding a fifth element: a textual summary that briefly describes the event. These quadruples or quintuples, when organized within a specific domain, form a temporal knowledge graph (TKG). Current learning frameworks focus on a few TKG-related tasks, such as predicting an object given a subject and a relation or forecasting the occurrences of multiple types of events (i.e., relation) in the next time window. They typically rely on complex structural and sequential models like graph neural networks (GNNs) and recurrent neural networks (RNNs) to update intermediate embeddings. However, these methods often neglect the contextual information inherent in each quintuple, which can be effectively captured through concise textual descriptions. In this paper, we investigate how large language models (LLMs) can streamline the design of TKG learning frameworks while maintaining competitive accuracy in prediction and forecasting tasks. We develop multiple prompt templates to frame the object prediction (OP) task as a standard question-answering (QA) task, suitable for instruction fine-tuning with an encoder-decoder generative LLM. For multi-event forecasting (MEF), we design simple yet effective prompt templates for each TKG quintuple. This novel approach removes the need for GNNs and RNNs, instead utilizing an encoder-only LLM to generate fixed intermediate embeddings, which are subsequently processed by a prediction head with a self-attention mechanism to forecast potential future relations. Extensive experiments on multiple real-world datasets using various evaluation metrics validate the effectiveness and robustness of our approach. △ Less

Submitted 15 June, 2024; originally announced June 2024.

Comments: 10 pages, 3 figures, 10 tables

arXiv:2406.09455 [pdf, other]

Pandora: Towards General World Model with Natural Language Actions and Video States

Authors: Jiannan Xiang, Guangyi Liu, Yi Gu, Qiyue Gao, Yuting Ning, Yuheng Zha, Zeyu Feng, Tianhua Tao, Shibo Hao, Yemin Shi, Zhengzhong Liu, Eric P. Xing, Zhiting Hu

Abstract: World models simulate future states of the world in response to different actions. They facilitate interactive content creation and provides a foundation for grounded, long-horizon reasoning. Current foundation models do not fully meet the capabilities of general world models: large language models (LLMs) are constrained by their reliance on language modality and their limited understanding of the… ▽ More World models simulate future states of the world in response to different actions. They facilitate interactive content creation and provides a foundation for grounded, long-horizon reasoning. Current foundation models do not fully meet the capabilities of general world models: large language models (LLMs) are constrained by their reliance on language modality and their limited understanding of the physical world, while video models lack interactive action control over the world simulations. This paper makes a step towards building a general world model by introducing Pandora, a hybrid autoregressive-diffusion model that simulates world states by generating videos and allows real-time control with free-text actions. Pandora achieves domain generality, video consistency, and controllability through large-scale pretraining and instruction tuning. Crucially, Pandora bypasses the cost of training-from-scratch by integrating a pretrained LLM (7B) and a pretrained video model, requiring only additional lightweight finetuning. We illustrate extensive outputs by Pandora across diverse domains (indoor/outdoor, natural/urban, human/robot, 2D/3D, etc.). The results indicate great potential of building stronger general world models with larger-scale training. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: Website: https://world-model.maitrix.org/

arXiv:2406.01276 [pdf, other]

EduNLP: Towards a Unified and Modularized Library for Educational Resources

Authors: Zhenya Huang, Yuting Ning, Longhu Qin, Shiwei Tong, Shangzi Xue, Tong Xiao, Xin Lin, Jiayu Liu, Qi Liu, Enhong Chen, Shijing Wang

Abstract: Educational resource understanding is vital to online learning platforms, which have demonstrated growing applications recently. However, researchers and developers always struggle with using existing general natural language toolkits or domain-specific models. The issue raises a need to develop an effective and easy-to-use one that benefits AI education-related research and applications. To bridg… ▽ More Educational resource understanding is vital to online learning platforms, which have demonstrated growing applications recently. However, researchers and developers always struggle with using existing general natural language toolkits or domain-specific models. The issue raises a need to develop an effective and easy-to-use one that benefits AI education-related research and applications. To bridge this gap, we present a unified, modularized, and extensive library, EduNLP, focusing on educational resource understanding. In the library, we decouple the whole workflow to four key modules with consistent interfaces including data configuration, processing, model implementation, and model evaluation. We also provide a configurable pipeline to unify the data usage and model usage in standard ways, where users can customize their own needs. For the current version, we primarily provide 10 typical models from four categories, and 5 common downstream-evaluation tasks in the education domain on 8 subjects for users' usage. The project is released at: https://github.com/bigdata-ustc/EduNLP. △ Less

Submitted 4 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

arXiv:2405.17921 [pdf]

Towards Clinical AI Fairness: Filling Gaps in the Puzzle

Authors: Mingxuan Liu, Yilin Ning, Salinelat Teixayavong, Xiaoxuan Liu, Mayli Mertens, Yuqing Shang, Xin Li, Di Miao, Jie Xu, Daniel Shu Wei Ting, Lionel Tim-Ee Cheng, Jasmine Chiat Ling Ong, Zhen Ling Teo, Ting Fang Tan, Narrendar RaviChandran, Fei Wang, Leo Anthony Celi, Marcus Eng Hock Ong, Nan Liu

Abstract: The ethical integration of Artificial Intelligence (AI) in healthcare necessitates addressing fairness-a concept that is highly context-specific across medical fields. Extensive studies have been conducted to expand the technical components of AI fairness, while tremendous calls for AI fairness have been raised from healthcare. Despite this, a significant disconnect persists between technical adva… ▽ More The ethical integration of Artificial Intelligence (AI) in healthcare necessitates addressing fairness-a concept that is highly context-specific across medical fields. Extensive studies have been conducted to expand the technical components of AI fairness, while tremendous calls for AI fairness have been raised from healthcare. Despite this, a significant disconnect persists between technical advancements and their practical clinical applications, resulting in a lack of contextualized discussion of AI fairness in clinical settings. Through a detailed evidence gap analysis, our review systematically pinpoints several deficiencies concerning both healthcare data and the provided AI fairness solutions. We highlight the scarcity of research on AI fairness in many medical domains where AI technology is increasingly utilized. Additionally, our analysis highlights a substantial reliance on group fairness, aiming to ensure equality among demographic groups from a macro healthcare system perspective; in contrast, individual fairness, focusing on equity at a more granular level, is frequently overlooked. To bridge these gaps, our review advances actionable strategies for both the healthcare and AI research communities. Beyond applying existing AI fairness methods in healthcare, we further emphasize the importance of involving healthcare professionals to refine AI fairness concepts and methods to ensure contextually relevant and ethically sound AI applications in healthcare. △ Less

Submitted 28 May, 2024; originally announced May 2024.

arXiv:2405.13113 [pdf, other]

MAMMOTH-Subaru. II. Diverse Populations of Circumgalactic Ly$αあるふぁ$ Nebulae at Cosmic Noon

Authors: Mingyu Li, Haibin Zhang, Zheng Cai, Yongming Liang, Nobunari Kashikawa, Ke Ma, Xiaohui Fan, J. Xavier Prochaska, Bjorn H. C. Emonts, Xin Wang, Yunjing Wu, Shiwu Zhang, Qiong Li, Sean D. Johnson, Minghao Yue, Fabrizio Arrigoni Battaia, Sebastiano Cantalupo, Joseph F. Hennawi, Satoshi Kikuta, Yuanhang Ning, Masami Ouchi, Rhythm Shimakawa, Ben Wang, Weichen Wang, Zheng Zheng , et al. (1 additional authors not shown)

Abstract: Circumgalactic Lyman-alpha (Ly$αあるふぁ$) nebulae are gaseous halos around galaxies exhibiting luminous extended Ly$αあるふぁ$ emission. This work investigates Ly$αあるふぁ$ nebulae from deep imaging of $\sim12~\mathrm{deg}^2$ sky, targeted by the MAMMOTH-Subaru survey. Utilizing the wide-field capability of Hyper Suprime-Cam (HSC), we present one of the largest blind Ly$αあるふぁ$ nebula selections, including QSO nebulae, Ly… ▽ More Circumgalactic Lyman-alpha (Ly$αあるふぁ$) nebulae are gaseous halos around galaxies exhibiting luminous extended Ly$αあるふぁ$ emission. This work investigates Ly$αあるふぁ$ nebulae from deep imaging of $\sim12~\mathrm{deg}^2$ sky, targeted by the MAMMOTH-Subaru survey. Utilizing the wide-field capability of Hyper Suprime-Cam (HSC), we present one of the largest blind Ly$αあるふぁ$ nebula selections, including QSO nebulae, Ly$αあるふぁ$ blobs, and radio galaxy nebulae down to typical $2σしぐま$ Ly$αあるふぁ$ surface brightness of $(5-10)\times10^{-18}\mathrm{~erg~s^{-1}~cm^{-2}~arcsec^{-2}}$. The sample contains 117 nebulae with Ly$αあるふぁ$ sizes of 40 - 400 kpc, and the most gigantic one spans about 365 kpc, referred to as the Ivory Nebula. Combining with multiwavelength data, we investigate diverse nebula populations and associated galaxies. We find a small fraction of Ly$αあるふぁ$ nebulae have QSOs ($\sim7\%$), luminous infrared galaxies ($\sim1\%$), and radio galaxies ($\sim 2\%$). Remarkably, among the 28 enormous Ly$αあるふぁ$ nebulae (ELANe) exceeding 100 kpc, about $80\%$ are associated with UV-faint galaxies ($M_\mathrm{UV} > -22$), categorized as Type II ELANe. We underscore that Type II ELANe constitute the majority but remain largely hidden in current galaxy and QSO surveys. Dusty starburst and obscured AGN activity are proposed to explain the nature of Type II ELANe. The SED of stacking all Ly$αあるふぁ$ nebulae also reveals signs of massive dusty star-forming galaxies with obscured AGNs. We propose a model to explain the dusty nature where the diverse populations of Ly$αあるふぁ$ nebula capture massive galaxies at different evolutionary stages undergoing violent assembling. Ly$αあるふぁ$ nebulae provide critical insights into the formation and evolution of today's massive cluster galaxies at cosmic noon. △ Less

Submitted 21 May, 2024; originally announced May 2024.

Comments: 26 pages, 10 figures, 3 tables, submitted to ApJS, comments welcome

arXiv:2405.11723 [pdf, ps, other]

Inference with non-differentiable surrogate loss in a general high-dimensional classification framework

Authors: Muxuan Liang, Yang Ning, Maureen A Smith, Ying-Qi Zhao

Abstract: Penalized empirical risk minimization with a surrogate loss function is often used to derive a high-dimensional linear decision rule in classification problems. Although much of the literature focuses on the generalization error, there is a lack of valid inference procedures to identify the driving factors of the estimated decision rule, especially when the surrogate loss is non-differentiable. In… ▽ More Penalized empirical risk minimization with a surrogate loss function is often used to derive a high-dimensional linear decision rule in classification problems. Although much of the literature focuses on the generalization error, there is a lack of valid inference procedures to identify the driving factors of the estimated decision rule, especially when the surrogate loss is non-differentiable. In this work, we propose a kernel-smoothed decorrelated score to construct hypothesis testing and interval estimations for the linear decision rule estimated using a piece-wise linear surrogate loss, which has a discontinuous gradient and non-regular Hessian. Specifically, we adopt kernel approximations to smooth the discontinuous gradient near discontinuity points and approximate the non-regular Hessian of the surrogate loss. In applications where additional nuisance parameters are involved, we propose a novel cross-fitted version to accommodate flexible nuisance estimates and kernel approximations. We establish the limiting distribution of the kernel-smoothed decorrelated score and its cross-fitted version in a high-dimensional setup. Simulation and real data analysis are conducted to demonstrate the validity and superiority of the proposed method. △ Less

Submitted 19 May, 2024; originally announced May 2024.

Comments: 27 pages, 4 figures

arXiv:2405.07413 [pdf]

Unraveling Anisotropic Hybridizations of Solid-state Electrolyte Nano-films in Li-ion Batteries

Authors: Yuanjie Ning, Wenjun Wu, Liang Dai, Shuo Sun, Zhigang Zeng, Dengsong Zhang, Mark B. H. Breese, Chuanbing Cai, Chi Sin Tang, Xinmao Yin

Abstract: Li2WO4 (LWO) is recognized for its potential as a solid-state electrolyte and it has demonstrated the ability to enhance the electrochemical performance of LiCoO2 (LCO) cathodes in Li-ion batteries. However, prior investigations into LWO have predominantly involved polycrystalline structures, thereby lacking a comprehensive understanding of its behavior when interfaced with single crystal systems,… ▽ More Li2WO4 (LWO) is recognized for its potential as a solid-state electrolyte and it has demonstrated the ability to enhance the electrochemical performance of LiCoO2 (LCO) cathodes in Li-ion batteries. However, prior investigations into LWO have predominantly involved polycrystalline structures, thereby lacking a comprehensive understanding of its behavior when interfaced with single crystal systems, particularly those intricately connected to LCO. In this study, we employ pulsed laser deposition (PLD) to epitaxially synthesize LWO nano-films on LCO layers with different orientations. Based on a series of high-resolution synchrotron-based techniques including X-ray absorption spectroscopy (XAS) and X-ray photoemission spectroscopy (XPS), the electronic structure of LWO is carefully scrutinized where a higher main energy level of W5d(eg)-O2p orbitals hybridization in LWO/LCO(104) as compared to LWO/LCO(003) has been observed. This experimental finding is further validated by a comprehensive set of density of states calculations. Furthermore, detailed polarized XAS characterization unveils distinct anisotropy between the two oriented LWO configurations. This comprehensive scientific investigation, harnessing the capabilities of synchrotron-based techniques, provides invaluable insights for future studies, offering guidance for the optimized utilization of LWO as a solid-state electrolyte or modification layer for LCO cathodes in high-powered Li-ion batteries. △ Less

Submitted 12 May, 2024; originally announced May 2024.

Comments: 8 pages, 5 figures

arXiv:2405.03299 [pdf, other]

DarkFed: A Data-Free Backdoor Attack in Federated Learning

Authors: Minghui Li, Wei Wan, Yuxuan Ning, Shengshan Hu, Lulu Xue, Leo Yu Zhang, Yichen Wang

Abstract: Federated learning (FL) has been demonstrated to be susceptible to backdoor attacks. However, existing academic studies on FL backdoor attacks rely on a high proportion of real clients with main task-related data, which is impractical. In the context of real-world industrial scenarios, even the simplest defense suffices to defend against the state-of-the-art attack, 3DFed. A practical FL backdoor… ▽ More Federated learning (FL) has been demonstrated to be susceptible to backdoor attacks. However, existing academic studies on FL backdoor attacks rely on a high proportion of real clients with main task-related data, which is impractical. In the context of real-world industrial scenarios, even the simplest defense suffices to defend against the state-of-the-art attack, 3DFed. A practical FL backdoor attack remains in a nascent stage of development. To bridge this gap, we present DarkFed. Initially, we emulate a series of fake clients, thereby achieving the attacker proportion typical of academic research scenarios. Given that these emulated fake clients lack genuine training data, we further propose a data-free approach to backdoor FL. Specifically, we delve into the feasibility of injecting a backdoor using a shadow dataset. Our exploration reveals that impressive attack performance can be achieved, even when there is a substantial gap between the shadow dataset and the main task dataset. This holds true even when employing synthetic data devoid of any semantic information as the shadow dataset. Subsequently, we strategically construct a series of covert backdoor updates in an optimized manner, mimicking the properties of benign updates, to evade detection by defenses. A substantial body of empirical evidence validates the tangible effectiveness of DarkFed. △ Less

Submitted 6 May, 2024; originally announced May 2024.

Comments: This paper has been accepted by IJCAI 2024

arXiv:2404.16223 [pdf, other]

Deep RAW Image Super-Resolution. A NTIRE 2024 Challenge Survey

Authors: Marcos V. Conde, Florin-Alexandru Vasluianu, Radu Timofte, Jianxing Zhang, Jia Li, Fan Wang, Xiaopeng Li, Zikun Liu, Hyunhee Park, Sejun Song, Changho Kim, Zhijuan Huang, Hongyuan Yu, Cheng Wan, Wending Xiang, Jiamin Lin, Hang Zhong, Qiaosong Zhang, Yue Sun, Xuanwu Yin, Kunlong Zuo, Senyan Xu, Siyuan Jiang, Zhijing Sun, Jiaying Zhu , et al. (10 additional authors not shown)

Abstract: This paper reviews the NTIRE 2024 RAW Image Super-Resolution Challenge, highlighting the proposed solutions and results. New methods for RAW Super-Resolution could be essential in modern Image Signal Processing (ISP) pipelines, however, this problem is not as explored as in the RGB domain. Th goal of this challenge is to upscale RAW Bayer images by 2x, considering unknown degradations such as nois… ▽ More This paper reviews the NTIRE 2024 RAW Image Super-Resolution Challenge, highlighting the proposed solutions and results. New methods for RAW Super-Resolution could be essential in modern Image Signal Processing (ISP) pipelines, however, this problem is not as explored as in the RGB domain. Th goal of this challenge is to upscale RAW Bayer images by 2x, considering unknown degradations such as noise and blur. In the challenge, a total of 230 participants registered, and 45 submitted results during thee challenge period. The performance of the top-5 submissions is reviewed and provided here as a gauge for the current state-of-the-art in RAW Image Super-Resolution. △ Less

Submitted 24 April, 2024; originally announced April 2024.

Comments: CVPR 2024 - NTIRE Workshop

arXiv:2404.03840 [pdf]

FarView: An In-Situ Manufactured Lunar Far Side Radio Array Concept for 21-cm Dark Ages Cosmology

Authors: Ronald S. Polidan, Jack O. Burns, Alex Ignatiev, Alex Hegedus, Jonathan Pober, Nivedita Mahesh, Tzu-Ching Chang, Gregg Hallinan, Yuhong Ning, Judd Bowman

Abstract: FarView is an early-stage concept for a large, low-frequency radio observatory, manufactured in-situ on the lunar far side using metals extracted from the lunar regolith. It consists of 100,000 dipole antennas in compact subarrays distributed over a large area but with empty space between subarrays in a core-halo structure. FarView covers a total area of ~200 km2, has a dense core within the inner… ▽ More FarView is an early-stage concept for a large, low-frequency radio observatory, manufactured in-situ on the lunar far side using metals extracted from the lunar regolith. It consists of 100,000 dipole antennas in compact subarrays distributed over a large area but with empty space between subarrays in a core-halo structure. FarView covers a total area of ~200 km2, has a dense core within the inner ~36 km2, and a ~power-law falloff of antenna density out to ~14 km from the center. With this design, it is relatively easy to identify multiple viable build sites on the lunar far side. The science case for FarView emphasizes the unique capabilities to probe the unexplored Cosmic Dark Ages - identified by the 2020 Astrophysics Decadal Survey as the discovery area for cosmology. FarView will deliver power spectra and tomographic maps tracing the evolution of the Universe from before the birth of the first stars to the beginning of Cosmic Dawn, and potentially provide unique insights into dark matter, early dark energy, neutrino masses, and the physics of inflation. What makes FarView feasible and affordable in the timeframe of the 2030s is that it is manufactured in-situ, utilizing space industrial technologies. This in-situ manufacturing architecture utilizes Earth-built equipment that is transported to the lunar surface to extract metals from the regolith and will use those metals to manufacture most of the array components: dipole antennas, power lines, and silicon solar cell power systems. This approach also enables a long functional lifetime, by permitting servicing and repair of the observatory. The full 100,000 dipole FarView observatory will take 4 - 8 years to build, depending on the realized performance of the manufacturing elements and the lunar delivery scenario. △ Less

Submitted 4 April, 2024; originally announced April 2024.

Comments: 26 pages, 7 figures, 2 tables

arXiv:2403.20085 [pdf, other]

OmniNxt: A Fully Open-source and Compact Aerial Robot with Omnidirectional Visual Perception

Authors: Peize Liu, Chen Feng, Yang Xu, Yan Ning, Hao Xu, Shaojie Shen

Abstract: Adopting omnidirectional Field of View (FoV) cameras in aerial robots vastly improves perception ability, significantly advancing aerial robotics's capabilities in inspection, reconstruction, and rescue tasks. However, such sensors also elevate system complexity, e.g., hardware design, and corresponding algorithm, which limits researchers from utilizing aerial robots with omnidirectional FoV in th… ▽ More Adopting omnidirectional Field of View (FoV) cameras in aerial robots vastly improves perception ability, significantly advancing aerial robotics's capabilities in inspection, reconstruction, and rescue tasks. However, such sensors also elevate system complexity, e.g., hardware design, and corresponding algorithm, which limits researchers from utilizing aerial robots with omnidirectional FoV in their research. To bridge this gap, we propose OmniNxt, a fully open-source aerial robotics platform with omnidirectional perception. We design a high-performance flight controller NxtPX4 and a multi-fisheye camera set for OmniNxt. Meanwhile, the compatible software is carefully devised, which empowers OmniNxt to achieve accurate localization and real-time dense mapping with limited computation resource occupancy. We conducted extensive real-world experiments to validate the superior performance of OmniNxt in practical applications. All the hardware and software are open-access at https://github.com/HKUST-Aerial-Robotics/OmniNxt, and we provide docker images of each crucial module in the proposed system. Project page: https://hkust-aerial-robotics.github.io/OmniNxt. △ Less

Submitted 29 March, 2024; originally announced March 2024.

Comments: Submitted to IROS2024. Open source: https://github.com/HKUST-Aerial-Robotics/OmniNxt. Project page: https://hkust-aerial-robotics.github.io/OmniNxt/

arXiv:2403.17708 [pdf, other]

doi 10.1145/3625468.3652176

Panonut360: A Head and Eye Tracking Dataset for Panoramic Video

Authors: Yutong Xu, Junhao Du, Jiahe Wang, Yuwei Ning, Sihan Zhou Yang Cao

Abstract: With the rapid development and widespread application of VR/AR technology, maximizing the quality of immersive panoramic video services that match users' personal preferences and habits has become a long-standing challenge. Understanding the saliency region where users focus, based on data collected with HMDs, can promote multimedia encoding, transmission, and quality assessment. At the same time,… ▽ More With the rapid development and widespread application of VR/AR technology, maximizing the quality of immersive panoramic video services that match users' personal preferences and habits has become a long-standing challenge. Understanding the saliency region where users focus, based on data collected with HMDs, can promote multimedia encoding, transmission, and quality assessment. At the same time, large-scale datasets are essential for researchers and developers to explore short/long-term user behavior patterns and train AI models related to panoramic videos. However, existing panoramic video datasets often include low-frequency user head or eye movement data through short-term videos only, lacking sufficient data for analyzing users' Field of View (FoV) and generating video saliency regions. Driven by these practical factors, in this paper, we present a head and eye tracking dataset involving 50 users (25 males and 25 females) watching 15 panoramic videos. The dataset provides details on the viewport and gaze attention locations of users. Besides, we present some statistics samples extracted from the dataset. For example, the deviation between head and eye movements challenges the widely held assumption that gaze attention decreases from the center of the FoV following a Gaussian distribution. Our analysis reveals a consistent downward offset in gaze fixations relative to the FoV in experimental settings involving multiple users and videos. That's why we name the dataset Panonut, a saliency weighting shaped like a donut. Finally, we also provide a script that generates saliency distributions based on given head or eye coordinates and pre-generated saliency distribution map sets of each video from the collected eye tracking data. The dataset is available on website: https://dianvrlab.github.io/Panonut360/. △ Less

Submitted 26 March, 2024; originally announced March 2024.

Comments: 7 pages,ACM MMSys'24 accepted

arXiv:2403.06999 [pdf]

Survival modeling using deep learning, machine learning and statistical methods: A comparative analysis for predicting mortality after hospital admission

Authors: Ziwen Wang, Jin Wee Lee, Tanujit Chakraborty, Yilin Ning, Mingxuan Liu, Feng Xie, Marcus Eng Hock Ong, Nan Liu

Abstract: Survival analysis is essential for studying time-to-event outcomes and providing a dynamic understanding of the probability of an event occurring over time. Various survival analysis techniques, from traditional statistical models to state-of-the-art machine learning algorithms, support healthcare intervention and policy decisions. However, there remains ongoing discussion about their comparative… ▽ More Survival analysis is essential for studying time-to-event outcomes and providing a dynamic understanding of the probability of an event occurring over time. Various survival analysis techniques, from traditional statistical models to state-of-the-art machine learning algorithms, support healthcare intervention and policy decisions. However, there remains ongoing discussion about their comparative performance. We conducted a comparative study of several survival analysis methods, including Cox proportional hazards (CoxPH), stepwise CoxPH, elastic net penalized Cox model, Random Survival Forests (RSF), Gradient Boosting machine (GBM) learning, AutoScore-Survival, DeepSurv, time-dependent Cox model based on neural network (CoxTime), and DeepHit survival neural network. We applied the concordance index (C-index) for model goodness-of-fit, and integral Brier scores (IBS) for calibration, and considered the model interpretability. As a case study, we performed a retrospective analysis of patients admitted through the emergency department of a tertiary hospital from 2017 to 2019, predicting 90-day all-cause mortality based on patient demographics, clinicopathological features, and historical data. The results of the C-index indicate that deep learning achieved comparable performance, with DeepSurv producing the best discrimination (DeepSurv: 0.893; CoxTime: 0.892; DeepHit: 0.891). The calibration of DeepSurv (IBS: 0.041) performed the best, followed by RSF (IBS: 0.042) and GBM (IBS: 0.0421), all using the full variables. Moreover, AutoScore-Survival, using a minimal variable subset, is easy to interpret, and can achieve good discrimination and calibration (C-index: 0.867; IBS: 0.044). While all models were satisfactory, DeepSurv exhibited the best discrimination and calibration. In addition, AutoScore-Survival offers a more parsimonious model and excellent interpretability. △ Less

Submitted 4 March, 2024; originally announced March 2024.

arXiv:2403.05235 [pdf]

Fairness-Aware Interpretable Modeling (FAIM) for Trustworthy Machine Learning in Healthcare

Authors: Mingxuan Liu, Yilin Ning, Yuhe Ke, Yuqing Shang, Bibhas Chakraborty, Marcus Eng Hock Ong, Roger Vaughan, Nan Liu

Abstract: The escalating integration of machine learning in high-stakes fields such as healthcare raises substantial concerns about model fairness. We propose an interpretable framework - Fairness-Aware Interpretable Modeling (FAIM), to improve model fairness without compromising performance, featuring an interactive interface to identify a "fairer" model from a set of high-performing models and promoting t… ▽ More The escalating integration of machine learning in high-stakes fields such as healthcare raises substantial concerns about model fairness. We propose an interpretable framework - Fairness-Aware Interpretable Modeling (FAIM), to improve model fairness without compromising performance, featuring an interactive interface to identify a "fairer" model from a set of high-performing models and promoting the integration of data-driven evidence and clinical expertise to enhance contextualized fairness. We demonstrated FAIM's value in reducing sex and race biases by predicting hospital admission with two real-world databases, MIMIC-IV-ED and SGH-ED. We show that for both datasets, FAIM models not only exhibited satisfactory discriminatory performance but also significantly mitigated biases as measured by well-established fairness metrics, outperforming commonly used bias-mitigation methods. Our approach demonstrates the feasibility of improving fairness without sacrificing performance and provides an a modeling mode that invites domain experts to engage, fostering a multidisciplinary effort toward tailored AI fairness. △ Less

Submitted 8 March, 2024; originally announced March 2024.

arXiv:2403.05229 [pdf]

Developing Federated Time-to-Event Scores Using Heterogeneous Real-World Survival Data

Authors: Siqi Li, Yuqing Shang, Ziwen Wang, Qiming Wu, Chuan Hong, Yilin Ning, Di Miao, Marcus Eng Hock Ong, Bibhas Chakraborty, Nan Liu

Abstract: Survival analysis serves as a fundamental component in numerous healthcare applications, where the determination of the time to specific events (such as the onset of a certain disease or death) for patients is crucial for clinical decision-making. Scoring systems are widely used for swift and efficient risk prediction. However, existing methods for constructing survival scores presume that data or… ▽ More Survival analysis serves as a fundamental component in numerous healthcare applications, where the determination of the time to specific events (such as the onset of a certain disease or death) for patients is crucial for clinical decision-making. Scoring systems are widely used for swift and efficient risk prediction. However, existing methods for constructing survival scores presume that data originates from a single source, posing privacy challenges in collaborations with multiple data owners. We propose a novel framework for building federated scoring systems for multi-site survival outcomes, ensuring both privacy and communication efficiency. We applied our approach to sites with heterogeneous survival data originating from emergency departments in Singapore and the United States. Additionally, we independently developed local scores at each site. In testing datasets from each participant site, our proposed federated scoring system consistently outperformed all local models, evidenced by higher integrated area under the receiver operating characteristic curve (iAUC) values, with a maximum improvement of 11.6%. Additionally, the federated score's time-dependent AUC(t) values showed advantages over local scores, exhibiting narrower confidence intervals (CIs) across most time points. The model developed through our proposed method exhibits effective performance on each local site, signifying noteworthy implications for healthcare research. Sites participating in our proposed federated scoring model training gained benefits by acquiring survival models with enhanced prediction accuracy and efficiency. This study demonstrates the effectiveness of our privacy-preserving federated survival score generation framework and its applicability to real-world heterogeneous survival data. △ Less

Submitted 8 March, 2024; originally announced March 2024.

arXiv:2402.12852 [pdf, other]

CCFC++: Enhancing Federated Clustering through Feature Decorrelation

Authors: Jie Yan, Jing Liu, Yi-Zi Ning, Zhong-Yuan Zhang

Abstract: In federated clustering, multiple data-holding clients collaboratively group data without exchanging raw data. This field has seen notable advancements through its marriage with contrastive learning, exemplified by Cluster-Contrastive Federated Clustering (CCFC). However, CCFC suffers from heterogeneous data across clients, leading to poor and unrobust performance. Our study conducts both empirica… ▽ More In federated clustering, multiple data-holding clients collaboratively group data without exchanging raw data. This field has seen notable advancements through its marriage with contrastive learning, exemplified by Cluster-Contrastive Federated Clustering (CCFC). However, CCFC suffers from heterogeneous data across clients, leading to poor and unrobust performance. Our study conducts both empirical and theoretical analyses to understand the impact of heterogeneous data on CCFC. Findings indicate that increased data heterogeneity exacerbates dimensional collapse in CCFC, evidenced by increased correlations across multiple dimensions of the learned representations. To address this, we introduce a decorrelation regularizer to CCFC. Benefiting from the regularizer, the improved method effectively mitigates the detrimental effects of data heterogeneity, and achieves superior performance, as evidenced by a marked increase in NMI scores, with the gain reaching as high as 0.32 in the most pronounced case. △ Less

Submitted 20 February, 2024; originally announced February 2024.

arXiv:2402.06861 [pdf, other]

UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph Construction

Authors: Yansong Ning, Hao Liu

Abstract: Urban knowledge graph has recently worked as an emerging building block to distill critical knowledge from multi-sourced urban data for diverse urban application scenarios. Despite its promising benefits, urban knowledge graph construction (UrbanKGC) still heavily relies on manual effort, hindering its potential advancement. This paper presents UrbanKGent, a unified large language model agent fram… ▽ More Urban knowledge graph has recently worked as an emerging building block to distill critical knowledge from multi-sourced urban data for diverse urban application scenarios. Despite its promising benefits, urban knowledge graph construction (UrbanKGC) still heavily relies on manual effort, hindering its potential advancement. This paper presents UrbanKGent, a unified large language model agent framework, for urban knowledge graph construction. Specifically, we first construct the knowledgeable instruction set for UrbanKGC tasks (such as relational triplet extraction and knowledge graph completion) via heterogeneity-aware and geospatial-infused instruction generation. Moreover, we propose a tool-augmented iterative trajectory refinement module to enhance and refine the trajectories distilled from GPT-4. Through hybrid instruction fine-tuning with augmented trajectories on Llama-2-13B, we obtain the UrbanKGC agent, UrbanKGent-13B. We perform a comprehensive evaluation on two real-world datasets using both human and GPT-4 self-evaluation. The experimental results demonstrate that UrbanKGent-13B not only can significantly outperform 21 baselines in UrbanKGC tasks, but also surpass the state-of-the-art LLM, GPT-4, by more than 10\% with approximately 20 times lower cost. We deploy UrbanKGent-13B to provide online services, which can construct an UrbanKG with thousands of times richer relationships using only one-fifth of the data compared with the existing benchmark. Our data, code, and opensource UrbanKGC agent are available at https://github.com/usail-hkust/UrbanKGent. △ Less

Submitted 9 February, 2024; originally announced February 2024.

Comments: Under review

arXiv:2401.09532 [pdf, other]

Quantifying the escape of Ly$αあるふぁ$ at $z\approx 5-6$: a census of Ly$αあるふぁ$ escape fraction with H$αあるふぁ$ emitting galaxies spectroscopically confirmed by JWST and VLT/MUSE

Authors: Xiaojing Lin, Zheng Cai, Yunjing Wu, Zihao Li, Fengwu Sun, Xiaohui Fan, Zuyi Chen, Mingyu Li, Fuyan Bian, Yuanhang Ning, Linhua Jiang, Gustavo Bruzual, Stephane Charlot, Jacopo Chevallard

Abstract: JWST provides an unprecedented opportunity for unbiased surveys of H$αあるふぁ$-emitting galaxies at $z>4$ with the NIRCam wide-field slitless spectroscopy (WFSS). In this work, we present a census of Ly$αあるふぁ$ escape fraction ($f_{esc, Lyαあるふぁ}$) of 165 star-forming galaxies at $z=4.9-6.3$ using their H$αあるふぁ$ emission directly measured from FRESCO NIRCam/WFSS data. We search for Ly$αあるふぁ$ emission of each H$αあるふぁ$-emitting… ▽ More JWST provides an unprecedented opportunity for unbiased surveys of H$αあるふぁ$-emitting galaxies at $z>4$ with the NIRCam wide-field slitless spectroscopy (WFSS). In this work, we present a census of Ly$αあるふぁ$ escape fraction ($f_{esc, Lyαあるふぁ}$) of 165 star-forming galaxies at $z=4.9-6.3$ using their H$αあるふぁ$ emission directly measured from FRESCO NIRCam/WFSS data. We search for Ly$αあるふぁ$ emission of each H$αあるふぁ$-emitting galaxy in VLT/MUSE data. The overall $f_{esc, Lyαあるふぁ}$ measured by stacking is $f_{esc, Lyαあるふぁ}$ is $0.090\pm0.006$. We find that $f_{esc, Lyαあるふぁ}$ displays a strong dependence on the observed UV slope ($βべーた_{\rm obs}$) and E(B-V), such that the bluest galaxies ($βべーた_{\rm obs}\sim-2.5$) have the largest escape fractions ($f_{\rm esc, Lyαあるふぁ}\approx0.6$), indicative of the crucial role of dust and gas in modulating the escape of Ly$αあるふぁ$ photons. $f_{esc, Lyαあるふぁ}$ is less well related to other parameters, including the UV luminosity and stellar mass, and the variation in $f_{esc, Lyαあるふぁ}$ with them can be explained by their underlying coupling with E(B-V) or $βべーた_{\rm obs}$. Our results suggest a tentative decline in $f_{esc, Lyαあるふぁ}$ at $z\gtrsim 5$, implying increasing intergalactic medium attenuation towards higher redshift. Furthermore, the dependence of $f_{esc, Lyαあるふぁ}$ on $βべーた_{\rm obs}$ is proportional to that of the ionizing photon escape fraction ($f_{\rm esc, LyC}$), indicating the escape of Ly$αあるふぁ$ and ionizing photon may be regulated by similar physical processes. With $f_{esc, Lyαあるふぁ}$ as a proxy to $f_{\rm esc, LyC}$, we infer that UV-faint ($M_{\rm UV}>-16$) galaxies contribute $>70\%$ of the total ionizing emissivity at $z=5-6$. If these relations hold during the epoch of reionization, UV-faint galaxies can contribute the majority of UV photon budget to reionize the Universe. △ Less

Submitted 18 April, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

Comments: 17 pages, 12 figures; Accepted by the ApJS

arXiv:2401.06357 [pdf, other]

doi 10.1103/PhysRevB.109.014431

FeS2 monolayer: a high valence and high-$T_{\rm C}$ Ising ferromagnet

Authors: Ke Yang, Yaozhenghang Ma, Lu Liu, Yueyue Ning, Di Lu, Yuxuan Zhou, Zhongyao Li, Hua Wu

Abstract: Two-dimensional (2D) magnetic materials are of current great interest for their promising applications in spintronics. Strong magnetic coupling and anisotropy are both highly desirable for the achievement of a high temperature magnetic order. Here we propose the unusual high valent FeS$_2$ hexagonal monolayer as such a candidate for a strong Ising 2D ferromagnet (FM), by spin-orbital state analyse… ▽ More Two-dimensional (2D) magnetic materials are of current great interest for their promising applications in spintronics. Strong magnetic coupling and anisotropy are both highly desirable for the achievement of a high temperature magnetic order. Here we propose the unusual high valent FeS$_2$ hexagonal monolayer as such a candidate for a strong Ising 2D ferromagnet (FM), by spin-orbital state analyses, first-principles calculations, and the renormalized spin-wave theory (RSWT). We find that very importantly, the high valent Fe$^{4+}$ ion is in the low-spin state ($t_{2g}^{4}$, $S$=1) with degenerate $t_{2g}$ orbitals rather than the high-spin state ($t_{2g}^{3}e_g^{1}$, $S$=2). It is the low-spin state that allows to carry a large perpendicular orbital moment and then produces a huge single ion anisotropy (SIA) of 25 meV/Fe. Moreover, the negative charge transfer character associated with the unusual high valence, strong Fe $3d$-S $3p$ hybridization, wide bands, and a small band gap all help to establish a strong superexchange. Indeed, our first-principles calculations confirm the strong FM superexchange and the huge perpendicular SIA, both of which are further enhanced by a compressive strain. Then, our RSWT calculations predict that the FM $T_{\rm C}$ is 261 K for the pristine FeS$_2$ monolayer and could be increased to 409 K under the compressive --5\% strain. The high $T_{\rm C}$ is also reproduced by our Monte Carlo (MC) simulations. Therefore, it is worth exploring the high-$T_{\rm C}$ Ising FMs in the high valent 2D magnetic materials with degenerate orbitals. △ Less

Submitted 11 January, 2024; originally announced January 2024.

Comments: 13 pages, 5 figures

Journal ref: Phys. Rev. B 109, 014431 (2024)

arXiv:2401.05920 [pdf, other]

The Magellan M2FS spectroscopic survey of high-redshift galaxies: the brightest Lyman-break galaxies at $z \sim 6$

Authors: Shuqi Fu, Linhua Jiang, Yuanhang Ning, Weiyang Liu, Zhiwei Pan

Abstract: We present a study of a sample of 45 spectroscopically confirmed, UV luminous galaxies at $z\sim 6$. They were selected as bright Lyman-break galaxies (LBGs) using deep multi-band optical images in more than 2 deg$^2$ of the sky, and subsequently identified via their strong Ly$αあるふぁ$ emission. The majority of these LBGs span an absolute UV magnitude range from $-22.0$ to $-20.5$ mag with Ly$αあるふぁ$ equival… ▽ More We present a study of a sample of 45 spectroscopically confirmed, UV luminous galaxies at $z\sim 6$. They were selected as bright Lyman-break galaxies (LBGs) using deep multi-band optical images in more than 2 deg$^2$ of the sky, and subsequently identified via their strong Ly$αあるふぁ$ emission. The majority of these LBGs span an absolute UV magnitude range from $-22.0$ to $-20.5$ mag with Ly$αあるふぁ$ equivalent width (EW) between $\sim$10 and $\sim$200 Å, representing the most luminous galaxies at $z\sim 6$ in terms of both UV continuum emission and Ly$αあるふぁ$ line emission. We model the SEDs of 10 LBGs that have deep infrared observations from HST, JWST, and/or Spitzer, and find that they have a wide range of stellar masses and ages. They also have high star-formation rates ranging from a few tens to a few hundreds of Solar mass per year. Five of the LBGs have JWST or HST images and four of them show compact morphology in these images, including one that is roughly consistent with a point source, suggesting that UV luminous galaxies at this redshift are generally compact. The fraction of our photometrically selected LBGs with strong Ly$αあるふぁ$ emission ($\mathrm{EW}>25$ Å) is about $0.2$, which is consistent with previous results and supports a moderate evolution of the IGM opacity at the end of cosmic reionization. Using deep X-ray images, we do not find evidence of strong AGN activity in these galaxies, but our constraint is loose and we are not able to rule out the possibility of any weak AGN activity. △ Less

Submitted 11 January, 2024; originally announced January 2024.

Comments: 19 pages, 11 figures, Accepted for publication in ApJ

arXiv:2401.03077 [pdf, other]

A Topology-aware Graph Coarsening Framework for Continual Graph Learning

Authors: Xiaoxue Han, Zhuo Feng, Yue Ning

Abstract: Continual learning on graphs tackles the problem of training a graph neural network (GNN) where graph data arrive in a streaming fashion and the model tends to forget knowledge from previous tasks when updating with new data. Traditional continual learning strategies such as Experience Replay can be adapted to streaming graphs, however, these methods often face challenges such as inefficiency in p… ▽ More Continual learning on graphs tackles the problem of training a graph neural network (GNN) where graph data arrive in a streaming fashion and the model tends to forget knowledge from previous tasks when updating with new data. Traditional continual learning strategies such as Experience Replay can be adapted to streaming graphs, however, these methods often face challenges such as inefficiency in preserving graph topology and incapability of capturing the correlation between old and new tasks. To address these challenges, we propose TA$\mathbb{CO}$, a (t)opology-(a)ware graph (co)arsening and (co)ntinual learning framework that stores information from previous tasks as a reduced graph. At each time period, this reduced graph expands by combining with a new graph and aligning shared nodes, and then it undergoes a "zoom out" process by reduction to maintain a stable size. We design a graph coarsening algorithm based on node representation proximities to efficiently reduce a graph and preserve topological information. We empirically demonstrate the learning process on the reduced graph can approximate that of the original graph. Our experiments validate the effectiveness of the proposed framework on three real-world datasets using different backbone GNN models. △ Less

Submitted 5 January, 2024; originally announced January 2024.

arXiv:2312.15625 [pdf, other]

doi 10.1021/acs.jpcc.3c06603

Exploration of the two-dimensional Ising magnetic materials in the triangular prismatic crystal field

Authors: Shuhang Chen, Wenjing Xu, Yueyue Ning, Ke Yang

Abstract: Magnetic anisotropy is essential for stabilizing two-dimensional (2D) magnetism, which has significant applications in spintronics and the advancement of fundamental physics. In this work, we examine the electronic structure and magnetic properties of triangular prismatic MSi$_2$N$_4$ (M = V, Cr) monolayers, using crystal field theory, spin-orbital state analyses, and density functional calculatio… ▽ More Magnetic anisotropy is essential for stabilizing two-dimensional (2D) magnetism, which has significant applications in spintronics and the advancement of fundamental physics. In this work, we examine the electronic structure and magnetic properties of triangular prismatic MSi$_2$N$_4$ (M = V, Cr) monolayers, using crystal field theory, spin-orbital state analyses, and density functional calculations. Our results reveal that the pristine VSi$_2$N$_4$ monolayer exhibits magnetism with a V$^{4+}$ 3$d^1$ $S$ = 1/2 charge-spin state within the triangular prismatic crystal field. However, the strong $d$ orbital hybridization between adjacent V$^{4+}$ ions disrupts the $d$ orbital splitting in this crystal field, resulting in a relatively small in-plane magnetic anisotropy of approximately 2 $μみゅー$eV per V atom.In contrast, the pristine CrSi$_2$N$_4$ monolayer is nonmagnetic, characterized by the Cr$^{4+}$ 3$d^2$ $S$ = 0 state. Upon substituting nonmagnetic Cr$^{4+}$ with Si$^{4+}$, Cr$_\frac{1}{3}$Si$_\frac{8}{3}$N$_4$ transforms into an antiferromagnetic insulator with Cr$^{4+}$ 3$d^2$ $S$ = 1 state, featuring a large orbital moment of -1.06 $μみゅー_{\rm B}$ oriented along the $z$-axis and huge perpendicular magnetic anisotropy of 18.63 meV per Cr atom. These findings highlight the potential for further exploration of 2D Ising magnetic materials within a unique triangular prismatic crystal field. △ Less

Submitted 25 December, 2023; originally announced December 2023.

Journal ref: The Journal of Physical Chemistry C 2024, 128, 1, 556-562

arXiv:2312.11026 [pdf, other]

MISA: Unveiling the Vulnerabilities in Split Federated Learning

Authors: Wei Wan, Yuxuan Ning, Shengshan Hu, Lulu Xue, Minghui Li, Leo Yu Zhang, Hai Jin

Abstract: \textit{Federated learning} (FL) and \textit{split learning} (SL) are prevailing distributed paradigms in recent years. They both enable shared global model training while keeping data localized on users' devices. The former excels in parallel execution capabilities, while the latter enjoys low dependence on edge computing resources and strong privacy protection. \textit{Split federated learning}… ▽ More \textit{Federated learning} (FL) and \textit{split learning} (SL) are prevailing distributed paradigms in recent years. They both enable shared global model training while keeping data localized on users' devices. The former excels in parallel execution capabilities, while the latter enjoys low dependence on edge computing resources and strong privacy protection. \textit{Split federated learning} (SFL) combines the strengths of both FL and SL, making it one of the most popular distributed architectures. Furthermore, a recent study has claimed that SFL exhibits robustness against poisoning attacks, with a fivefold improvement compared to FL in terms of robustness. In this paper, we present a novel poisoning attack known as MISA. It poisons both the top and bottom models, causing a \textbf{\underline{misa}}lignment in the global model, ultimately leading to a drastic accuracy collapse. This attack unveils the vulnerabilities in SFL, challenging the conventional belief that SFL is robust against poisoning attacks. Extensive experiments demonstrate that our proposed MISA poses a significant threat to the availability of SFL, underscoring the imperative for academia and industry to accord this matter due attention. △ Less

Submitted 19 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

Comments: This paper has been accepted by the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)

arXiv:2312.04841 [pdf, other]

doi 10.3847/2041-8213/ad292f

Unveiling Luminous Ly$αあるふぁ$ Emitters at $z\approx6$ through JWST/NIRCam Imaging in the COSMOS Field

Authors: Yuanhang Ning, Zheng Cai, Xiaojing Lin, Zhen-Ya Zheng, Xiaotong Feng, Mingyu Li, Qiong Li, Daniele Spinoso, Yunjing Wu, Haibin Zhang

Abstract: We study a sample of 14 spectroscopically confirmed Ly$αあるふぁ$ Emitters (LAEs) in the late era of reionization (at redshift $z\approx6$) based on the JWST/NIRCam imaging dataset. These LAEs with high Ly$αあるふぁ$ luminosity of $L$(Ly$αあるふぁ$) $\sim10^{42.4-43.4}$ erg s$^{-1}$ have been covered by the (ongoing) COSMOS-Web survey (Kartaltepe et al. 2021; Casey et al. 2022) over $0.28$ deg$^2$ in four NIRCam bands (F… ▽ More We study a sample of 14 spectroscopically confirmed Ly$αあるふぁ$ Emitters (LAEs) in the late era of reionization (at redshift $z\approx6$) based on the JWST/NIRCam imaging dataset. These LAEs with high Ly$αあるふぁ$ luminosity of $L$(Ly$αあるふぁ$) $\sim10^{42.4-43.4}$ erg s$^{-1}$ have been covered by the (ongoing) COSMOS-Web survey (Kartaltepe et al. 2021; Casey et al. 2022) over $0.28$ deg$^2$ in four NIRCam bands (F115W, F150W, F277W, and F444W). With JWST imaging, we determine the UV continua with $M_{\rm UV}$ ranging from ${-}20.5$ to ${-}18.5$ mag. The UV slopes have a median value of $βべーた\approx-2.35$, and the steepest slope can reach $βべーた<-3$. Under an excellent spatial resolution of JWST, we identify three out of the sample as potential merging/interacting systems. The 14 LAEs (and their components) are compact in morphology residing substantially below the mass-size relation of high-$z$ galaxies. We further investigate the stellar mass ($M_*$) and star-formation rates (SFRs). Most of the LAEs lie on the SFR-$M_*$ main-sequence relation while two of them featured as "little red dots" likely host active galactic nuclei (AGN), implying a ${\sim}10\%$ AGN fraction. Moreover, we reveal that a new correlation may exist between Ly$αあるふぁ$ equivalent width and the offset between Ly$αあるふぁ$ and UV emission ($Δでるたd_{\rm Lyαあるふぁ}$) with a median $Δでるたd_{\rm Lyαあるふぁ} \sim 1$ kpc. This could be explained by Ly$αあるふぁ$ radiative transfer process in both ISM and CGM. The results usher a new era of detailed analysis on high-$z$ LAEs with the JWST capability. △ Less

Submitted 5 March, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

Comments: 15 pages, 5 figures, accepted by ApJL

Journal ref: 2024 ApJL 963 L38

arXiv:2312.02140 [pdf, other]

MMT/Binospec Spectroscopic Survey of Two $z\sim$ 0.8 Galaxy Clusters in the Eye of Horus Field

Authors: Jiyun Di, Eiichi Egami, Kenneth C. Wong, Chien-Hsiu Lee, Yuanhang Ning, Naomi Ota, Masayuki Tanaka

Abstract: The discovery of the Eye of Horus (EoH), a rare double source-plane lens system ($z_{\rm lens}=$ 0.795; $z_{\rm src}=$ 1.302 and 1.988), has also led to the identification of two high-redshift ($z_{\rm phot}\sim$ 0.8) galaxy clusters in the same field based on the subsequent analysis of the Subaru/Hyper Suprime-Cam (HSC) optical and XMM-Newton X-ray data. The two brightest cluster galaxies (BCGs),… ▽ More The discovery of the Eye of Horus (EoH), a rare double source-plane lens system ($z_{\rm lens}=$ 0.795; $z_{\rm src}=$ 1.302 and 1.988), has also led to the identification of two high-redshift ($z_{\rm phot}\sim$ 0.8) galaxy clusters in the same field based on the subsequent analysis of the Subaru/Hyper Suprime-Cam (HSC) optical and XMM-Newton X-ray data. The two brightest cluster galaxies (BCGs), one of which is the lensing galaxy of the EoH, are separated by only $\sim$100$"$ ($=$ 0.75 Mpc $<$ $r_{200}$) on the sky, raising the possibility that these two clusters may be physically associated. Here, we present a follow-up optical spectroscopic survey of this EoH field, obtaining 218 secure redshifts using MMT/Binospec. We have confirmed that there indeed exist two massive ($M_{\rm dyn}$ $>$ $10^{14}$ M$_\odot$) clusters of galaxies at $z$ $=$ 0.795 (the main cluster) and at $z=0.769$ (the NE cluster). However, these clusters have a velocity offset of $\sim$4300 km s$^{-1}$, suggesting that this two-cluster system is likely a line-of-sight projection rather than a physically-related association (e.g., a cluster merger). In terms of the properties of cluster-member galaxies, these two $z\sim0.8$ clusters appear well-developed, each harboring an old (age $=$ 3.6-6.0 Gyr) and massive ($M_\mathrm{*}$ $=$ 4.2-9.5 $\times$ $10^{11}$ M$_\odot$) BCG and exhibiting a well-established red sequence (RS). This study underscores the importance of conducting a spectroscopic follow-up for high-redshift cluster candidates because RS-based cluster selections are susceptible to such a projection effect in general. △ Less

Submitted 4 December, 2023; originally announced December 2023.

Comments: 13 pages (+56 pages in appendices), 7(+47) figures, 4(+1) tables; to be submitted to ApJ

arXiv:2311.08747 [pdf, other]

Improved Dense Nested Attention Network Based on Transformer for Infrared Small Target Detection

Authors: Chun Bao, Jie Cao, Yaqian Ning, Tianhua Zhao, Zhijun Li, Zechen Wang, Li Zhang, Qun Hao

Abstract: Infrared small target detection based on deep learning offers unique advantages in separating small targets from complex and dynamic backgrounds. However, the features of infrared small targets gradually weaken as the depth of convolutional neural network (CNN) increases. To address this issue, we propose a novel method for detecting infrared small targets called improved dense nested attention ne… ▽ More Infrared small target detection based on deep learning offers unique advantages in separating small targets from complex and dynamic backgrounds. However, the features of infrared small targets gradually weaken as the depth of convolutional neural network (CNN) increases. To address this issue, we propose a novel method for detecting infrared small targets called improved dense nested attention network (IDNANet), which is based on the transformer architecture. We preserve the dense nested structure of dense nested attention network (DNANet) and introduce the Swin-transformer during feature extraction stage to enhance the continuity of features. Furthermore, we integrate the ACmix attention structure into the dense nested structure to enhance the features of intermediate layers. Additionally, we design a weighted dice binary cross-entropy (WD-BCE) loss function to mitigate the negative impact of foreground-background imbalance in the samples. Moreover, we develop a dataset specifically for infrared small targets, called BIT-SIRST. The dataset comprises a significant amount of real-world targets and manually annotated labels, as well as synthetic data and corresponding labels. We have evaluated the effectiveness of our method through experiments conducted on public datasets. In comparison to other state-of-the-art methods, our approach outperforms in terms of probability of detection ($P_d$), false-alarm rate ($F_a$), and mean intersection of union ($mIoU$). The $mIoU$ reaches 90.89\% on the NUDT-SIRST dataset and 79.72\% on the SIRST dataset. The BIT-SIRST dataset and codes are available openly at \href{https://github.com/EdwardBao1006/bit\_sirst}{\color[HTML]{B22222}{https://github.com/EdwardBao1006/bit\_sirst}}. △ Less

Submitted 17 January, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

arXiv:2311.07237 [pdf, other]

In Search of the Long-Tail: Systematic Generation of Long-Tail Inferential Knowledge via Logical Rule Guided Search

Authors: Huihan Li, Yuting Ning, Zeyi Liao, Siyuan Wang, Xiang Lorraine Li, Ximing Lu, Wenting Zhao, Faeze Brahman, Yejin Choi, Xiang Ren

Abstract: State-of-the-art LLMs outperform humans on reasoning tasks such as Natural Language Inference. Recent works evaluating LLMs note a marked performance drop on input data from the low-probability distribution, i.e., the longtail. Therefore, we focus on systematically generating statements involving long-tail inferential knowledge for more effective evaluation of LLMs in the reasoning space. We first… ▽ More State-of-the-art LLMs outperform humans on reasoning tasks such as Natural Language Inference. Recent works evaluating LLMs note a marked performance drop on input data from the low-probability distribution, i.e., the longtail. Therefore, we focus on systematically generating statements involving long-tail inferential knowledge for more effective evaluation of LLMs in the reasoning space. We first propose a novel framework Logic-Induced- Knowledge-Search (LINK) that generates factually correct and long-tail knowledge statements grounded on symbolic rule templates; LINK effectively generates data in the longtail distribution that zero-shot prompted LLMs are unable to reach, and outperforms zero-shot GPT4 on factual correctness by 5%. We further use the data generated by LINK to construct a dataset Logic-Induced-Long-Tail (LINT) that can be used to evaluate downstream models on the long-tail distribution; LINT contains 108K knowledge statements spanning four domains. We use LINT to test LLMs on an entailment classification task and find that model performances drop by as high as 5% in the long-tail distribution compared to head distribution. Our work shows the utility of evaluating models in the long-tail distribution, and calls for more research on generating evaluation data in the long-tail distribution. △ Less

Submitted 27 February, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

arXiv:2311.03417 [pdf]

Federated Learning for Clinical Structured Data: A Benchmark Comparison of Engineering and Statistical Approaches

Authors: Siqi Li, Di Miao, Qiming Wu, Chuan Hong, Danny D'Agostino, Xin Li, Yilin Ning, Yuqing Shang, Huazhu Fu, Marcus Eng Hock Ong, Hamed Haddadi, Nan Liu

Abstract: Federated learning (FL) has shown promising potential in safeguarding data privacy in healthcare collaborations. While the term "FL" was originally coined by the engineering community, the statistical field has also explored similar privacy-preserving algorithms. Statistical FL algorithms, however, remain considerably less recognized than their engineering counterparts. Our goal was to bridge the… ▽ More Federated learning (FL) has shown promising potential in safeguarding data privacy in healthcare collaborations. While the term "FL" was originally coined by the engineering community, the statistical field has also explored similar privacy-preserving algorithms. Statistical FL algorithms, however, remain considerably less recognized than their engineering counterparts. Our goal was to bridge the gap by presenting the first comprehensive comparison of FL frameworks from both engineering and statistical domains. We evaluated five FL frameworks using both simulated and real-world data. The results indicate that statistical FL algorithms yield less biased point estimates for model coefficients and offer convenient confidence interval estimations. In contrast, engineering-based methods tend to generate more accurate predictions, sometimes surpassing central pooled and statistical FL models. This study underscores the relative strengths and weaknesses of both types of methods, emphasizing the need for increased awareness and their integration in future FL applications. △ Less

Submitted 6 November, 2023; originally announced November 2023.

arXiv:2311.02107 [pdf]

Generative Artificial Intelligence in Healthcare: Ethical Considerations and Assessment Checklist

Authors: Yilin Ning, Salinelat Teixayavong, Yuqing Shang, Julian Savulescu, Vaishaanth Nagaraj, Di Miao, Mayli Mertens, Daniel Shu Wei Ting, Jasmine Chiat Ling Ong, Mingxuan Liu, Jiuwen Cao, Michael Dunn, Roger Vaughan, Marcus Eng Hock Ong, Joseph Jao-Yiu Sung, Eric J Topol, Nan Liu

Abstract: The widespread use of ChatGPT and other emerging technology powered by generative artificial intelligence (GenAI) has drawn much attention to potential ethical issues, especially in high-stakes applications such as healthcare, but ethical discussions are yet to translate into operationalisable solutions. Furthermore, ongoing ethical discussions often neglect other types of GenAI that have been use… ▽ More The widespread use of ChatGPT and other emerging technology powered by generative artificial intelligence (GenAI) has drawn much attention to potential ethical issues, especially in high-stakes applications such as healthcare, but ethical discussions are yet to translate into operationalisable solutions. Furthermore, ongoing ethical discussions often neglect other types of GenAI that have been used to synthesise data (e.g., images) for research and practical purposes, which resolved some ethical issues and exposed others. We conduct a scoping review of ethical discussions on GenAI in healthcare to comprehensively analyse gaps in the current research, and further propose to reduce the gaps by developing a checklist for comprehensive assessment and transparent documentation of ethical discussions in GenAI research. The checklist can be readily integrated into the current peer review and publication system to enhance GenAI research, and may be used for ethics-related disclosures for GenAI-powered products, healthcare applications of such products and beyond. △ Less

Submitted 23 February, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

arXiv:2310.12350 [pdf, other]

Equipping Federated Graph Neural Networks with Structure-aware Group Fairness

Authors: Nan Cui, Xiuling Wang, Wendy Hui Wang, Violet Chen, Yue Ning

Abstract: Graph Neural Networks (GNNs) have been widely used for various types of graph data processing and analytical tasks in different domains. Training GNNs over centralized graph data can be infeasible due to privacy concerns and regulatory restrictions. Thus, federated learning (FL) becomes a trending solution to address this challenge in a distributed learning paradigm. However, as GNNs may inherit h… ▽ More Graph Neural Networks (GNNs) have been widely used for various types of graph data processing and analytical tasks in different domains. Training GNNs over centralized graph data can be infeasible due to privacy concerns and regulatory restrictions. Thus, federated learning (FL) becomes a trending solution to address this challenge in a distributed learning paradigm. However, as GNNs may inherit historical bias from training data and lead to discriminatory predictions, the bias of local models can be easily propagated to the global model in distributed settings. This poses a new challenge in mitigating bias in federated GNNs. To address this challenge, we propose $\text{F}^2$GNN, a Fair Federated Graph Neural Network, that enhances group fairness of federated GNNs. As bias can be sourced from both data and learning algorithms, $\text{F}^2$GNN aims to mitigate both types of bias under federated settings. First, we provide theoretical insights on the connection between data bias in a training graph and statistical fairness metrics of the trained GNN models. Based on the theoretical analysis, we design $\text{F}^2$GNN which contains two key components: a fairness-aware local model update scheme that enhances group fairness of the local models on the client side, and a fairness-weighted global model update scheme that takes both data bias and fairness metrics of local models into consideration in the aggregation process. We evaluate $\text{F}^2$GNN empirically versus a number of baseline methods, and demonstrate that $\text{F}^2$GNN outperforms these baselines in terms of both fairness and model accuracy. △ Less

Submitted 13 May, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

arXiv:2310.09672 [pdf, other]

Towards Semi-Structured Automatic ICD Coding via Tree-based Contrastive Learning

Authors: Chang Lu, Chandan K. Reddy, Ping Wang, Yue Ning

Abstract: Automatic coding of International Classification of Diseases (ICD) is a multi-label text categorization task that involves extracting disease or procedure codes from clinical notes. Despite the application of state-of-the-art natural language processing (NLP) techniques, there are still challenges including limited availability of data due to privacy constraints and the high variability of clinica… ▽ More Automatic coding of International Classification of Diseases (ICD) is a multi-label text categorization task that involves extracting disease or procedure codes from clinical notes. Despite the application of state-of-the-art natural language processing (NLP) techniques, there are still challenges including limited availability of data due to privacy constraints and the high variability of clinical notes caused by different writing habits of medical professionals and various pathological features of patients. In this work, we investigate the semi-structured nature of clinical notes and propose an automatic algorithm to segment them into sections. To address the variability issues in existing ICD coding models with limited data, we introduce a contrastive pre-training approach on sections using a soft multi-label similarity metric based on tree edit distance. Additionally, we design a masked section training strategy to enable ICD coding models to locate sections related to ICD codes. Extensive experimental results demonstrate that our proposed training strategies effectively enhance the performance of existing ICD coding methods. △ Less

Submitted 14 October, 2023; originally announced October 2023.

Comments: Accepted by NeurIPS 2023

arXiv:2310.06378 [pdf, ps, other]

Bounds on $k$-Uniform Quantum States

Authors: Fei Shi, Yu Ning, Qi Zhao, Xiande Zhang

Abstract: Do $N$-partite $k$-uniform states always exist when $k\leq \lfloor\frac{N}{2}\rfloor-1$? In this work, we provide new upper bounds on the parameter $k$ for the existence of $k$-uniform states in $(\mathbb{C}^{d})^{\otimes N}$ when $d=3,4,5$, which extend Rains' bound in 1999 and improve Scott's bound in 2004. Since a $k$-uniform state in $(\mathbb{C}^{d})^{\otimes N}$ corresponds to a pure… ▽ More Do $N$-partite $k$-uniform states always exist when $k\leq \lfloor\frac{N}{2}\rfloor-1$? In this work, we provide new upper bounds on the parameter $k$ for the existence of $k$-uniform states in $(\mathbb{C}^{d})^{\otimes N}$ when $d=3,4,5$, which extend Rains' bound in 1999 and improve Scott's bound in 2004. Since a $k$-uniform state in $(\mathbb{C}^{d})^{\otimes N}$ corresponds to a pure $((N,1,k+1))_{d}$ quantum error-correcting codes, we also give new upper bounds on the minimum distance $k+1$ of pure $((N,1,k+1))_d$ quantum error-correcting codes. Furthermore, we generalize Scott's bound to heterogeneous systems, and show some non-existence results of absolutely maximally entangled states in $\mathbb{C}^{d_1}\otimes(\mathbb{C}^{d_2})^{\otimes 2n}$. △ Less

Submitted 29 November, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

arXiv:2309.12995 [pdf, other]

doi 10.1088/1674-1137/acf65c

Study of neutron density fluctuation and neutron-proton correlation in Au+Au collisions using PYTHIA8/Angantyr

Authors: Zhang Zuman, Li Sha, Yu Ning, Lin Jianping, Li Shuang, Tang Siyu, Zhou Daicui

Abstract: Utilizing the PYTHIA8 Angantyr model, which incorporates the multiple-parton interactions (MPI) based color reconnection (CR) mechanism, we study the relative neutron density fluctuation and neutron-proton correlation in Au+Au collisions at $\sqrt{s_\text{NN}}$ = 7.7, 11.5, 14.5, 19.6, 27, 39, 62.4, and 200 GeV. In this study, we have not only delved into the dependence of these two remarkable obs… ▽ More Utilizing the PYTHIA8 Angantyr model, which incorporates the multiple-parton interactions (MPI) based color reconnection (CR) mechanism, we study the relative neutron density fluctuation and neutron-proton correlation in Au+Au collisions at $\sqrt{s_\text{NN}}$ = 7.7, 11.5, 14.5, 19.6, 27, 39, 62.4, and 200 GeV. In this study, we have not only delved into the dependence of these two remarkable observations on rapidity, centrality, and energy, but also presented an analysis of their interplay with the MPI and CR. Our results have shown that the light nuclei yield ratio of proton, deuteron, and triton, expressed by the elegant expression $N_tN_p/N_d^2$, remains unchanged even as the rapidity coverage and collision centrality increase. Interestingly, we have also revealed that the effect of CR is entirely dependent on the presence of MPI; CR has no impact on the yield ratio if MPI is off. Our findings further demonstrate that the light nuclei yield ratio experiences a slight increase with increasing collision energy as predicted by the PYTHIA8 Angantyr model, but it cannot describe the non-monotonic trend observed by the STAR experiment. Based on the Angantyr model simulation results, it is essential not to overlook the correlation between neutron and proton fluctuations. The Angantyr model is a good baseline for studying collisions in the absence of a Quark-Gluon Plasma (QGP) system, given its lack of flow and jet quenching. △ Less

Submitted 22 September, 2023; originally announced September 2023.

Comments: arXiv admin note: text overlap with arXiv:2211.03297 by other authors

Journal ref: Chin. Phys. C 47, no.11,114102 (2023)

arXiv:2308.15253 [pdf, other]

Development of a SciFi-based beam monitor for COMET

Authors: Yu Xu, Yunsong Ning, Zhizhen Qin, Yao Teng, Changqing Feng, Jian Tang, Yu Chen, Yoshinori Fukao, Satoshi Mihara, Kou Oishi

Abstract: COMET is a leading experiment to search for coherent conversion of $μみゅー^- \mathrm{N}\to e^- \mathrm{N}$ with a high-intensity pulsed muon beamline, produced by the innovative slow extraction techniques. Therefore, it is critical to measure the characteristics of the muon beam. We set up a Muon Beam Monitor (MBM), where scintillation fibers (SciFi) weaved in the cross shape are coupled to silicon pho… ▽ More COMET is a leading experiment to search for coherent conversion of $μみゅー^- \mathrm{N}\to e^- \mathrm{N}$ with a high-intensity pulsed muon beamline, produced by the innovative slow extraction techniques. Therefore, it is critical to measure the characteristics of the muon beam. We set up a Muon Beam Monitor (MBM), where scintillation fibers (SciFi) weaved in the cross shape are coupled to silicon photomultipliers (SiPM), to measure the spatial profile and timing structure of the extracted muon beam for COMET. The MBM detector has been tested successfully with a proton beamline in China Spallation Neutron Source (CSNS) and taken data with good performance in the commissioning run called COMET Phase-$αあるふぁ$. Experience of the MBM development, such as the mechanical structure and electronics readout, and its beam measurement results will be shared. △ Less

Submitted 15 September, 2023; v1 submitted 29 August, 2023; originally announced August 2023.

arXiv:2307.11758 [pdf, other]

A Comprehensive Introduction of Visual-Inertial Navigation

Authors: Yangyang Ning

Abstract: In this article, a tutorial introduction to visual-inertial navigation(VIN) is presented. Visual and inertial perception are two complementary sensing modalities. Cameras and inertial measurement units (IMU) are the corresponding sensors for these two modalities. The low cost and light weight of camera-IMU sensor combinations make them ubiquitous in robotic navigation. Visual-inertial Navigation i… ▽ More In this article, a tutorial introduction to visual-inertial navigation(VIN) is presented. Visual and inertial perception are two complementary sensing modalities. Cameras and inertial measurement units (IMU) are the corresponding sensors for these two modalities. The low cost and light weight of camera-IMU sensor combinations make them ubiquitous in robotic navigation. Visual-inertial Navigation is a state estimation problem, that estimates the ego-motion and local environment of the sensor platform. This paper presents visual-inertial navigation in the classical state estimation framework, first illustrating the estimation problem in terms of state variables and system models, including related quantities representations (Parameterizations), IMU dynamic and camera measurement models, and corresponding general probabilistic graphical models (Factor Graph). Secondly, we investigate the existing model-based estimation methodologies, these involve filter-based and optimization-based frameworks and related on-manifold operations. We also discuss the calibration of some relevant parameters, also initialization of state of interest in optimization-based frameworks. Then the evaluation and improvement of VIN in terms of accuracy, efficiency, and robustness are discussed. Finally, we briefly mention the recent development of learning-based methods that may become alternatives to traditional model-based methods. △ Less

Submitted 27 June, 2023; originally announced July 2023.

Comments: 35 pages, 10 figures

arXiv:2307.05943 [pdf, other]

Empirical Bayes large-scale multiple testing for high-dimensional binary outcome data

Authors: Yu-Chien Bo Ning

Abstract: This paper explores the multiple testing problem for sparse high-dimensional data with binary outcomes. We utilize the empirical Bayes posterior to construct multiple testing procedures and evaluate their performance on false discovery rate (FDR) control. We first show that the $\ell$-value (a.k.a. the local FDR) procedure can be overly conservative in estimating the FDR if choosing the conjugate… ▽ More This paper explores the multiple testing problem for sparse high-dimensional data with binary outcomes. We utilize the empirical Bayes posterior to construct multiple testing procedures and evaluate their performance on false discovery rate (FDR) control. We first show that the $\ell$-value (a.k.a. the local FDR) procedure can be overly conservative in estimating the FDR if choosing the conjugate spike and uniform slab prior. To address this, we propose two new procedures that calibrate the posterior to achieve correct FDR control. Sharp frequentist theoretical results are established for these procedures, and numerical experiments are conducted to validate our theory in finite samples. To the best of our knowledge, we obtain the first {\it uniform} FDR control result in multiple testing for high-dimensional data with binary outcomes under the sparsity assumption. △ Less

Submitted 9 June, 2024; v1 submitted 12 July, 2023; originally announced July 2023.

Comments: 85 pages, 7 figures

MSC Class: 62G10; 62G20

arXiv:2306.11443 [pdf, other]

UUKG: Unified Urban Knowledge Graph Dataset for Urban Spatiotemporal Prediction

Authors: Yansong Ning, Hao Liu, Hao Wang, Zhenyu Zeng, Hui Xiong

Abstract: Accurate Urban SpatioTemporal Prediction (USTP) is of great importance to the development and operation of the smart city. As an emerging building block, multi-sourced urban data are usually integrated as urban knowledge graphs (UrbanKGs) to provide critical knowledge for urban spatiotemporal prediction models. However, existing UrbanKGs are often tailored for specific downstream prediction tasks… ▽ More Accurate Urban SpatioTemporal Prediction (USTP) is of great importance to the development and operation of the smart city. As an emerging building block, multi-sourced urban data are usually integrated as urban knowledge graphs (UrbanKGs) to provide critical knowledge for urban spatiotemporal prediction models. However, existing UrbanKGs are often tailored for specific downstream prediction tasks and are not publicly available, which limits the potential advancement. This paper presents UUKG, the unified urban knowledge graph dataset for knowledge-enhanced urban spatiotemporal predictions. Specifically, we first construct UrbanKGs consisting of millions of triplets for two metropolises by connecting heterogeneous urban entities such as administrative boroughs, POIs, and road segments. Moreover, we conduct qualitative and quantitative analysis on constructed UrbanKGs and uncover diverse high-order structural patterns, such as hierarchies and cycles, that can be leveraged to benefit downstream USTP tasks. To validate and facilitate the use of UrbanKGs, we implement and evaluate 15 KG embedding methods on the KG completion task and integrate the learned KG embeddings into 9 spatiotemporal models for five different USTP tasks. The extensive experimental results not only provide benchmarks of knowledge-enhanced USTP models under different task settings but also highlight the potential of state-of-the-art high-order structure-aware UrbanKG embedding methods. We hope the proposed UUKG fosters research on urban knowledge graphs and broad smart city applications. The dataset and source code are available at https://github.com/usail-hkust/UUKG/. △ Less

Submitted 22 October, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

Comments: NeurIPS 2023 Track on Datasets and Benchmarks

arXiv:2306.10512 [pdf, other]

Efficiently Measuring the Cognitive Ability of LLMs: An Adaptive Testing Perspective

Authors: Yan Zhuang, Qi Liu, Yuting Ning, Weizhe Huang, Rui Lv, Zhenya Huang, Guanhao Zhao, Zheng Zhang, Qingyang Mao, Shijin Wang, Enhong Chen

Abstract: Large language models (LLMs), like ChatGPT, have shown some human-like cognitive abilities. For comparing these abilities of different models, several benchmarks (i.e. sets of standard test questions) from different fields (e.g., Literature, Biology and Psychology) are often adopted and the test results under traditional metrics such as accuracy, recall and F1, are reported. However, such way for… ▽ More Large language models (LLMs), like ChatGPT, have shown some human-like cognitive abilities. For comparing these abilities of different models, several benchmarks (i.e. sets of standard test questions) from different fields (e.g., Literature, Biology and Psychology) are often adopted and the test results under traditional metrics such as accuracy, recall and F1, are reported. However, such way for evaluating LLMs can be inefficient and inaccurate from the cognitive science perspective. Inspired by Computerized Adaptive Testing (CAT) used in psychometrics, we propose an adaptive testing framework for LLM evaluation. Rather than using a standard test set and simply reporting accuracy, this approach dynamically adjusts the characteristics of the test questions, such as difficulty, based on the model's performance. This allows for a more accurate estimation of the model's abilities, using fewer questions. More importantly, it allows LLMs to be compared with humans easily, which is essential for NLP models that aim for human-level ability. Our diagnostic reports have found that ChatGPT often behaves like a ``careless student'', prone to slip and occasionally guessing the questions. We conduct a fine-grained diagnosis and rank the latest 6 instruction-tuned LLMs from three aspects of Subject Knowledge, Mathematical Reasoning, and Programming, where GPT4 can outperform other models significantly and reach the cognitive ability of middle-level students. Different tests for different models using efficient adaptive testing -- we believe this has the potential to become a new norm in evaluating large language models. △ Less

Submitted 28 October, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

arXiv:2306.10351 [pdf, other]

Bkd-FedGNN: A Benchmark for Classification Backdoor Attacks on Federated Graph Neural Network

Authors: Fan Liu, Siqi Lai, Yansong Ning, Hao Liu

Abstract: Federated Graph Neural Network (FedGNN) has recently emerged as a rapidly growing research topic, as it integrates the strengths of graph neural networks and federated learning to enable advanced machine learning applications without direct access to sensitive data. Despite its advantages, the distributed nature of FedGNN introduces additional vulnerabilities, particularly backdoor attacks stemmin… ▽ More Federated Graph Neural Network (FedGNN) has recently emerged as a rapidly growing research topic, as it integrates the strengths of graph neural networks and federated learning to enable advanced machine learning applications without direct access to sensitive data. Despite its advantages, the distributed nature of FedGNN introduces additional vulnerabilities, particularly backdoor attacks stemming from malicious participants. Although graph backdoor attacks have been explored, the compounded complexity introduced by the combination of GNNs and federated learning has hindered a comprehensive understanding of these attacks, as existing research lacks extensive benchmark coverage and in-depth analysis of critical factors. To address these limitations, we propose Bkd-FedGNN, a benchmark for backdoor attacks on FedGNN. Specifically, Bkd-FedGNN decomposes the graph backdoor attack into trigger generation and injection steps, and extending the attack to the node-level federated setting, resulting in a unified framework that covers both node-level and graph-level classification tasks. Moreover, we thoroughly investigate the impact of multiple critical factors in backdoor attacks on FedGNN. These factors are categorized into global-level and local-level factors, including data distribution, the number of malicious attackers, attack time, overlapping rate, trigger size, trigger type, trigger position, and poisoning rate. Finally, we conduct comprehensive evaluations on 13 benchmark datasets and 13 critical factors, comprising 1,725 experimental configurations for node-level and graph-level tasks from six domains. These experiments encompass over 8,000 individual tests, allowing us to provide a thorough evaluation and insightful observations that advance our understanding of backdoor attacks on FedGNN.The Bkd-FedGNN benchmark is publicly available at https://github.com/usail-hkust/BkdFedGCN. △ Less

Submitted 17 June, 2023; originally announced June 2023.

arXiv:2304.13493 [pdf]

Towards clinical AI fairness: A translational perspective

Authors: Mingxuan Liu, Yilin Ning, Salinelat Teixayavong, Mayli Mertens, Jie Xu, Daniel Shu Wei Ting, Lionel Tim-Ee Cheng, Jasmine Chiat Ling Ong, Zhen Ling Teo, Ting Fang Tan, Ravi Chandran Narrendar, Fei Wang, Leo Anthony Celi, Marcus Eng Hock Ong, Nan Liu

Abstract: Artificial intelligence (AI) has demonstrated the ability to extract insights from data, but the issue of fairness remains a concern in high-stakes fields such as healthcare. Despite extensive discussion and efforts in algorithm development, AI fairness and clinical concerns have not been adequately addressed. In this paper, we discuss the misalignment between technical and clinical perspectives o… ▽ More Artificial intelligence (AI) has demonstrated the ability to extract insights from data, but the issue of fairness remains a concern in high-stakes fields such as healthcare. Despite extensive discussion and efforts in algorithm development, AI fairness and clinical concerns have not been adequately addressed. In this paper, we discuss the misalignment between technical and clinical perspectives of AI fairness, highlight the barriers to AI fairness' translation to healthcare, advocate multidisciplinary collaboration to bridge the knowledge gap, and provide possible solutions to address the clinical concerns pertaining to AI fairness. △ Less

Submitted 26 April, 2023; originally announced April 2023.

arXiv:2304.07310 [pdf]

doi 10.1093/jamia/ocad170

Federated and distributed learning applications for electronic health records and structured medical data: A scoping review

Authors: Siqi Li, Pinyan Liu, Gustavo G. Nascimento, Xinru Wang, Fabio Renato Manzolli Leite, Bibhas Chakraborty, Chuan Hong, Yilin Ning, Feng Xie, Zhen Ling Teo, Daniel Shu Wei Ting, Hamed Haddadi, Marcus Eng Hock Ong, Marco Aurélio Peres, Nan Liu

Abstract: Federated learning (FL) has gained popularity in clinical research in recent years to facilitate privacy-preserving collaboration. Structured data, one of the most prevalent forms of clinical data, has experienced significant growth in volume concurrently, notably with the widespread adoption of electronic health records in clinical practice. This review examines FL applications on structured medi… ▽ More Federated learning (FL) has gained popularity in clinical research in recent years to facilitate privacy-preserving collaboration. Structured data, one of the most prevalent forms of clinical data, has experienced significant growth in volume concurrently, notably with the widespread adoption of electronic health records in clinical practice. This review examines FL applications on structured medical data, identifies contemporary limitations and discusses potential innovations. We searched five databases, SCOPUS, MEDLINE, Web of Science, Embase, and CINAHL, to identify articles that applied FL to structured medical data and reported results following the PRISMA guidelines. Each selected publication was evaluated from three primary perspectives, including data quality, modeling strategies, and FL frameworks. Out of the 1160 papers screened, 34 met the inclusion criteria, with each article consisting of one or more studies that used FL to handle structured clinical/medical data. Of these, 24 utilized data acquired from electronic health records, with clinical predictions and association studies being the most common clinical research tasks that FL was applied to. Only one article exclusively explored the vertical FL setting, while the remaining 33 explored the horizontal FL setting, with only 14 discussing comparisons between single-site (local) and FL (global) analysis. The existing FL applications on structured medical data lack sufficient evaluations of clinically meaningful benefits, particularly when compared to single-site analyses. Therefore, it is crucial for future FL applications to prioritize clinical motivations and develop designs and methodologies that can effectively support and aid clinical practice and research. △ Less

Submitted 14 April, 2023; originally announced April 2023.

arXiv:2304.03779 [pdf]

A roadmap to fair and trustworthy prediction model validation in healthcare

Authors: Yilin Ning, Victor Volovici, Marcus Eng Hock Ong, Benjamin Alan Goldstein, Nan Liu

Abstract: A prediction model is most useful if it generalizes beyond the development data with external validations, but to what extent should it generalize remains unclear. In practice, prediction models are externally validated using data from very different settings, including populations from other health systems or countries, with predictably poor results. This may not be a fair reflection of the perfo… ▽ More A prediction model is most useful if it generalizes beyond the development data with external validations, but to what extent should it generalize remains unclear. In practice, prediction models are externally validated using data from very different settings, including populations from other health systems or countries, with predictably poor results. This may not be a fair reflection of the performance of the model which was designed for a specific target population or setting, and may be stretching the expected model generalizability. To address this, we suggest to externally validate a model using new data from the target population to ensure clear implications of validation performance on model reliability, whereas model generalizability to broader settings should be carefully investigated during model development instead of explored post-hoc. Based on this perspective, we propose a roadmap that facilitates the development and application of reliable, fair, and trustworthy artificial intelligence prediction models. △ Less

Submitted 7 April, 2023; originally announced April 2023.

Comments: 12 pages, 2 figures

arXiv:2303.07830 [pdf]

Emergent Bio-Functional Similarities in a Cortical-Spike-Train-Decoding Spiking Neural Network Facilitate Predictions of Neural Computation

Authors: Tengjun Liu, Yansong Chua, Yiwei Zhang, Yuxiao Ning, Pengfu Liu, Guihua Wan, Zijun Wan, Shaomin Zhang, Weidong Chen

Abstract: Despite its better bio-plausibility, goal-driven spiking neural network (SNN) has not achieved applicable performance for classifying biological spike trains, and showed little bio-functional similarities compared to traditional artificial neural networks. In this study, we proposed the motorSRNN, a recurrent SNN topologically inspired by the neural motor circuit of primates. By employing the moto… ▽ More Despite its better bio-plausibility, goal-driven spiking neural network (SNN) has not achieved applicable performance for classifying biological spike trains, and showed little bio-functional similarities compared to traditional artificial neural networks. In this study, we proposed the motorSRNN, a recurrent SNN topologically inspired by the neural motor circuit of primates. By employing the motorSRNN in decoding spike trains from the primary motor cortex of monkeys, we achieved a good balance between classification accuracy and energy consumption. The motorSRNN communicated with the input by capturing and cultivating more cosine-tuning, an essential property of neurons in the motor cortex, and maintained its stability during training. Such training-induced cultivation and persistency of cosine-tuning was also observed in our monkeys. Moreover, the motorSRNN produced additional bio-functional similarities at the single-neuron, population, and circuit levels, demonstrating biological authenticity. Thereby, ablation studies on motorSRNN have suggested long-term stable feedback synapses contribute to the training-induced cultivation in the motor cortex. Besides these novel findings and predictions, we offer a new framework for building authentic models of neural computation. △ Less

Submitted 14 March, 2023; originally announced March 2023.

arXiv:2303.00282 [pdf]

doi 10.1016/j.jbi.2023.104485

FedScore: A privacy-preserving framework for federated scoring system development

Authors: Siqi Li, Yilin Ning, Marcus Eng Hock Ong, Bibhas Chakraborty, Chuan Hong, Feng Xie, Han Yuan, Mingxuan Liu, Daniel M. Buckland, Yong Chen, Nan Liu

Abstract: We propose FedScore, a privacy-preserving federated learning framework for scoring system generation across multiple sites to facilitate cross-institutional collaborations. The FedScore framework includes five modules: federated variable ranking, federated variable transformation, federated score derivation, federated model selection and federated model evaluation. To illustrate usage and assess F… ▽ More We propose FedScore, a privacy-preserving federated learning framework for scoring system generation across multiple sites to facilitate cross-institutional collaborations. The FedScore framework includes five modules: federated variable ranking, federated variable transformation, federated score derivation, federated model selection and federated model evaluation. To illustrate usage and assess FedScore's performance, we built a hypothetical global scoring system for mortality prediction within 30 days after a visit to an emergency department using 10 simulated sites divided from a tertiary hospital in Singapore. We employed a pre-existing score generator to construct 10 local scoring systems independently at each site and we also developed a scoring system using centralized data for comparison. We compared the acquired FedScore model's performance with that of other scoring models using the receiver operating characteristic (ROC) analysis. The FedScore model achieved an average area under the curve (AUえーゆーC) value of 0.763 across all sites, with a standard deviation (SD) of 0.020. We also calculated the average AUC values and SDs for each local model, and the FedScore model showed promising accuracy and stability with a high average AUC value which was closest to the one of the pooled model and SD which was lower than that of most local models. This study demonstrates that FedScore is a privacy-preserving scoring system generator with potentially good generalizability. △ Less

Submitted 1 March, 2023; originally announced March 2023.

arXiv:2302.04643 [pdf, other]

A Novel Approach for Auto-Formulation of Optimization Problems

Authors: Yuting Ning, Jiayu Liu, Longhu Qin, Tong Xiao, Shangzi Xue, Zhenya Huang, Qi Liu, Enhong Chen, Jinze Wu

Abstract: In the Natural Language for Optimization (NL4Opt) NeurIPS 2022 competition, competitors focus on improving the accessibility and usability of optimization solvers, with the aim of subtask 1: recognizing the semantic entities that correspond to the components of the optimization problem; subtask 2: generating formulations for the optimization problem. In this paper, we present the solution of our t… ▽ More In the Natural Language for Optimization (NL4Opt) NeurIPS 2022 competition, competitors focus on improving the accessibility and usability of optimization solvers, with the aim of subtask 1: recognizing the semantic entities that correspond to the components of the optimization problem; subtask 2: generating formulations for the optimization problem. In this paper, we present the solution of our team. First, we treat subtask 1 as a named entity recognition (NER) problem with the solution pipeline including pre-processing methods, adversarial training, post-processing methods and ensemble learning. Besides, we treat subtask 2 as a generation problem with the solution pipeline including specially designed prompts, adversarial training, post-processing methods and ensemble learning. Our proposed methods have achieved the F1-score of 0.931 in subtask 1 and the accuracy of 0.867 in subtask 2, which won the fourth and third places respectively in this competition. Our code is available at https://github.com/bigdata-ustc/nl4opt. △ Less

Submitted 9 February, 2023; originally announced February 2023.

arXiv:2301.07558 [pdf, other]

Towards a Holistic Understanding of Mathematical Questions with Contrastive Pre-training

Authors: Yuting Ning, Zhenya Huang, Xin Lin, Enhong Chen, Shiwei Tong, Zheng Gong, Shijin Wang

Abstract: Understanding mathematical questions effectively is a crucial task, which can benefit many applications, such as difficulty estimation. Researchers have drawn much attention to designing pre-training models for question representations due to the scarcity of human annotations (e.g., labeling difficulty). However, unlike general free-format texts (e.g., user comments), mathematical questions are ge… ▽ More Understanding mathematical questions effectively is a crucial task, which can benefit many applications, such as difficulty estimation. Researchers have drawn much attention to designing pre-training models for question representations due to the scarcity of human annotations (e.g., labeling difficulty). However, unlike general free-format texts (e.g., user comments), mathematical questions are generally designed with explicit purposes and mathematical logic, and usually consist of more complex content, such as formulas, and related mathematical knowledge (e.g., Function). Therefore, the problem of holistically representing mathematical questions remains underexplored. To this end, in this paper, we propose a novel contrastive pre-training approach for mathematical question representations, namely QuesCo, which attempts to bring questions with more similar purposes closer. Specifically, we first design two-level question augmentations, including content-level and structure-level, which generate literally diverse question pairs with similar purposes. Then, to fully exploit hierarchical information of knowledge concepts, we propose a knowledge hierarchy-aware rank strategy (KHAR), which ranks the similarities between questions in a fine-grained manner. Next, we adopt a ranking contrastive learning task to optimize our model based on the augmented and ranked questions. We conduct extensive experiments on two real-world mathematical datasets. The experimental results demonstrate the effectiveness of our model. △ Less

Submitted 18 January, 2023; originally announced January 2023.

Comments: Accepted by AAAI 2023

arXiv:2212.08370 [pdf]

Shapley variable importance cloud for machine learning models

Authors: Yilin Ning, Mingxuan Liu, Nan Liu

Abstract: Current practice in interpretable machine learning often focuses on explaining the final model trained from data, e.g., by using the Shapley additive explanations (SHAP) method. The recently developed Shapley variable importance cloud (ShapleyVIC) extends the current practice to a group of "nearly optimal models" to provide comprehensive and robust variable importance assessments, with estimated u… ▽ More Current practice in interpretable machine learning often focuses on explaining the final model trained from data, e.g., by using the Shapley additive explanations (SHAP) method. The recently developed Shapley variable importance cloud (ShapleyVIC) extends the current practice to a group of "nearly optimal models" to provide comprehensive and robust variable importance assessments, with estimated uncertainty intervals for a more complete understanding of variable contributions to predictions. ShapleyVIC was initially developed for applications with traditional regression models, and the benefits of ShapleyVIC inference have been demonstrated in real-life prediction tasks using the logistic regression model. However, as a model-agnostic approach, ShapleyVIC application is not limited to such scenarios. In this work, we extend ShapleyVIC implementation for machine learning models to enable wider applications, and propose it as a useful complement to the current SHAP analysis to enable more trustworthy applications of these black-box models. △ Less

Submitted 16 December, 2022; originally announced December 2022.

arXiv:2211.13620 [pdf, other]

doi 10.3847/2041-8213/acb26b

An Hαあるふぁ Impression of Lyαあるふぁ Galaxies at $z\simeq6$ with Deep JWST/NIRCam Imaging

Authors: Yuanhang Ning, Zheng Cai, Linhua Jiang, Xiaojing Lin, Shuqi Fu, Daniele Spinoso

Abstract: We present a study of seven spectroscopically confirmed (Ly$αあるふぁ$ emitting) galaxies at redshift $z\simeq6$ using the $JWST$/NIRCam imaging data. These galaxies, with a wide range of Ly$αあるふぁ$ luminosities, were recently observed in a series of NIRCam broad- and medium-bands. We constrain the rest-frame UV/optical continua and measure the H$αあるふぁ$ line emission of the galaxies using the combination of the… ▽ More We present a study of seven spectroscopically confirmed (Ly$αあるふぁ$ emitting) galaxies at redshift $z\simeq6$ using the $JWST$/NIRCam imaging data. These galaxies, with a wide range of Ly$αあるふぁ$ luminosities, were recently observed in a series of NIRCam broad- and medium-bands. We constrain the rest-frame UV/optical continua and measure the H$αあるふぁ$ line emission of the galaxies using the combination of the $JWST$/NIRCam and archival $HST$/WFC3 infrared photometry. We further estimate their escape fractions of Ly$αあるふぁ$ photons ($f_{\rm esc}^{\rm Lyαあるふぁ}$) and the production efficiency of ionizing photons ($ξくしー_{\rm ion}$). Among the sample, 6/7 galaxies have Ly$αあるふぁ$ escape fractions of ${\lesssim}10\%$, which might be the status for most of star-forming galaxies at $z\simeq6$. One UV-faint Ly$αあるふぁ$ galaxy with an extremely blue UV slope owns a large value of $f_{\rm esc}^{\rm Lyαあるふぁ}$ reaching ${\simeq}50\%$. These galaxies spread a broad range of $ξくしー_{\rm ion}$ over log$_{10}$ $ξくしー_{\rm ion, 0}$ (Hz erg$^{-1}$) $\sim25.0-26.5$. We find that UV-fainter galaxies with bluer UV continuum slopes likely have higher escape fractions of Ly$αあるふぁ$ photons. We also find that galaxies with higher Ly$αあるふぁ$ line emission tend to produce ionizing photons more efficiently. The most Ly$αあるふぁ$-luminous galaxy in the sample has a very high $ξくしー_{\rm ion, 0}$ of log$_{10}$ $ξくしー_{\rm ion, 0}$ (Hz erg$^{-1}$) $>26$. Our results support that Ly$αあるふぁ$ galaxies may have served as an important contributor to the cosmic reionization. Blue and bright Ly$αあるふぁ$ galaxies are excellent targets for $JWST$ follow-up spectroscopic observations. △ Less

Submitted 15 January, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

Comments: Accepted by ApJL; an improved version

Journal ref: 2023 ApJL 944 L1

arXiv:2211.12698 [pdf, other]

doi 10.1109/LGRS.2023.3270186

Rega-Net:Retina Gabor Attention for Deep Convolutional Neural Networks

Authors: Chun Bao, Jie Cao, Yaqian Ning, Yang Cheng, Qun Hao

Abstract: Extensive research works demonstrate that the attention mechanism in convolutional neural networks (CNNs) effectively improves accuracy. Nevertheless, few works design attention mechanisms using large receptive fields. In this work, we propose a novel attention method named Rega-net to increase CNN accuracy by enlarging the receptive field. Inspired by the mechanism of the human retina, we design… ▽ More Extensive research works demonstrate that the attention mechanism in convolutional neural networks (CNNs) effectively improves accuracy. Nevertheless, few works design attention mechanisms using large receptive fields. In this work, we propose a novel attention method named Rega-net to increase CNN accuracy by enlarging the receptive field. Inspired by the mechanism of the human retina, we design convolutional kernels to resemble the non-uniformly distributed structure of the human retina. Then, we sample variable-resolution values in the Gabor function distribution and fill these values in retina-like kernels. This distribution allows essential features to be more visible in the center position of the receptive field. We further design an attention module including these retina-like kernels. Experiments demonstrate that our Rega-Net achieves 79.96% top-1 accuracy on ImageNet-1K classification and 43.1% mAP on COCO2017 object detection. The mAP of the Rega-Net increased by up to 3.5% compared to baseline networks. △ Less

Submitted 3 March, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

Showing 1–50 of 148 results for author: Ning, Y