(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 63 results for author: Feng, A

.
  1. arXiv:2407.04236  [pdf, other

    cs.LG

    Graph Pooling via Ricci Flow

    Authors: Amy Feng, Melanie Weber

    Abstract: Graph Machine Learning often involves the clustering of nodes based on similarity structure encoded in the graph's topology and the nodes' attributes. On homophilous graphs, the integration of pooling layers has been shown to enhance the performance of Graph Neural Networks by accounting for inherent multi-scale structure. Here, similar nodes are grouped together to coarsen the graph and reduce th… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 32 pages, 7 figures

  2. arXiv:2407.00031  [pdf, other

    cs.DC cs.SE

    Supercharging Federated Learning with Flower and NVIDIA FLARE

    Authors: Holger R. Roth, Daniel J. Beutel, Yan Cheng, Javier Fernandez Marques, Heng Pan, Chester Chen, Zhihong Zhang, Yuhong Wen, Sean Yang, Isaac, Yang, Yuan-Ting Hsieh, Ziyue Xu, Daguang Xu, Nicholas D. Lane, Andrew Feng

    Abstract: Several open-source systems, such as Flower and NVIDIA FLARE, have been developed in recent years while focusing on different aspects of federated learning (FL). Flower is dedicated to implementing a cohesive approach to FL, analytics, and evaluation. Over time, Flower has cultivated extensive strategies and algorithms tailored for FL application development, fostering a vibrant FL community in re… ▽ More

    Submitted 21 May, 2024; originally announced July 2024.

  3. arXiv:2406.12072  [pdf, other

    cs.AI cs.LG

    DTGB: A Comprehensive Benchmark for Dynamic Text-Attributed Graphs

    Authors: Jiasheng Zhang, Jialin Chen, Menglin Yang, Aosong Feng, Shuang Liang, Jie Shao, Rex Ying

    Abstract: Dynamic text-attributed graphs (DyTAGs) are prevalent in various real-world scenarios, where each node and edge are associated with text descriptions, and both the graph structure and text descriptions evolve over time. Despite their broad applicability, there is a notable scarcity of benchmark datasets tailored to DyTAGs, which hinders the potential advancement in many research fields. To address… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 28 pages, 13 figures

  4. CMDBench: A Benchmark for Coarse-to-fine Multimodal Data Discovery in Compound AI Systems

    Authors: Yanlin Feng, Sajjadur Rahman, Aaron Feng, Vincent Chen, Eser Kandogan

    Abstract: Compound AI systems (CASs) that employ LLMs as agents to accomplish knowledge-intensive tasks via interactions with tools and data retrievers have garnered significant interest within database and AI communities. While these systems have the potential to supplement typical analysis workflows of data analysts in enterprise data platforms, unfortunately, CASs are subject to the same data discovery c… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Governance, Understanding and Integration of Data for Effective and Responsible AI (GUIDE-AI '24), June 14, 2024, Santiago, AA, Chile

  5. arXiv:2405.12369  [pdf, other

    cs.CV

    AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field

    Authors: Rong Liu, Rui Xu, Yue Hu, Meida Chen, Andrew Feng

    Abstract: 3D Gaussian Splatting (3DGS) has recently advanced radiance field reconstruction by offering superior capabilities for novel view synthesis and real-time rendering speed. However, its strategy of blending optimization and adaptive density control might lead to sub-optimal results; it can sometimes yield noisy geometry and blurry artifacts due to prioritizing optimizing large Gaussians at the cost… ▽ More

    Submitted 22 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  6. arXiv:2404.01340  [pdf, other

    cs.LG cs.AI

    From Similarity to Superiority: Channel Clustering for Time Series Forecasting

    Authors: Jialin Chen, Jan Eric Lenssen, Aosong Feng, Weihua Hu, Matthias Fey, Leandros Tassiulas, Jure Leskovec, Rex Ying

    Abstract: Time series forecasting has attracted significant attention in recent decades. Previous studies have demonstrated that the Channel-Independent (CI) strategy improves forecasting performance by treating different channels individually, while it leads to poor generalization on unseen instances and ignores potentially necessary interactions between channels. Conversely, the Channel-Dependent (CD) str… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 20 pages, 6 figures

  7. arXiv:2403.10585  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Solving General Noisy Inverse Problem via Posterior Sampling: A Policy Gradient Viewpoint

    Authors: Haoyue Tang, Tian Xie, Aosong Feng, Hanyu Wang, Chenyang Zhang, Yang Bai

    Abstract: Solving image inverse problems (e.g., super-resolution and inpainting) requires generating a high fidelity image that matches the given input (the low-resolution image or the masked image). By using the input image as guidance, we can leverage a pretrained diffusion generative model to solve a wide range of image inverse tasks without task specific model fine-tuning. To precisely estimate the guid… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted and to Appear, AISTATS 2024

  8. arXiv:2403.04882  [pdf, other

    cs.LG

    Efficient High-Resolution Time Series Classification via Attention Kronecker Decomposition

    Authors: Aosong Feng, Jialin Chen, Juan Garza, Brooklyn Berry, Francisco Salazar, Yifeng Gao, Rex Ying, Leandros Tassiulas

    Abstract: The high-resolution time series classification problem is essential due to the increasing availability of detailed temporal data in various domains. To tackle this challenge effectively, it is imperative that the state-of-the-art attention model is scalable to accommodate the growing sequence lengths typically encountered in high-resolution time series data, while also demonstrating robustness in… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  9. arXiv:2403.04880  [pdf, other

    cs.CV

    An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control

    Authors: Aosong Feng, Weikang Qiu, Jinbin Bai, Xiao Zhang, Zhen Dong, Kaicheng Zhou, Rex Ying, Leandros Tassiulas

    Abstract: Building on the success of text-to-image diffusion models (DPMs), image editing is an important application to enable human interaction with AI-generated content. Among various editing methods, editing within the prompt space gains more attention due to its capacity and simplicity of controlling semantics. However, since diffusion models are commonly pretrained on descriptive text captions, direct… ▽ More

    Submitted 28 May, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  10. arXiv:2402.14293  [pdf, other

    cs.CL

    Leveraging Large Language Models for Concept Graph Recovery and Question Answering in NLP Education

    Authors: Rui Yang, Boming Yang, Sixun Ouyang, Tianwei She, Aosong Feng, Yuang Jiang, Freddy Lecue, Jinghui Lu, Irene Li

    Abstract: In the domain of Natural Language Processing (NLP), Large Language Models (LLMs) have demonstrated promise in text-generation tasks. However, their educational applications, particularly for domain-specific queries, remain underexplored. This study investigates LLMs' capabilities in educational scenarios, focusing on concept graph recovery and question-answering (QA). We assess LLMs' zero-shot per… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  11. arXiv:2402.12288  [pdf, other

    eess.IV

    Revisiting registration-based synthesis: A focus on unsupervised MR image synthesis

    Authors: Savannah P. Hays, Lianrui Zuo, Yihao Liu, Anqi Feng, Jiachen Zhuo, Jerry L. Prince, Aaron Carass

    Abstract: Deep learning (DL) has led to significant improvements in medical image synthesis, enabling advanced image-to-image translation to generate synthetic images. However, DL methods face challenges such as domain shift and high demands for training data, limiting their generalizability and applicability. Historically, image synthesis was also carried out using deformable image registration (DIR), a me… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: SPIE Medical Imaging 2024

  12. arXiv:2402.07792  [pdf, other

    cs.LG cs.DC

    Empowering Federated Learning for Massive Models with NVIDIA FLARE

    Authors: Holger R. Roth, Ziyue Xu, Yuan-Ting Hsieh, Adithya Renduchintala, Isaac Yang, Zhihong Zhang, Yuhong Wen, Sean Yang, Kevin Lu, Kristopher Kersten, Camir Ricketts, Daguang Xu, Chester Chen, Yan Cheng, Andrew Feng

    Abstract: In the ever-evolving landscape of artificial intelligence (AI) and large language models (LLMs), handling and leveraging data effectively has become a critical challenge. Most state-of-the-art machine learning algorithms are data-centric. However, as the lifeblood of model performance, necessary data cannot always be centralized due to various factors such as privacy, regulation, geopolitics, copy… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  13. arXiv:2310.16002  [pdf, other

    cs.CV

    Integrating View Conditions for Image Synthesis

    Authors: Jinbin Bai, Zhen Dong, Aosong Feng, Xiao Zhang, Tian Ye, Kaicheng Zhou

    Abstract: In the field of image processing, applying intricate semantic modifications within existing images remains an enduring challenge. This paper introduces a pioneering framework that integrates viewpoint information to enhance the control of image editing tasks, especially for interior design scenes. By surveying existing object editing methodologies, we distill three essential criteria -- consistenc… ▽ More

    Submitted 8 May, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted by IJCAI 2024

  14. arXiv:2309.10011  [pdf, other

    cs.CV eess.IV

    Instant Photorealistic Style Transfer: A Lightweight and Adaptive Approach

    Authors: Rong Liu, Enyu Zhao, Zhiyuan Liu, Andrew Feng, Scott John Easley

    Abstract: In this paper, we propose an Instant Photorealistic Style Transfer (IPST) approach, designed to achieve instant photorealistic style transfer on super-resolution inputs without the need for pre-training on pair-wise datasets or imposing extra constraints. Our method utilizes a lightweight StyleNet to enable style transfer from a style image to a content image while preserving non-color information… ▽ More

    Submitted 20 October, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 8 pages (reference excluded), 6 figures, 4 tables

  15. arXiv:2308.13649   

    eess.IV

    Efficient Annotation for Medical Image Analysis: A One-Pass Selective Annotation Approach

    Authors: Yuli Wang, Peiyu Duan, Zhangxing Bian, Anqi Feng, Yuan Xue

    Abstract: Annotating biomedical images for supervised learning is a complex and labor-intensive task due to data diversity and its intricate nature. In this paper, we propose an innovative method, the efficient one-pass selective annotation (EPOSA), that significantly reduces the annotation burden while maintaining robust model performance. Our approach employs a variational autoencoder (VAE) to extract sal… ▽ More

    Submitted 14 September, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: We found that the idea and results of this paper were not mature enough to go public, after discussion with all co-authors, we decide to withdraw this paper

  16. arXiv:2308.13420  [pdf, other

    cs.NE cs.AI cs.LG

    Reinforcement Learning-assisted Evolutionary Algorithm: A Survey and Research Opportunities

    Authors: Yanjie Song, Yutong Wu, Yangyang Guo, Ran Yan, P. N. Suganthan, Yue Zhang, Witold Pedrycz, Swagatam Das, Rammohan Mallipeddi, Oladayo Solomon Ajani. Qiang Feng

    Abstract: Evolutionary algorithms (EA), a class of stochastic search methods based on the principles of natural evolution, have received widespread acclaim for their exceptional performance in various real-world optimization problems. While researchers worldwide have proposed a wide variety of EAs, certain limitations remain, such as slow convergence speed and poor generalization capabilities. Consequently,… ▽ More

    Submitted 27 January, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: 28 pages, 16 figures

    Report number: SWEVO-S-2023-00771

  17. arXiv:2307.15208  [pdf, other

    eess.IV cs.CV

    Generative AI for Medical Imaging: extending the MONAI Framework

    Authors: Walter H. L. Pinaya, Mark S. Graham, Eric Kerfoot, Petru-Daniel Tudosiu, Jessica Dafflon, Virginia Fernandez, Pedro Sanchez, Julia Wolleb, Pedro F. da Costa, Ashay Patel, Hyungjin Chung, Can Zhao, Wei Peng, Zelong Liu, Xueyan Mei, Oeslle Lucena, Jong Chul Ye, Sotirios A. Tsaftaris, Prerna Dogra, Andrew Feng, Marc Modat, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso

    Abstract: Recent advances in generative AI have brought incredible breakthroughs in several areas, including medical imaging. These generative models have tremendous potential not only to help safely share medical data via synthetic datasets but also to perform an array of diverse applications, such as anomaly detection, image-to-image translation, denoising, and MRI reconstruction. However, due to the comp… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

  18. arXiv:2307.13560  [pdf, other

    cs.CL

    XDLM: Cross-lingual Diffusion Language Model for Machine Translation

    Authors: Linyao Chen, Aosong Feng, Boming Yang, Zihui Li

    Abstract: Recently, diffusion models have excelled in image generation tasks and have also been applied to neural language processing (NLP) for controllable text generation. However, the application of diffusion models in a cross-lingual setting is less unexplored. Additionally, while pretraining with diffusion models has been studied within a single language, the potential of cross-lingual pretraining rema… ▽ More

    Submitted 30 July, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

  19. arXiv:2307.05780  [pdf

    cs.CV

    Automated Artifact Detection in Ultra-widefield Fundus Photography of Patients with Sickle Cell Disease

    Authors: Anqi Feng, Dimitri Johnson, Grace R. Reilly, Loka Thangamathesvaran, Ann Nampomba, Mathias Unberath, Adrienne W. Scott, Craig Jones

    Abstract: Importance: Ultra-widefield fundus photography (UWF-FP) has shown utility in sickle cell retinopathy screening; however, image artifact may diminish quality and gradeability of images. Objective: To create an automated algorithm for UWF-FP artifact classification. Design: A neural network based automated artifact detection algorithm was designed to identify commonly encountered UWF-FP artifacts in… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  20. arXiv:2305.10655  [pdf, other

    eess.IV cs.CV cs.LG

    DeepEdit: Deep Editable Learning for Interactive Segmentation of 3D Medical Images

    Authors: Andres Diaz-Pinto, Pritesh Mehta, Sachidanand Alle, Muhammad Asad, Richard Brown, Vishwesh Nath, Alvin Ihsani, Michela Antonelli, Daniel Palkovics, Csaba Pinter, Ron Alkalay, Steve Pieper, Holger R. Roth, Daguang Xu, Prerna Dogra, Tom Vercauteren, Andrew Feng, Abood Quraini, Sebastien Ourselin, M. Jorge Cardoso

    Abstract: Automatic segmentation of medical images is a key step for diagnostic and interventional tasks. However, achieving this requires large amounts of annotated volumes, which can be tedious and time-consuming task for expert annotators. In this paper, we introduce DeepEdit, a deep learning-based method for volumetric medical image annotation, that allows automatic and semi-automatic segmentation, and… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  21. arXiv:2305.03319  [pdf, other

    cs.CL

    HiPool: Modeling Long Documents Using Graph Neural Networks

    Authors: Irene Li, Aosong Feng, Dragomir Radev, Rex Ying

    Abstract: Encoding long sequences in Natural Language Processing (NLP) is a challenging problem. Though recent pretraining language models achieve satisfying performances in many NLP tasks, they are still restricted by a pre-defined maximum length, making them challenging to be extended to longer sequences. So some recent works utilize hierarchies to model long sequences. However, most of them apply sequent… ▽ More

    Submitted 14 May, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Journal ref: ACL 2023 main proceedings

  22. FIREBALL: A Dataset of Dungeons and Dragons Actual-Play with Structured Game State Information

    Authors: Andrew Zhu, Karmanya Aggarwal, Alexander Feng, Lara J. Martin, Chris Callison-Burch

    Abstract: Dungeons & Dragons (D&D) is a tabletop roleplaying game with complex natural language interactions between players and hidden state information. Recent work has shown that large language models (LLMs) that have access to state information can generate higher quality game turns than LLMs that use dialog history alone. However, previous work used game state information that was heuristically created… ▽ More

    Submitted 25 May, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: 21 pages, 2 figures. Accepted at ACL 2023

    Journal ref: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023, pp. 4171-4193

  23. arXiv:2303.17706  [pdf, other

    eess.IV q-bio.QM

    Label Propagation via Random Walk for Training Robust Thalamus Nuclei Parcellation Model from Noisy Annotations

    Authors: Anqi Feng, Yuan Xue, Yuli Wang, Chang Yan, Zhangxing Bian, Muhan Shao, Jiachen Zhuo, Rao P. Gullapalli, Aaron Carass, Jerry L. Prince

    Abstract: Data-driven thalamic nuclei parcellation depends on high-quality manual annotations. However, the small size and low contrast changes among thalamic nuclei, yield annotations that are often incomplete, noisy, or ambiguously labelled. To train a robust thalamic nuclei parcellation model with noisy annotations, we propose a label propagation algorithm based on random walker to refine the annotations… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

  24. arXiv:2303.12822  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Co-Speech Gesture Synthesis using Discrete Gesture Token Learning

    Authors: Shuhong Lu, Youngwoo Yoon, Andrew Feng

    Abstract: Synthesizing realistic co-speech gestures is an important and yet unsolved problem for creating believable motions that can drive a humanoid robot to interact and communicate with human users. Such capability will improve the impressions of the robots by human users and will find applications in education, training, and medical services. One challenge in learning the co-speech gesture model is tha… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: 8 pages, 3 figures, 3 tables

  25. arXiv:2303.01922  [pdf, other

    eess.IV

    Automated Ventricle Parcellation and Evan's Ratio Computation in Pre- and Post-Surgical Ventriculomegaly

    Authors: Yuli Wang, Anqi Feng, Yuan Xue, Lianrui Zuo, Yihao Liu, Ari M. Blitz, Mark G. Luciano, Aaron Carass, Jerry L. Prince

    Abstract: Normal pressure hydrocephalus~(NPH) is a brain disorder associated with enlarged ventricles and multiple cognitive and motor symptoms. The degree of ventricular enlargement can be measured using magnetic resonance images~(MRIs) and characterized quantitatively using the Evan's ratio (ER). Automatic computation of ER is desired to avoid the extra time and variations associated with manual measureme… ▽ More

    Submitted 6 March, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

  26. Author as Character and Narrator: Deconstructing Personal Narratives from the r/AmITheAsshole Reddit Community

    Authors: Salvatore Giorgi, Ke Zhao, Alexander H. Feng, Lara J. Martin

    Abstract: In the r/AmITheAsshole subreddit, people anonymously share first person narratives that contain some moral dilemma or conflict and ask the community to judge who is at fault (i.e., who is "the asshole"). In general, first person narratives are a unique storytelling domain where the author is the narrator (the person telling the story) but can also be a character (the person living the story) and,… ▽ More

    Submitted 15 March, 2023; v1 submitted 19 January, 2023; originally announced January 2023.

    Comments: Accepted to the 17th International AAAI Conference on Web and Social Media (ICWSM), 2023

    Journal ref: Proceedings of the International AAAI Conference on Web and Social Media (ICWSM) 2023, 17(1), 233-244

  27. arXiv:2301.06114  [pdf, other

    eess.IV cs.LG

    Segmenting thalamic nuclei from manifold projections of multi-contrast MRI

    Authors: Chang Yan, Muhan Shao, Zhangxing Bian, Anqi Feng, Yuan Xue, Jiachen Zhuo, Rao P. Gullapalli, Aaron Carass, Jerry L. Prince

    Abstract: The thalamus is a subcortical gray matter structure that plays a key role in relaying sensory and motor signals within the brain. Its nuclei can atrophy or otherwise be affected by neurological disease and injuries including mild traumatic brain injury. Segmenting both the thalamus and its nuclei is challenging because of the relatively low contrast within and around the thalamus in conventional m… ▽ More

    Submitted 31 January, 2023; v1 submitted 15 January, 2023; originally announced January 2023.

    Comments: 8 pages, 3 figures, 2023 SPIE-MI Image Processing

  28. arXiv:2211.02701  [pdf, other

    cs.LG cs.AI cs.CV

    MONAI: An open-source framework for deep learning in healthcare

    Authors: M. Jorge Cardoso, Wenqi Li, Richard Brown, Nic Ma, Eric Kerfoot, Yiheng Wang, Benjamin Murrey, Andriy Myronenko, Can Zhao, Dong Yang, Vishwesh Nath, Yufan He, Ziyue Xu, Ali Hatamizadeh, Andriy Myronenko, Wentao Zhu, Yun Liu, Mingxin Zheng, Yucheng Tang, Isaac Yang, Michael Zephyr, Behrooz Hashemian, Sachidanand Alle, Mohammad Zalbagi Darestani, Charlie Budd , et al. (32 additional authors not shown)

    Abstract: Artificial Intelligence (AI) is having a tremendous impact across most areas of science. Applications of AI in healthcare have the potential to improve our ability to detect, diagnose, prognose, and intervene on human disease. For AI models to be used clinically, they need to be made safe, reproducible and robust, and the underlying software framework must be aware of the particularities (e.g. geo… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: www.monai.io

  29. arXiv:2210.13291  [pdf, other

    cs.LG cs.AI cs.CV cs.NI cs.SE

    NVIDIA FLARE: Federated Learning from Simulation to Real-World

    Authors: Holger R. Roth, Yan Cheng, Yuhong Wen, Isaac Yang, Ziyue Xu, Yuan-Ting Hsieh, Kristopher Kersten, Ahmed Harouni, Can Zhao, Kevin Lu, Zhihong Zhang, Wenqi Li, Andriy Myronenko, Dong Yang, Sean Yang, Nicola Rieke, Abood Quraini, Chester Chen, Daguang Xu, Nic Ma, Prerna Dogra, Mona Flores, Andrew Feng

    Abstract: Federated learning (FL) enables building robust and generalizable AI models by leveraging diverse datasets from multiple collaborators without centralizing the data. We created NVIDIA FLARE as an open-source software development kit (SDK) to make it easier for data scientists to use FL in their research and real-world applications. The SDK includes solutions for state-of-the-art FL algorithms and… ▽ More

    Submitted 28 April, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted at the International Workshop on Federated Learning, NeurIPS 2022, New Orleans, USA (https://federated-learning.org/fl-neurips-2022); Revised version v2: added Key Components list, system metrics for homomorphic encryption experiment; Extended v3 for journal submission

    Journal ref: IEEE Data Eng. Bull., Vol. 46, No. 1, 2023

  30. arXiv:2210.11794  [pdf, other

    cs.LG cs.CL

    Diffuser: Efficient Transformers with Multi-hop Attention Diffusion for Long Sequences

    Authors: Aosong Feng, Irene Li, Yuang Jiang, Rex Ying

    Abstract: Efficient Transformers have been developed for long sequence modeling, due to their subquadratic memory and time complexity. Sparse Transformer is a popular approach to improving the efficiency of Transformers by restricting self-attention to locations specified by the predefined sparse patterns. However, leveraging sparsity may sacrifice expressiveness compared to full-attention, when important t… ▽ More

    Submitted 31 January, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

  31. arXiv:2210.03534  [pdf, other

    cs.NI

    A Quantitative Theory of Bottleneck Structures for Data Networks

    Authors: Jordi Ros-Giralt, Noah Amsel, Sruthi Yellamraju, James Ezick, Richard Lethin, Yuang Jiang, Aosong Feng, Leandros Tassiulas

    Abstract: The conventional view of the congestion control problem in data networks is based on the principle that a flow's performance is uniquely determined by the state of its bottleneck link, regardless of the topological properties of the network. However, recent work has shown that the behavior of congestion-controlled networks is better explained by models that account for the interactions between bot… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

  32. Detecting Environmental Violations with Satellite Imagery in Near Real Time: Land Application under the Clean Water Act

    Authors: Ben Chugg, Nicolas Rothbacher, Alex Feng, Xiaoqi Long, Daniel E. Ho

    Abstract: This paper introduces a new, highly consequential setting for the use of computer vision for environmental sustainability. Concentrated Animal Feeding Operations (CAFOs) (aka intensive livestock farms or "factory farms") produce significant manure and pollution. Dumping manure in the winter months poses significant environmental risks and violates environmental law in many states. Yet the federal… ▽ More

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: Accepted to CIKM '22

  33. arXiv:2207.05064  [pdf, other

    cs.LG cs.AI

    Adaptive Graph Spatial-Temporal Transformer Network for Traffic Flow Forecasting

    Authors: Aosong Feng, Leandros Tassiulas

    Abstract: Traffic flow forecasting on graphs has real-world applications in many fields, such as transportation system and computer networks. Traffic forecasting can be highly challenging due to complex spatial-temporal correlations and non-linear traffic patterns. Existing works mostly model such spatial-temporal dependencies by considering spatial correlations and temporal correlations separately and fail… ▽ More

    Submitted 9 July, 2022; originally announced July 2022.

  34. arXiv:2203.12362  [pdf, other

    cs.HC cs.CV cs.LG eess.IV

    MONAI Label: A framework for AI-assisted Interactive Labeling of 3D Medical Images

    Authors: Andres Diaz-Pinto, Sachidanand Alle, Vishwesh Nath, Yucheng Tang, Alvin Ihsani, Muhammad Asad, Fernando Pérez-García, Pritesh Mehta, Wenqi Li, Mona Flores, Holger R. Roth, Tom Vercauteren, Daguang Xu, Prerna Dogra, Sebastien Ourselin, Andrew Feng, M. Jorge Cardoso

    Abstract: The lack of annotated datasets is a major bottleneck for training new task-specific supervised machine learning models, considering that manual annotation is extremely expensive and time-consuming. To address this problem, we present MONAI Label, a free and open-source framework that facilitates the development of applications based on artificial intelligence (AI) models that aim at reducing the t… ▽ More

    Submitted 28 April, 2023; v1 submitted 23 March, 2022; originally announced March 2022.

  35. arXiv:2203.09065  [pdf, other

    cs.CV

    STPLS3D: A Large-Scale Synthetic and Real Aerial Photogrammetry 3D Point Cloud Dataset

    Authors: Meida Chen, Qingyong Hu, Zifan Yu, Hugues Thomas, Andrew Feng, Yu Hou, Kyle McCullough, Fengbo Ren, Lucio Soibelman

    Abstract: Although various 3D datasets with different functions and scales have been proposed recently, it remains challenging for individuals to complete the whole pipeline of large-scale data collection, sanitization, and annotation. Moreover, the created datasets usually suffer from extremely imbalanced class distribution or partial low-quality data samples. Motivated by this, we explore the procedurally… ▽ More

    Submitted 13 October, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Report number: https://bmvc2022.mpi-inf.mpg.de/0429.pdf

    Journal ref: https://bmvc2022.mpi-inf.mpg.de/0429.pdf

  36. arXiv:2202.06924  [pdf, other

    cs.LG cs.CR cs.CV cs.DC

    Do Gradient Inversion Attacks Make Federated Learning Unsafe?

    Authors: Ali Hatamizadeh, Hongxu Yin, Pavlo Molchanov, Andriy Myronenko, Wenqi Li, Prerna Dogra, Andrew Feng, Mona G. Flores, Jan Kautz, Daguang Xu, Holger R. Roth

    Abstract: Federated learning (FL) allows the collaborative training of AI models without needing to share raw data. This capability makes it especially interesting for healthcare applications where patient and data privacy is of utmost concern. However, recent works on the inversion of deep neural networks from model gradients raised concerns about the security of FL in preventing the leakage of training da… ▽ More

    Submitted 30 January, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Revised version; Accepted to IEEE Transactions on Medical Imaging; Improved and reformatted version of https://www.researchsquare.com/article/rs-1147182/v2; Added NVFlare reference

  37. arXiv:2201.00491  [pdf, other

    cs.LG cs.AI

    KerGNNs: Interpretable Graph Neural Networks with Graph Kernels

    Authors: Aosong Feng, Chenyu You, Shiqiang Wang, Leandros Tassiulas

    Abstract: Graph kernels are historically the most widely-used technique for graph classification tasks. However, these methods suffer from limited performance because of the hand-crafted combinatorial features of graphs. In recent years, graph neural networks (GNNs) have become the state-of-the-art method in downstream graph-related tasks due to their superior performance. Most GNNs are based on Message Pas… ▽ More

    Submitted 25 February, 2022; v1 submitted 3 January, 2022; originally announced January 2022.

  38. Nanoparticle Radiosensitization: from extended local effect modeling to a survival modificationframework of compound Poisson additive killing and its carbon dots validation

    Authors: Hailun Pan, Xiaowa Wang, Aihui Feng, Qinqin Cheng, Xue Chen, Xiaodong He, Xinglan Qin, Xiaolong Sha, Shen Fu, Cuiping Chi, Xufei Wang

    Abstract: Objective: To construct an analytical model instead of local effect modeling for the prediction of the biological effectiveness of nanoparticle radiosensitization. Approach: An extended local effects model is first proposed with a more comprehensive description of the nanoparticles mediated local killing enhancements, but meanwhile puts forward challenging issues that remain difficult and need to… ▽ More

    Submitted 30 December, 2021; v1 submitted 29 October, 2021; originally announced November 2021.

  39. arXiv:2110.15327  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    MEGAN: Memory Enhanced Graph Attention Network for Space-Time Video Super-Resolution

    Authors: Chenyu You, Lianyi Han, Aosong Feng, Ruihan Zhao, Hui Tang, Wei Fan

    Abstract: Space-time video super-resolution (STVSR) aims to construct a high space-time resolution video sequence from the corresponding low-frame-rate, low-resolution video sequence. Inspired by the recent success to consider spatial-temporal information for space-time super-resolution, our main goal in this work is to take full considerations of spatial and temporal correlations within the video sequences… ▽ More

    Submitted 29 November, 2021; v1 submitted 28 October, 2021; originally announced October 2021.

  40. arXiv:2109.12221  [pdf

    cs.CV

    Ground material classification for UAV-based photogrammetric 3D data A 2D-3D Hybrid Approach

    Authors: Meida Chen, Andrew Feng, Yu Hou, Kyle McCullough, Pratusha Bhuvana Prasad, Lucio Soibelman

    Abstract: In recent years, photogrammetry has been widely used in many areas to create photorealistic 3D virtual data representing the physical environment. The innovation of small unmanned aerial vehicles (sUAVs) has provided additional high-resolution imaging capabilities with low cost for mapping a relatively large area of interest. These cutting-edge technologies have caught the US Army and Navy's atten… ▽ More

    Submitted 15 October, 2022; v1 submitted 24 September, 2021; originally announced September 2021.

    Journal ref: Interservice/Industry Training, Simulation, and Education Conference (I/ITSEC) 2021

  41. arXiv:2103.14620  [pdf, other

    cs.CL

    LiGCN: Label-interpretable Graph Convolutional Networks for Multi-label Text Classification

    Authors: Irene Li, Aosong Feng, Hao Wu, Tianxiao Li, Toyotaro Suzumura, Ruihai Dong

    Abstract: Multi-label text classification (MLTC) is an attractive and challenging task in natural language processing (NLP). Compared with single-label text classification, MLTC has a wider range of applications in practice. In this paper, we propose a label-interpretable graph convolutional network model to solve the MLTC problem by modeling tokens and labels as nodes in a heterogeneous graph. In this way,… ▽ More

    Submitted 22 May, 2022; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: 8 tables, 3 figures

    Journal ref: DLG4NLP Workshop, NAACL 2022

  42. arXiv:2012.11300  [pdf, other

    cond-mat.mtrl-sci physics.app-ph physics.optics

    Revealing trap depth distributions in persistent phosphors with a thermal barrier for charging

    Authors: Ang Feng, Jonas J. Joos, Jiaren Du, Philippe F. Smet

    Abstract: The performance of persistent phosphors under given charging and working conditions is determined by the properties of the traps that are responsible for these unique properties. Traps are characterized by the height of their associated barrier for thermal detrapping, and a continuous distribution of trap depths is often found in real materials. Accurately determining trap depth distributions is h… ▽ More

    Submitted 2 June, 2022; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: This is the postprint of the paper Physical Review B 105, 205101. It also includes the supplemental material

    Journal ref: Phys. Rev. B; May 03, 2022, Volume 105 (issue 20)

  43. arXiv:2009.00185  [pdf

    cs.CV

    Utilizing Satellite Imagery Datasets and Machine Learning Data Models to Evaluate Infrastructure Change in Undeveloped Regions

    Authors: Kyle McCullough, Andrew Feng, Meida Chen, Ryan McAlinden

    Abstract: In the globalized economic world, it has become important to understand the purpose behind infrastructural and construction initiatives occurring within developing regions of the earth. This is critical when the financing for such projects must be coming from external sources, as is occurring throughout major portions of the African continent. When it comes to imagery analysis to research these re… ▽ More

    Submitted 31 August, 2020; originally announced September 2020.

    Journal ref: Interservice/Industry Training, Simulation, and Education Conference (I/ITSEC) 2020

  44. arXiv:2008.09648  [pdf

    cs.CV

    Semantic Segmentation and Data Fusion of Microsoft Bing 3D Cities and Small UAV-based Photogrammetric Data

    Authors: Meida Chen, Andrew Feng, Kyle McCullough, Pratusha Bhuvana Prasad, Ryan McAlinden, Lucio Soibelman

    Abstract: With state-of-the-art sensing and photogrammetric techniques, Microsoft Bing Maps team has created over 125 highly detailed 3D cities from 11 different countries that cover hundreds of thousands of square kilometer areas. The 3D city models were created using the photogrammetric technique with high-resolution images that were captured from aircraft-mounted cameras. Such a large 3D city database ha… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

    Journal ref: Interservice/Industry Training, Simulation, and Education Conference (I/ITSEC) 2020

  45. arXiv:2008.09647  [pdf

    cs.CV

    Generating synthetic photogrammetric data for training deep learning based 3D point cloud segmentation models

    Authors: Meida Chen, Andrew Feng, Kyle McCullough, Pratusha Bhuvana Prasad, Ryan McAlinden, Lucio Soibelman

    Abstract: At I/ITSEC 2019, the authors presented a fully-automated workflow to segment 3D photogrammetric point-clouds/meshes and extract object information, including individual tree locations and ground materials (Chen et al., 2019). The ultimate goal is to create realistic virtual environments and provide the necessary information for simulation. We tested the generalizability of the previously proposed… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

    Journal ref: Interservice/Industry Training, Simulation, and Education Conference (I/ITSEC) 2020

  46. arXiv:2008.03697  [pdf

    cs.CV

    Fully Automated Photogrammetric Data Segmentation and Object Information Extraction Approach for Creating Simulation Terrain

    Authors: Meida Chen, Andrew Feng, Kyle McCullough, Pratusha Bhuvana Prasad, Ryan McAlinden, Lucio Soibelman, Mike Enloe

    Abstract: Our previous works have demonstrated that visually realistic 3D meshes can be automatically reconstructed with low-cost, off-the-shelf unmanned aerial systems (UAS) equipped with capable cameras, and efficient photogrammetric software techniques. However, such generated data do not contain semantic information/features of objects (i.e., man-made objects, vegetation, ground, object materials, etc.)… ▽ More

    Submitted 9 August, 2020; originally announced August 2020.

    Journal ref: Interservice/Industry Training, Simulation, and Education Conference (I/ITSEC) 2019

  47. arXiv:2003.13968  [pdf, other

    cs.DB cs.IR

    Towards Productionizing Subjective Search Systems

    Authors: Aaron Feng, Shuwei Chen, Yuliang Li, Hiroshi Matsuda, Hidekazu Tamaki, Wang-Chiew Tan

    Abstract: Existing e-commerce search engines typically support search only over objective attributes, such as price and locations, leaving the more desirable subjective attributes, such as romantic vibe and worklife balance unsearchable. We found that this is also the case for Recruit Group, which operates a wide range of online booking and search services, including jobs, travel, housing, bridal, dining, b… ▽ More

    Submitted 31 March, 2020; originally announced March 2020.

    Comments: In Submission to VLDB 2020

  48. arXiv:2003.01204  [pdf

    cs.CV cs.LG

    Energy-efficient and Robust Cumulative Training with Net2Net Transformation

    Authors: Aosong Feng, Priyadarshini Panda

    Abstract: Deep learning has achieved state-of-the-art accuracies on several computer vision tasks. However, the computational and energy requirements associated with training such deep neural networks can be quite high. In this paper, we propose a cumulative training strategy with Net2Net transformation that achieves training computational efficiency without incurring large accuracy loss, in comparison to a… ▽ More

    Submitted 2 March, 2020; originally announced March 2020.

    Comments: 6 pages, 6 figures, 2 Tables

  49. arXiv:1912.00406  [pdf, ps, other

    cs.IT

    Reconsidering Design of Multi-Antenna NOMA Systems with Limited Feedback

    Authors: Zhiyao Tang, Liang Sun, Lu Cao, Shutong Qi, and Yong Feng

    Abstract: We provide in this paper a comprehensive solution to the design, performance analysis, and optimization of a multi-antenna non-orthogonal multiple access (NOMA) system for multiuser downlink communications under a general limited channel state information (CSI) feedback framework for frequency division duplex mode. We design a general framework including user clustering, joint power and bits alloc… ▽ More

    Submitted 1 December, 2019; originally announced December 2019.

    Comments: accepted to IEEE Transactions on Wireless Communications,2019

  50. arXiv:1910.00962  [pdf, other

    cs.CV

    Privacy-preserving Federated Brain Tumour Segmentation

    Authors: Wenqi Li, Fausto Milletarì, Daguang Xu, Nicola Rieke, Jonny Hancox, Wentao Zhu, Maximilian Baust, Yan Cheng, Sébastien Ourselin, M. Jorge Cardoso, Andrew Feng

    Abstract: Due to medical data privacy regulations, it is often infeasible to collect and share patient data in a centralised data lake. This poses challenges for training machine learning algorithms, such as deep convolutional networks, which often require large numbers of diverse training examples. Federated learning sidesteps this difficulty by bringing code to the patient data owners and only sharing int… ▽ More

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: MICCAI MLMI 2019