(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 82 results for author: Ng, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.01938  [pdf, other

    cs.CV cs.MM

    Nutrition Estimation for Dietary Management: A Transformer Approach with Depth Sensing

    Authors: Zhengyi Kwan, Wei Zhang, Zhengkui Wang, Aik Beng Ng, Simon See

    Abstract: Nutrition estimation is crucial for effective dietary management and overall health and well-being. Existing methods often struggle with sub-optimal accuracy and can be time-consuming. In this paper, we propose NuNet, a transformer-based network designed for nutrition estimation that utilizes both RGB and depth information from food images. We have designed and implemented a multi-scale encoder an… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages

  2. arXiv:2405.09798  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Many-Shot In-Context Learning in Multimodal Foundation Models

    Authors: Yixing Jiang, Jeremy Irvin, Ji Hun Wang, Muhammad Ahmed Chaudhry, Jonathan H. Chen, Andrew Y. Ng

    Abstract: Large language models are well-known to be effective at few-shot in-context learning (ICL). Recent advancements in multimodal foundation models have enabled unprecedentedly long context windows, presenting an opportunity to explore their capability to perform ICL with many more demonstrating examples. In this work, we evaluate the performance of multimodal foundation models scaling from few-shot t… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  3. Compositional Factorization of Visual Scenes with Convolutional Sparse Coding and Resonator Networks

    Authors: Christopher J. Kymn, Sonia Mazelet, Annabel Ng, Denis Kleyko, Bruno A. Olshausen

    Abstract: We propose a system for visual scene analysis and recognition based on encoding the sparse, latent feature-representation of an image into a high-dimensional vector that is subsequently factorized to parse scene content. The sparse feature representation is learned from image statistics via convolutional sparse coding, while scene parsing is performed by a resonator network. The integration of spa… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 9 pages, 5 figures

    Journal ref: 2024 Neuro Inspired Computational Elements Conference (NICE)

  4. arXiv:2404.17033  [pdf, other

    cs.CV

    Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segmentation

    Authors: Tanvi Deshpande, Eva Prakash, Elsie Gyang Ross, Curtis Langlotz, Andrew Ng, Jeya Maria Jose Valanarasu

    Abstract: The high cost of creating pixel-by-pixel gold-standard labels, limited expert availability, and presence of diverse tasks make it challenging to generate segmentation labels to train deep learning models for medical imaging tasks. In this work, we present a new approach to overcome the hurdle of costly medical image labeling by leveraging foundation models like Segment Anything Model (SAM) and its… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted at MIDL 2024

  5. arXiv:2404.13185  [pdf, other

    eess.IV cs.CV

    Unlocking Robust Segmentation Across All Age Groups via Continual Learning

    Authors: Chih-Ying Liu, Jeya Maria Jose Valanarasu, Camila Gonzalez, Curtis Langlotz, Andrew Ng, Sergios Gatidis

    Abstract: Most deep learning models in medical imaging are trained on adult data with unclear performance on pediatric images. In this work, we aim to address this challenge in the context of automated anatomy segmentation in whole-body Computed Tomography (CT). We evaluate the performance of CT organ segmentation algorithms trained on adult data when applied to pediatric CT volumes and identify substantial… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  6. arXiv:2401.14486  [pdf, other

    cs.CV cs.LG

    CloudTracks: A Dataset for Localizing Ship Tracks in Satellite Images of Clouds

    Authors: Muhammad Ahmed Chaudhry, Lyna Kim, Jeremy Irvin, Yuzu Ido, Sonia Chu, Jared Thomas Isobe, Andrew Y. Ng, Duncan Watson-Parris

    Abstract: Clouds play a significant role in global temperature regulation through their effect on planetary albedo. Anthropogenic emissions of aerosols can alter the albedo of clouds, but the extent of this effect, and its consequent impact on temperature change, remains uncertain. Human-induced clouds caused by ship aerosol emissions, commonly referred to as ship tracks, provide visible manifestations of t… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 11 pages, 5 figures, submitted to Journal of Machine Learning Research

  7. arXiv:2312.02200  [pdf, other

    cs.CV cs.AI stat.AP

    An Empirical Study of Automated Mislabel Detection in Real World Vision Datasets

    Authors: Maya Srikanth, Jeremy Irvin, Brian Wesley Hill, Felipe Godoy, Ishan Sabane, Andrew Y. Ng

    Abstract: Major advancements in computer vision can primarily be attributed to the use of labeled datasets. However, acquiring labels for datasets often results in errors which can harm model performance. Recent works have proposed methods to automatically identify mislabeled images, but developing strategies to effectively implement them in real world datasets has been sparsely explored. Towards improved d… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  8. arXiv:2312.02199  [pdf, other

    cs.CV cs.AI cs.LG eess.IV stat.AP

    USat: A Unified Self-Supervised Encoder for Multi-Sensor Satellite Imagery

    Authors: Jeremy Irvin, Lucas Tao, Joanne Zhou, Yuntao Ma, Langston Nashold, Benjamin Liu, Andrew Y. Ng

    Abstract: Large, self-supervised vision models have led to substantial advancements for automatically interpreting natural images. Recent works have begun tailoring these methods to remote sensing data which has rich structure with multi-sensor, multi-spectral, and temporal information providing massive amounts of self-labeled data that can be used for self-supervised pre-training. In this work, we develop… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  9. arXiv:2311.17449  [pdf, other

    cs.CV

    Weakly-semi-supervised object detection in remotely sensed imagery

    Authors: Ji Hun Wang, Jeremy Irvin, Beri Kohen Behar, Ha Tran, Raghav Samavedam, Quentin Hsu, Andrew Y. Ng

    Abstract: Deep learning for detecting objects in remotely sensed imagery can enable new technologies for important applications including mitigating climate change. However, these models often require large datasets labeled with bounding box annotations which are expensive to curate, prohibiting the development of models for new tasks and geographies. To address this challenge, we develop weakly-semi-superv… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Tackling Climate Change with Machine Learning at NeurIPS 2023

  10. arXiv:2311.09574  [pdf, other

    cs.LG cs.AI cs.CV

    LymphoML: An interpretable artificial intelligence-based method identifies morphologic features that correlate with lymphoma subtype

    Authors: Vivek Shankar, Xiaoli Yang, Vrishab Krishna, Brent Tan, Oscar Silva, Rebecca Rojansky, Andrew Ng, Fabiola Valvert, Edward Briercheck, David Weinstock, Yasodha Natkunam, Sebastian Fernandez-Pol, Pranav Rajpurkar

    Abstract: The accurate classification of lymphoma subtypes using hematoxylin and eosin (H&E)-stained tissue is complicated by the wide range of morphological features these cancers can exhibit. We present LymphoML - an interpretable machine learning method that identifies morphologic features that correlate with lymphoma subtypes. Our method applies steps to process H&E-stained tissue microarray cores, segm… ▽ More

    Submitted 19 November, 2023; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: To be published in Proceedings of the 3rd Machine Learning for Health symposium, Proceedings of Machine Learning Research (PMLR)

    ACM Class: I.5.1; I.5.2; I.5.4; J.3

  11. arXiv:2305.08017  [pdf, other

    cs.CV

    How to Train Your CheXDragon: Training Chest X-Ray Models for Transfer to Novel Tasks and Healthcare Systems

    Authors: Cara Van Uden, Jeremy Irvin, Mars Huang, Nathan Dean, Jason Carr, Andrew Ng, Curtis Langlotz

    Abstract: Self-supervised learning (SSL) enables label efficient training for machine learning models. This is essential for domains such as medical imaging, where labels are costly and time-consuming to curate. However, the most effective supervised or SSL strategy for transferring models to different healthcare systems or novel tasks is not well understood. In this work, we systematically experiment with… ▽ More

    Submitted 13 May, 2023; originally announced May 2023.

    Comments: 13 pages, 12 figures

  12. arXiv:2301.01842  [pdf, other

    cs.CV cs.CY

    Detecting Neighborhood Gentrification at Scale via Street-level Visual Data

    Authors: Tianyuan Huang, Timothy Dai, Zhecheng Wang, Hesu Yoon, Hao Sheng, Andrew Y. Ng, Ram Rajagopal, Jackelyn Hwang

    Abstract: Neighborhood gentrification plays a significant role in shaping the social and economic well-being of both individuals and communities at large. While some efforts have been made to detect gentrification in cities, existing approaches rely mainly on estimated measures from survey data, require substantial work of human labeling, and are limited in characterizing the neighborhood as a whole. We pro… ▽ More

    Submitted 4 January, 2023; originally announced January 2023.

  13. arXiv:2212.09895  [pdf, other

    cs.CL

    Improved Long-Form Spoken Language Translation with Large Language Models

    Authors: Arya D. McCarthy, Hao Zhang, Shankar Kumar, Felix Stahlberg, Axel H. Ng

    Abstract: A challenge in spoken language translation is that plenty of spoken content is long-form, but short units are necessary for obtaining high-quality translations. To address this mismatch, we fine-tune a general-purpose, large language model to split long ASR transcripts into segments that can be independently translated so as to maximize the overall translation quality. We compare to several segmen… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

  14. arXiv:2212.00581  [pdf

    eess.SY cs.AI

    An enhanced simulation-based multi-objective optimization approach with knowledge discovery for reconfigurable manufacturing systems

    Authors: Carlos Alberto Barrera-Diaz, Amir Nourmohammdi, Henrik Smedberg, Tehseen Aslam, Amos H. C. Ng

    Abstract: In today's uncertain and competitive market, where enterprises are subjected to increasingly shortened product life-cycles and frequent volume changes, reconfigurable manufacturing systems (RMS) applications play a significant role in the manufacturing industry's success. Despite the advantages offered by RMS, achieving a high-efficiency degree constitutes a challenging task for stakeholders and d… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

  15. arXiv:2209.15454  [pdf, other

    cs.LG cs.AI

    GPNet: Simplifying Graph Neural Networks via Multi-channel Geometric Polynomials

    Authors: Xun Liu, Alex Hay-Man Ng, Fangyuan Lei, Yikuan Zhang, Zhengmin Li

    Abstract: Graph Neural Networks (GNNs) are a promising deep learning approach for circumventing many real-world problems on graph-structured data. However, these models usually have at least one of four fundamental limitations: over-smoothing, over-fitting, difficult to train, and strong homophily assumption. For example, Simple Graph Convolution (SGC) is known to suffer from the first and fourth limitation… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

    Comments: 15 pages, 15 figures

  16. arXiv:2208.13027  [pdf, other

    cs.LG cs.AI

    Improving debris flow evacuation alerts in Taiwan using machine learning

    Authors: Yi-Lin Tsai, Jeremy Irvin, Suhas Chundi, Andrew Y. Ng, Christopher B. Field, Peter K. Kitanidis

    Abstract: Taiwan has the highest susceptibility to and fatalities from debris flows worldwide. The existing debris flow warning system in Taiwan, which uses a time-weighted measure of rainfall, leads to alerts when the measure exceeds a predefined threshold. However, this system generates many false alarms and misses a substantial fraction of the actual debris flows. Towards improving this system, we implem… ▽ More

    Submitted 2 September, 2022; v1 submitted 27 August, 2022; originally announced August 2022.

    Comments: Supplementary information: https://drive.google.com/file/d/1Y17YxXo5rhIbUuZzwLo99pmttbh28v9X/view?usp=sharing

  17. arXiv:2207.11166  [pdf, other

    cs.CV

    METER-ML: A Multi-Sensor Earth Observation Benchmark for Automated Methane Source Mapping

    Authors: Bryan Zhu, Nicholas Lui, Jeremy Irvin, Jimmy Le, Sahil Tadwalkar, Chenghao Wang, Zutao Ouyang, Frankie Y. Liu, Andrew Y. Ng, Robert B. Jackson

    Abstract: Reducing methane emissions is essential for mitigating global warming. To attribute methane emissions to their sources, a comprehensive dataset of methane source infrastructure is necessary. Recent advancements with deep learning on remotely sensed imagery have the potential to identify the locations and characteristics of methane sources, but there is a substantial lack of publicly available data… ▽ More

    Submitted 15 August, 2022; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: Workshop on Complex Data Challenges in Earth Observation at IJCAI-ECAI 2022

  18. arXiv:2207.10062  [pdf, other

    cs.LG

    DataPerf: Benchmarks for Data-Centric AI Development

    Authors: Mark Mazumder, Colby Banbury, Xiaozhe Yao, Bojan Karlaš, William Gaviria Rojas, Sudnya Diamos, Greg Diamos, Lynn He, Alicia Parrish, Hannah Rose Kirk, Jessica Quaye, Charvi Rastogi, Douwe Kiela, David Jurado, David Kanter, Rafael Mosquera, Juan Ciro, Lora Aroyo, Bilge Acun, Lingjiao Chen, Mehul Smriti Raje, Max Bartolo, Sabri Eyuboglu, Amirata Ghorbani, Emmett Goodman , et al. (20 additional authors not shown)

    Abstract: Machine learning research has long focused on models rather than datasets, and prominent datasets are used for common ML tasks without regard to the breadth, difficulty, and faithfulness of the underlying problems. Neglecting the fundamental importance of data has given rise to inaccuracy, bias, and fragility in real-world applications, and research is hindered by saturation across existing datase… ▽ More

    Submitted 13 October, 2023; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: NeurIPS 2023 Datasets and Benchmarks Track

  19. arXiv:2201.01449  [pdf, other

    eess.IV cs.CV cs.LG

    Deep Learning-Based Sparse Whole-Slide Image Analysis for the Diagnosis of Gastric Intestinal Metaplasia

    Authors: Jon Braatz, Pranav Rajpurkar, Stephanie Zhang, Andrew Y. Ng, Jeanne Shen

    Abstract: In recent years, deep learning has successfully been applied to automate a wide variety of tasks in diagnostic histopathology. However, fast and reliable localization of small-scale regions-of-interest (ROI) has remained a key challenge, as discriminative morphologic features often occupy only a small fraction of a gigapixel-scale whole-slide image (WSI). In this paper, we propose a sparse WSI ana… ▽ More

    Submitted 4 January, 2022; originally announced January 2022.

  20. arXiv:2108.01764  [pdf, other

    cs.CL cs.AI

    Q-Pain: A Question Answering Dataset to Measure Social Bias in Pain Management

    Authors: Cécile Logé, Emily Ross, David Yaw Amoah Dadey, Saahil Jain, Adriel Saporta, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: Recent advances in Natural Language Processing (NLP), and specifically automated Question Answering (QA) systems, have demonstrated both impressive linguistic fluency and a pernicious tendency to reflect social biases. In this study, we introduce Q-Pain, a dataset for assessing bias in medical QA in the context of pain management, one of the most challenging forms of clinical decision-making. Alon… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

    Comments: Accepted to the 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks

  21. arXiv:2106.14463  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    RadGraph: Extracting Clinical Entities and Relations from Radiology Reports

    Authors: Saahil Jain, Ashwin Agrawal, Adriel Saporta, Steven QH Truong, Du Nguyen Duong, Tan Bui, Pierre Chambon, Yuhao Zhang, Matthew P. Lungren, Andrew Y. Ng, Curtis P. Langlotz, Pranav Rajpurkar

    Abstract: Extracting structured clinical information from free-text radiology reports can enable the use of radiology report information for a variety of critical healthcare applications. In our work, we present RadGraph, a dataset of entities and relations in full-text chest X-ray radiology reports based on a novel information extraction schema we designed to structure radiology reports. We release a devel… ▽ More

    Submitted 29 August, 2021; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: Accepted to the 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks

  22. arXiv:2106.04452  [pdf, other

    physics.med-ph cs.LG eess.SP

    3KG: Contrastive Learning of 12-Lead Electrocardiograms using Physiologically-Inspired Augmentations

    Authors: Bryan Gopal, Ryan W. Han, Gautham Raghupathi, Andrew Y. Ng, Geoffrey H. Tison, Pranav Rajpurkar

    Abstract: We propose 3KG, a physiologically-inspired contrastive learning approach that generates views using 3D augmentations of the 12-lead electrocardiogram. We evaluate representation quality by fine-tuning a linear layer for the downstream task of 23-class diagnosis on the PhysioNet 2020 challenge training data and find that 3KG achieves a $9.1\%$ increase in mean AUえーゆーC over the best self-supervised base… ▽ More

    Submitted 20 September, 2021; v1 submitted 21 April, 2021; originally announced June 2021.

    Comments: 11 pages, 3 figures, paper revision with new set of experiments and comparison to previous methods

  23. arXiv:2105.02489  [pdf, other

    cs.LG cs.CV

    Learning Neighborhood Representation from Multi-Modal Multi-Graph: Image, Text, Mobility Graph and Beyond

    Authors: Tianyuan Huang, Zhecheng Wang, Hao Sheng, Andrew Y. Ng, Ram Rajagopal

    Abstract: Recent urbanization has coincided with the enrichment of geotagged data, such as street view and point-of-interest (POI). Region embedding enhanced by the richer data modalities has enabled researchers and city administrators to understand the built environment, socioeconomics, and the dynamics of cities better. While some efforts have been made to simultaneously use multi-modal inputs, existing m… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

  24. arXiv:2104.08727  [pdf, other

    cs.CL cs.AI

    GooAQ: Open Question Answering with Diverse Answer Types

    Authors: Daniel Khashabi, Amos Ng, Tushar Khot, Ashish Sabharwal, Hannaneh Hajishirzi, Chris Callison-Burch

    Abstract: While day-to-day questions come with a variety of answer types, the current question-answering (QA) literature has failed to adequately address the answer diversity of questions. To this end, we present GooAQ, a large-scale dataset with a variety of answer types. This dataset contains over 5 million questions and 3 million answers collected from Google. GooAQ questions are collected semi-automatic… ▽ More

    Submitted 10 September, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: EMNLP-Findings 2021

  25. arXiv:2104.00793  [pdf, ps, other

    eess.IV cs.CV cs.LG

    Effect of Radiology Report Labeler Quality on Deep Learning Models for Chest X-Ray Interpretation

    Authors: Saahil Jain, Akshay Smit, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: Although deep learning models for chest X-ray interpretation are commonly trained on labels generated by automatic radiology report labelers, the impact of improvements in report labeling on the performance of chest X-ray classification models has not been systematically investigated. We first compare the CheXpert, CheXbert, and VisualCheXbert labelers on the task of extracting accurate chest X-ra… ▽ More

    Submitted 27 November, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: In Neural Information Processing Systems (NeurIPS) Workshop on Data-Centric AI (DCAI)

  26. arXiv:2103.14339  [pdf, other

    cs.CV cs.AI cs.LG

    MedSelect: Selective Labeling for Medical Image Classification Combining Meta-Learning with Deep Reinforcement Learning

    Authors: Akshay Smit, Damir Vrabac, Yujie He, Andrew Y. Ng, Andrew L. Beam, Pranav Rajpurkar

    Abstract: We propose a selective learning method using meta-learning and deep reinforcement learning for medical image interpretation in the setting of limited labeling resources. Our method, MedSelect, consists of a trainable deep learning selector that uses image embeddings obtained from contrastive pretraining for determining which images to label, and a non-parametric selector that uses cosine similarit… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

  27. arXiv:2103.09957  [pdf, other

    cs.CV cs.AI cs.LG

    CheXbreak: Misclassification Identification for Deep Learning Models Interpreting Chest X-rays

    Authors: Emma Chen, Andy Kim, Rayan Krishnan, Jin Long, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: A major obstacle to the integration of deep learning models for chest x-ray interpretation into clinical settings is the lack of understanding of their failure modes. In this work, we first investigate whether there are patient subgroups that chest x-ray models are likely to misclassify. We find that patient age and the radiographic finding of lung lesion, pneumothorax or support devices are stati… ▽ More

    Submitted 20 July, 2021; v1 submitted 17 March, 2021; originally announced March 2021.

    Comments: In Proceedings of the 2021 Conference on Machine Learning for Health Care, 2021. In ACM Conference on Health, Inference, and Learning (ACM-CHIL) Workshop 2021

  28. arXiv:2103.04590  [pdf, other

    cs.CV cs.AI cs.LG

    CheXseen: Unseen Disease Detection for Deep Learning Interpretation of Chest X-rays

    Authors: Siyu Shi, Ishaan Malhi, Kevin Tran, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: We systematically evaluate the performance of deep learning models in the presence of diseases not labeled for or present during training. First, we evaluate whether deep learning models trained on a subset of diseases (seen diseases) can detect the presence of any one of a larger set of diseases. We find that models tend to falsely classify diseases outside of the subset (unseen diseases) as "no… ▽ More

    Submitted 17 May, 2021; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: Accepted at MIDL Conference 2021. Previous version accepted at ACM Conference on Health, Inference, and Learning (ACM-CHIL) Workshop 2021

  29. arXiv:2102.11467  [pdf, other

    eess.IV cs.CV cs.LG

    VisualCheXbert: Addressing the Discrepancy Between Radiology Report Labels and Image Labels

    Authors: Saahil Jain, Akshay Smit, Steven QH Truong, Chanh DT Nguyen, Minh-Thanh Huynh, Mudit Jain, Victoria A. Young, Andrew Y. Ng, Matthew P. Lungren, Pranav Rajpurkar

    Abstract: Automatic extraction of medical conditions from free-text radiology reports is critical for supervising computer vision models to interpret medical images. In this work, we show that radiologists labeling reports significantly disagree with radiologists labeling corresponding chest X-ray images, which reduces the quality of report labels as proxies for image labels. We develop and evaluate methods… ▽ More

    Submitted 15 March, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: Accepted to ACM Conference on Health, Inference, and Learning (ACM-CHIL) 2021

  30. arXiv:2102.10663  [pdf, other

    eess.IV cs.CV cs.LG

    MedAug: Contrastive learning leveraging patient metadata improves representations for chest X-ray interpretation

    Authors: Yen Nhi Truong Vu, Richard Wang, Niranjan Balachandar, Can Liu, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: Self-supervised contrastive learning between pairs of multiple views of the same image has been shown to successfully leverage unlabeled data to produce meaningful visual representations for both natural and medical images. However, there has been limited work on determining how to select pairs for medical images, where availability of patient metadata can be leveraged to improve representations.… ▽ More

    Submitted 17 October, 2021; v1 submitted 21 February, 2021; originally announced February 2021.

  31. arXiv:2102.10484  [pdf, other

    cs.CV cs.AI cs.LG

    CheXseg: Combining Expert Annotations with DNN-generated Saliency Maps for X-ray Segmentation

    Authors: Soham Gadgil, Mark Endo, Emily Wen, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: Medical image segmentation models are typically supervised by expert annotations at the pixel-level, which can be expensive to acquire. In this work, we propose a method that combines the high quality of pixel-level expert annotations with the scale of coarse DNN-generated saliency maps for training multi-label semantic segmentation models. We demonstrate the application of our semi-supervised met… ▽ More

    Submitted 17 May, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

    Comments: Accepted to Medical Imaging with Deep Learning (MIDL) Conference 2021

  32. arXiv:2102.08660  [pdf, other

    eess.IV cs.CV cs.LG

    CheXternal: Generalization of Deep Learning Models for Chest X-ray Interpretation to Photos of Chest X-rays and External Clinical Settings

    Authors: Pranav Rajpurkar, Anirudh Joshi, Anuj Pareek, Andrew Y. Ng, Matthew P. Lungren

    Abstract: Recent advances in training deep learning models have demonstrated the potential to provide accurate chest X-ray interpretation and increase access to radiology expertise. However, poor generalization due to data distribution shifts in clinical settings is a key barrier to implementation. In this study, we measured the diagnostic performance for 8 different chest X-ray models when applied to (1) s… ▽ More

    Submitted 20 February, 2021; v1 submitted 17 February, 2021; originally announced February 2021.

    Comments: Accepted to ACM Conference on Health, Inference, and Learning (ACM-CHIL) 2021. arXiv admin note: substantial text overlap with arXiv:2011.06129

  33. arXiv:2101.06871  [pdf, other

    cs.CV cs.AI cs.LG

    CheXtransfer: Performance and Parameter Efficiency of ImageNet Models for Chest X-Ray Interpretation

    Authors: Alexander Ke, William Ellsworth, Oishi Banerjee, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: Deep learning methods for chest X-ray interpretation typically rely on pretrained models developed for ImageNet. This paradigm assumes that better ImageNet architectures perform better on chest X-ray tasks and that ImageNet-pretrained weights provide a performance boost over random initialization. In this work, we compare the transfer performance and parameter efficiency of 16 popular convolutiona… ▽ More

    Submitted 20 February, 2021; v1 submitted 17 January, 2021; originally announced January 2021.

  34. arXiv:2011.07227  [pdf, other

    cs.CV cs.AI cs.LG

    OGNet: Towards a Global Oil and Gas Infrastructure Database using Deep Learning on Remotely Sensed Imagery

    Authors: Hao Sheng, Jeremy Irvin, Sasankh Munukutla, Shawn Zhang, Christopher Cross, Kyle Story, Rose Rustowicz, Cooper Elsworth, Zutao Yang, Mark Omara, Ritesh Gautam, Robert B. Jackson, Andrew Y. Ng

    Abstract: At least a quarter of the warming that the Earth is experiencing today is due to anthropogenic methane emissions. There are multiple satellites in orbit and planned for launch in the next few years which can detect and quantify these emissions; however, to attribute methane emissions to their sources on the ground, a comprehensive database of the locations and characteristics of emission sources w… ▽ More

    Submitted 14 November, 2020; originally announced November 2020.

    Comments: Tackling Climate Change with Machine Learning at NeurIPS 2020 (Spotlight talk)

  35. arXiv:2011.06129  [pdf, other

    eess.IV cs.CV cs.LG

    CheXphotogenic: Generalization of Deep Learning Models for Chest X-ray Interpretation to Photos of Chest X-rays

    Authors: Pranav Rajpurkar, Anirudh Joshi, Anuj Pareek, Jeremy Irvin, Andrew Y. Ng, Matthew Lungren

    Abstract: The use of smartphones to take photographs of chest x-rays represents an appealing solution for scaled deployment of deep learning models for chest x-ray interpretation. However, the performance of chest x-ray algorithms on photos of chest x-rays has not been thoroughly investigated. In this study, we measured the diagnostic performance for 8 different chest x-ray models when applied to photos of… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2020 - Extended Abstract

  36. arXiv:2011.05479  [pdf, other

    cs.CV cs.LG eess.IV

    ForestNet: Classifying Drivers of Deforestation in Indonesia using Deep Learning on Satellite Imagery

    Authors: Jeremy Irvin, Hao Sheng, Neel Ramachandran, Sonja Johnson-Yu, Sharon Zhou, Kyle Story, Rose Rustowicz, Cooper Elsworth, Kemen Austin, Andrew Y. Ng

    Abstract: Characterizing the processes leading to deforestation is critical to the development and implementation of targeted forest conservation and management policies. In this work, we develop a deep learning model called ForestNet to classify the drivers of primary forest loss in Indonesia, a country with one of the highest deforestation rates in the world. Using satellite imagery, ForestNet identifies… ▽ More

    Submitted 10 November, 2020; originally announced November 2020.

    Comments: Tackling Climate Change with Machine Learning at NeurIPS 2020

  37. arXiv:2010.15269  [pdf, other

    eess.IV cs.CV cs.LG

    GloFlow: Global Image Alignment for Creation of Whole Slide Images for Pathology from Video

    Authors: Viswesh Krishna, Anirudh Joshi, Philip L. Bulterys, Eric Yang, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: The application of deep learning to pathology assumes the existence of digital whole slide images of pathology slides. However, slide digitization is bottlenecked by the high cost of precise motor stages in slide scanners that are needed for position information used for slide stitching. We propose GloFlow, a two-stage method for creating a whole slide image using optical flow-based image registra… ▽ More

    Submitted 12 November, 2020; v1 submitted 28 October, 2020; originally announced October 2020.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2020 - Extended Abstract

  38. arXiv:2010.05352  [pdf, other

    cs.CV cs.AI cs.LG

    MoCo-CXR: MoCo Pretraining Improves Representation and Transferability of Chest X-ray Models

    Authors: Hari Sowrirajan, Jingbo Yang, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: Contrastive learning is a form of self-supervision that can leverage unlabeled data to produce pretrained models. While contrastive learning has demonstrated promising results on natural image classification tasks, its application to medical imaging tasks like chest X-ray interpretation has been limited. In this work, we propose MoCo-CXR, which is an adaptation of the contrastive learning method M… ▽ More

    Submitted 17 May, 2021; v1 submitted 11 October, 2020; originally announced October 2020.

    Comments: Accepted at Medical Imaging with Deep Learning (MIDL) Conference 2021

  39. arXiv:2010.04715  [pdf, other

    cs.LG stat.AP stat.ML

    Short-Term Solar Irradiance Forecasting Using Calibrated Probabilistic Models

    Authors: Eric Zelikman, Sharon Zhou, Jeremy Irvin, Cooper Raterink, Hao Sheng, Anand Avati, Jack Kelly, Ram Rajagopal, Andrew Y. Ng, David Gagne

    Abstract: Advancing probabilistic solar forecasting methods is essential to supporting the integration of solar energy into the electricity grid. In this work, we develop a variety of state-of-the-art probabilistic models for forecasting solar irradiance. We investigate the use of post-hoc calibration techniques for ensuring well-calibrated probabilistic predictions. We train and evaluate the models using p… ▽ More

    Submitted 14 October, 2020; v1 submitted 9 October, 2020; originally announced October 2020.

  40. arXiv:2009.08123  [pdf, other

    cs.CV cs.AI cs.LG

    DLBCL-Morph: Morphological features computed using deep learning for an annotated digital DLBCL image set

    Authors: Damir Vrabac, Akshay Smit, Rebecca Rojansky, Yasodha Natkunam, Ranjana H. Advani, Andrew Y. Ng, Sebastian Fernandez-Pol, Pranav Rajpurkar

    Abstract: Diffuse Large B-Cell Lymphoma (DLBCL) is the most common non-Hodgkin lymphoma. Though histologically DLBCL shows varying morphologies, no morphologic features have been consistently demonstrated to correlate with prognosis. We present a morphologic analysis of histology sections from 209 DLBCL cases with associated clinical and cytogenetic data. Duplicate tissue core sections were arranged in tiss… ▽ More

    Submitted 24 September, 2020; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Corrections to folder structure figure

  41. arXiv:2007.06199  [pdf, other

    eess.IV cs.CV cs.LG

    CheXphoto: 10,000+ Photos and Transformations of Chest X-rays for Benchmarking Deep Learning Robustness

    Authors: Nick A. Phillips, Pranav Rajpurkar, Mark Sabini, Rayan Krishnan, Sharon Zhou, Anuj Pareek, Nguyet Minh Phu, Chris Wang, Mudit Jain, Nguyen Duong Du, Steven QH Truong, Andrew Y. Ng, Matthew P. Lungren

    Abstract: Clinical deployment of deep learning algorithms for chest x-ray interpretation requires a solution that can integrate into the vast spectrum of clinical workflows across the world. An appealing approach to scaled deployment is to leverage the ubiquity of smartphones by capturing photos of x-rays to share with clinicians using messaging services like WhatsApp. However, the application of chest x-ra… ▽ More

    Submitted 11 December, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

  42. arXiv:2006.03680  [pdf, other

    stat.ML cs.CV cs.LG

    Evaluating the Disentanglement of Deep Generative Models through Manifold Topology

    Authors: Sharon Zhou, Eric Zelikman, Fred Lu, Andrew Y. Ng, Gunnar Carlsson, Stefano Ermon

    Abstract: Learning disentangled representations is regarded as a fundamental task for improving the generalization, robustness, and interpretability of generative models. However, measuring disentanglement has been challenging and inconsistent, often dependent on an ad-hoc external model or specific to a certain dataset. To address this, we present a method for quantifying disentanglement that only uses the… ▽ More

    Submitted 17 March, 2021; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: Published at ICLR 2021

  43. arXiv:2005.03743  [pdf, other

    cs.CV

    Effective Data Fusion with Generalized Vegetation Index: Evidence from Land Cover Segmentation in Agriculture

    Authors: Hao Sheng, Xiao Chen, Jingyi Su, Ram Rajagopal, Andrew Ng

    Abstract: How can we effectively leverage the domain knowledge from remote sensing to better segment agriculture land cover from satellite images? In this paper, we propose a novel, model-agnostic, data-fusion approach for vegetation-related computer vision tasks. Motivated by the various Vegetation Indices (VIs), which are introduced by domain experts, we systematically reviewed the VIs that are widely use… ▽ More

    Submitted 7 May, 2020; originally announced May 2020.

    Comments: CVPR 2020 - Vision for Agriculture; https://www.agriculture-vision.com

  44. arXiv:2004.09754  [pdf, other

    cs.CV cs.LG eess.IV

    The 1st Agriculture-Vision Challenge: Methods and Results

    Authors: Mang Tik Chiu, Xingqian Xu, Kai Wang, Jennifer Hobbs, Naira Hovakimyan, Thomas S. Huang, Honghui Shi, Yunchao Wei, Zilong Huang, Alexander Schwing, Robert Brunner, Ivan Dozier, Wyatt Dozier, Karen Ghandilyan, David Wilson, Hyunseong Park, Junhee Kim, Sungho Kim, Qinghui Liu, Michael C. Kampffmeyer, Robert Jenssen, Arnt B. Salberg, Alexandre Barbosa, Rodrigo Trevisan, Bingchen Zhao , et al. (17 additional authors not shown)

    Abstract: The first Agriculture-Vision Challenge aims to encourage research in developing novel and effective algorithms for agricultural pattern recognition from aerial images, especially for the semantic segmentation task associated with our challenge dataset. Around 57 participating teams from various countries compete to achieve state-of-the-art in aerial agriculture semantic segmentation. The Agricultu… ▽ More

    Submitted 23 April, 2020; v1 submitted 21 April, 2020; originally announced April 2020.

    Comments: CVPR 2020 Workshop

  45. arXiv:2004.09167  [pdf, other

    cs.CL cs.IR cs.LG

    CheXbert: Combining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERT

    Authors: Akshay Smit, Saahil Jain, Pranav Rajpurkar, Anuj Pareek, Andrew Y. Ng, Matthew P. Lungren

    Abstract: The extraction of labels from radiology text reports enables large-scale training of medical imaging models. Existing approaches to report labeling typically rely either on sophisticated feature engineering based on medical domain knowledge or manual annotations by experts. In this work, we introduce a BERT-based approach to medical image report labeling that exploits both the scale of available r… ▽ More

    Submitted 18 October, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

    Comments: Accepted to EMNLP 2020

  46. arXiv:2003.03785  [pdf, ps, other

    cs.FL cs.AI cs.DB

    Dependently Typed Knowledge Graphs

    Authors: Zhangsheng Lai, Aik Beng Ng, Liang Ze Wong, Simon See, Shaowei Lin

    Abstract: Reasoning over knowledge graphs is traditionally built upon a hierarchy of languages in the Semantic Web Stack. Starting from the Resource Description Framework (RDF) for knowledge graphs, more advanced constructs have been introduced through various syntax extensions to add reasoning capabilities to knowledge graphs. In this paper, we show how standardized semantic web technologies (RDF and its q… ▽ More

    Submitted 8 March, 2020; originally announced March 2020.

  47. arXiv:2002.11379  [pdf, other

    eess.IV cs.CV cs.LG

    CheXpedition: Investigating Generalization Challenges for Translation of Chest X-Ray Algorithms to the Clinical Setting

    Authors: Pranav Rajpurkar, Anirudh Joshi, Anuj Pareek, Phil Chen, Amirhossein Kiani, Jeremy Irvin, Andrew Y. Ng, Matthew P. Lungren

    Abstract: Although there have been several recent advances in the application of deep learning algorithms to chest x-ray interpretation, we identify three major challenges for the translation of chest x-ray algorithms to the clinical setting. We examine the performance of the top 10 performing models on the CheXpert challenge leaderboard on three tasks: (1) TB detection, (2) pathology detection on photos of… ▽ More

    Submitted 11 March, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

    Comments: Accepted as workshop paper at ACM Conference on Health, Inference, and Learning (CHIL) 2020

  48. arXiv:2002.02917  [pdf, other

    cs.CV cs.LG

    Data augmentation with Mobius transformations

    Authors: Sharon Zhou, Jiequan Zhang, Hang Jiang, Torbjorn Lundh, Andrew Y. Ng

    Abstract: Data augmentation has led to substantial improvements in the performance and generalization of deep models, and remain a highly adaptable method to evolving model architectures and varying amounts of data---in particular, extremely scarce amounts of available training data. In this paper, we present a novel method of applying Mobius transformations to augment input images during training. Mobius t… ▽ More

    Submitted 7 June, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

  49. arXiv:1910.03225  [pdf, other

    cs.LG stat.ML

    NGBoost: Natural Gradient Boosting for Probabilistic Prediction

    Authors: Tony Duan, Anand Avati, Daisy Yi Ding, Khanh K. Thai, Sanjay Basu, Andrew Y. Ng, Alejandro Schuler

    Abstract: We present Natural Gradient Boosting (NGBoost), an algorithm for generic probabilistic prediction via gradient boosting. Typical regression models return a point estimate, conditional on covariates, but probabilistic regression models output a full probability distribution over the outcome space, conditional on the covariates. This allows for predictive uncertainty estimation -- crucial in applica… ▽ More

    Submitted 9 June, 2020; v1 submitted 8 October, 2019; originally announced October 2019.

    Comments: Accepted for ICML 2020

  50. arXiv:1910.01335  [pdf, other

    cs.CL

    Topic-aware Pointer-Generator Networks for Summarizing Spoken Conversations

    Authors: Zhengyuan Liu, Angela Ng, Sheldon Lee, Ai Ti Aw, Nancy F. Chen

    Abstract: Due to the lack of publicly available resources, conversation summarization has received far less attention than text summarization. As the purpose of conversations is to exchange information between at least two interlocutors, key information about a certain topic is often scattered and spanned across multiple utterances and turns from different speakers. This phenomenon is more pronounced during… ▽ More

    Submitted 3 October, 2019; originally announced October 2019.

    Comments: To appear in ASRU2019