Long-Tailed Classification of Thorax Diseases on Chest X-Ray: A New Benchmark Study

Holste, Gregory; Wang, Song; Jiang, Ziyu; Shen, Thomas C.; Shih, George; Summers, Ronald M.; Peng, Yifan; Wang, Zhangyang

doi:10.1007/978-3-031-17027-0_3

arXiv:2208.13365 (cs)

[Submitted on 29 Aug 2022]

Abstract: Imaging exams, such as chest radiography, will yield a small set of common findings and a much larger set of uncommon findings. While a trained radiologist can learn the visual presentation of rare conditions by studying a few representative examples, teaching a machine to learn from such a "long-tailed" distribution is much more difficult, as standard methods would be easily biased toward the most frequent classes. In this paper, we present a comprehensive benchmark study of the long-tailed learning problem in the specific domain of thorax diseases on chest X-rays. We focus on learning from naturally distributed chest X-ray data, optimizing classification accuracy over not only the common "head" classes, but also the rare yet critical "tail" classes. To accomplish this, we introduce a challenging new long-tailed chest X-ray benchmark to facilitate research on developing long-tailed learning methods for medical image classification. The benchmark consists of two chest X-ray datasets for 19- and 20-way thorax disease classification, containing classes with as many as 53,000 and as few as 7 labeled training images. We evaluate both standard and state-of-the-art long-tailed learning methods on this new benchmark, analyzing which aspects of these methods are most beneficial for long-tailed medical image classification and summarizing insights for future algorithm design. The datasets, trained models, and code are available at this https URL.
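To make the head-versus-tail imbalance concrete, the sketch below computes inverse-frequency loss weights, one common re-weighting baseline for long-tailed data. It is an illustration only and is not taken from the paper or its released code: the function name `inverse_frequency_weights` and the middle class count of 1,200 are assumptions, and only the extremes (53,000 and 7 training images) come from the abstract.

```python
def inverse_frequency_weights(class_counts):
    """Return per-class loss weights proportional to 1/count, scaled to a mean of 1."""
    if any(c <= 0 for c in class_counts):
        raise ValueError("every class needs at least one training example")
    raw = [1.0 / c for c in class_counts]
    scale = len(raw) / sum(raw)  # normalize so the average weight is 1
    return [w * scale for w in raw]


# Hypothetical three-class training set: a head class, a mid class, a tail class.
# Only 53000 and 7 appear in the abstract; 1200 is made up for illustration.
counts = [53000, 1200, 7]
weights = inverse_frequency_weights(counts)

for count, weight in zip(counts, weights):
    print(f"count={count:>6}  weight={weight:.4f}")
```

Under this scheme the tail class is weighted 53,000 / 7 times more heavily than the head class, which shows how extreme the imbalance is and why naive re-weighting alone may be unstable at this scale.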

Comments:	DALI 2022 (MICCAI workshop)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2208.13365 [cs.CV]
	(or arXiv:2208.13365v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2208.13365
Related DOI:	https://doi.org/10.1007/978-3-031-17027-0_3

Submission history

From: Gregory Holste [view email]
[v1] Mon, 29 Aug 2022 04:34:15 UTC (3,083 KB)
