Published 03 Oct 2018

Deep learning in chest radiography: Detection of findings and presence of change

Author: Ramandeep Singh 1, Mannudeep K. Kalra 1, Chayanin Nitiwarangkul 1,2, John A. Patti 1, Fatemeh Homayounieh 1, Atul Padole 1, Pooja Rao 3, Preetham Putha 3, Victorine V. Muse 1, Amita Sharma 1, Subba R. Digumarthy 11

SHARE

https://cms.qure.ai

Back

Background
Deep learning (DL) based solutions have been proposed for interpretation of several imaging modalities including radiography, CT, and MR. For chest radiographs, DL algorithms have found success in the evaluation of abnormalities such as lung nodules, pulmonary tuberculosis, cystic fibrosis, pneumoconiosis, and location of peripherally inserted central catheters. Chest radiography represents the most commonly performed radiological test for a multitude of non-emergent and emergent clinical indications. This study aims to assess accuracy of deep learning (DL) algorithm for detection of abnormalities on routine frontal chest radiographs (CXR), and assessment of stability or change in findings over serial radiographs.
Methods and Findings
We processed 874 de-identified frontal CXR from 724 adult patients (> 18 years) with DL (Qure AI). Scores and prediction statistics from DL were generated and recorded for the presence of pulmonary opacities, pleural effusions, hilar prominence, and enlarged cardiac silhouette. To establish a standard of reference (SOR), two thoracic radiologists assessed all CXR for these abnormalities. Four other radiologists (test radiologists), unaware of SOR and DL findings, independently assessed the presence of radiographic abnormalities. A total 724 radiographs were assessed for detection of findings. A subset of 150 radiographs with follow up examinations was used to asses change over time. Data were analyzed with receiver operating characteristics analyses and post-hoc power analysis.
Results
About 42% (305/ 724) CXR had no findings according to SOR; single and multiple abnormalities were seen in 23% (168/724) and 35% (251/724) of CXR. There was no statistical difference between DL and SOR for all abnormalities (p = 0.2–0.8). The area under the curve (AUC) for DL and test radiologists ranged between 0.837–0.929 and 0.693–0.923, respectively. DL had lowest AUC (0.758) for assessing changes in pulmonary opacities over follow up CXR. Presence of chest wall implanted devices negatively affected the accuracy of DL algorithm for evaluation of pulmonary and hilar abnormalities.
Conclusions
DL algorithm can aid in interpretation of CXR findings and their stability over follow up CXR. However, in its present version, it is unlikely to replace radiologists due to its limited specificity for categorizing specific findings.

Authors

Ramandeep Singh 1, Mannudeep K. Kalra 1, Chayanin Nitiwarangkul 1,2, John A. Patti 1, Fatemeh Homayounieh 1, Atul Padole 1, Pooja Rao 3, Preetham Putha 3, Victorine V. Muse 1, Amita Sharma 1, Subba R. Digumarthy 11

Citation

1. Department of Radiology 2. Massachusetts General Hospital 3. Boston 4. Massachusetts 5. United States of America 6. Harvard Medical School 7. Boston 8. Massachusetts 9. United States of America 10. Division of Diagnostic Radiology 11. Department of Diagnostic and Therapeutic Radiology 12. Faculty of Medicine 13. Ramathibodi Hospital 14. Mahidol University 15. Bangkok 16. Thailand 17. Qure.ai 18. 101 Raheja Titanium 19. Goregaon East 20. Mumbai 21. India

Share this publication