Skip to content

Mammography AI in 147 Clinics Results in Increased Cancer Detection Rate

May 6, 2023
2 min

SBI-ACR BREAST IMAGING SYMPOSIUM 2023
Leeann Louis, PhD

Purpose

Though many studies have documented performance improvements with mammography AI, little data has been reported from large scale deployment. Such data would likely help radiologists better understand whether AI will generalize to their clinical practice. The purpose of this study was to evaluate performance changes of radiologists in a large-scale deployment of categorical AI used in screening mammography.

Materials and Methods

An FDA-cleared categorical AI product was deployed at 147 clinical sites across the USA. The AI provides lesion localization and categorical levels of suspicion for cancer on digital breast tomosynthesis screening mammograms. Data was collected from sites with three months of use of the AI, and on screening mammograms that have at least 30 days of follow-up (n=200,342). Data included the AI’s suspicion categories, radiologists’ BIRADS assessments, and available biopsy outcomes. (The suspicion categories are Minimal, Low, Intermediate and High, corresponding to ~25%, ~50%, ~20% and ~5% of a screening population.) For comparison, mammograms from the 14-month period prior to AI deployment were also retrospectively evaluated by AI. Performance metrics of cancer detection rate (CDR), abnormal interpretation or recall rate (RR) and positive predictive value (PPV1) from 185 radiologists were compared before and after deployment. Given the AI deployment was recent, there was limited follow-up time for cancer diagnosis so,for consistency, only cancers identified within 30 days of screening were counted. Significance was calculated using a Chi-squared test.

Results

Radiologists’ CDR increased by 33% after deployment, from 1.5 to 2.0 per 1000 mammograms (p< 0.05).Given the short 30 days follow-up period, the low CDRs were expected. The increase was primarily driven by a higher CDR in the High suspicion category going from 22.9 to 28.9 per 1000 mammograms. There was also a slight increase in RR of 11.8% to 12.5% (p< 0.001) with a slight increase in PPV1 (1.3% before vs. 1.6% after deployment).

Conclusion

These initial results show that radiologists are detecting more cancers soon after the deployment of an AI tool that provides suspicion categories as well as lesion localization. An increase in recall rate was expected as radiologists become accustomed to the product, similar to what has been observed when other new technologies such as DBT have been adopted. Our results provide early encouraging evidence that radiologists in clinical practice will benefit at scale, though further monitoring is ongoing and will be reported.

Clinical Relevance

AI for mammography using categorical suspicion levels and lesion localization can help radiologists detect more cancers.

Figure
Radiologist performance measures by AI categories in 147 clinical sites. Black lines indicate the 95% confidence intervals. CDR overall increases post deployment, primarily due to an increase in CDR for the High suspicion exams.

Disclaimers: DeepHealth Prostate is manufactured as Quantib Prostate by Quantib BV for DeepHealth Inc., DeepHealth Lung is manufactured as Veye Lung Nodules by Aidence BV for DeepHealth Inc., DeepHeath Lung Tracker is manufactured as Veye Clinic by Aidence BV for DeepHealth Inc. and DeepHealth Brain is manufactured as Quantib ND by Quantib BV for DeepHealth Inc. Not all products and functionalities are commercially available in all countries. For clearance and commercial availability in your geography of functionalities listed and compatibility with other systems, please contact us