Detecting “missed” breast cancers: A comparison


View content online at: http://www.appliedradiology.com/Issues/2002/10/Supplements/Detecting-“missed”-breast-cancers---A-comparison.aspx

Abstract:  Dr. Shile is Medical Director and Dr. Guingrich is a Breast Imager at the Susan G. Komen Breast Center, Peoria, IL. Breast cancer is among the most common forms of cancer in American women. Second only to skin cancer, it accounts for one of every three cancer diagnoses. Accordin

Loading...

Dr. Shile is Medical Director and Dr. Guingrich is a Breast Imager at the Susan G. Komen Breast Center, Peoria, IL.

Breast cancer is among the most common forms of cancer in American women. Second only to skin cancer, it accounts for one of every three cancer diagnoses. According to data from the American Cancer Society, the year 2001 witnessed approximately 192,200 new cases of invasive breast cancer, nearly 47,100 additional cases of in situ breast cancer, and some 40,200 breast cancer deaths. 1 Only lung cancer accounts for more cancer deaths in women.

Early detection is critical to reducing breast cancer mortality and improving patient outcomes. Screening mammography is the best method available for detecting breast cancer early, when treatment is most effective. With a reported sensitivity of 70% to 90%, 1-9 mammography is highly effective. It is, nonetheless, imperfect. Intensive research has focused on ways to improve detection of subtle mammographic abnormalities that may signal the earliest stages of tumor development, but may be overlooked during visual examination.

Evaluation of mammograms by more than one radiologist--so-called double reading--has been shown to improve sensitivity substantially, by as much as 15%, 10-20 but at considerable time and expense. Computer-aided detection (CAD) offers an alternative method of double reading that may be both useful 21-23 and efficient 22,24 in reducing the rate of false-negative mammographic interpretations.

Computer-aided detection systems use mathematical algorithms to analyze mammograms for possible abnormalities, including masses, microcalcifications, distortions in breast architecture, and asymmetric densities. After an initial examination of the mammographic images, the radiologist reviews potentially suspicious areas highlighted by the CAD software and determines whether they warrant further work-up. In this way, the CAD system functions as an automated second reader that confirms the presence of suspicious areas of breast tissue or identifies possible abnormalities that might otherwise be missed.

Although commercially available CAD systems operate on similar principles, there are differences in their design and performance. How these differences affect their relative performance in the detection of biopsy-proven breast cancer has been compared for two CAD systems in a study that used the mammograms that led to the diagnosis of each case. 25 The study reported here compared the same two CAD systems, evaluating their ability to detect breast cancers that had been missed during conventional mammographic interpretation.

Methods and materials

Consecutive cases of breast cancer were identified from five private-practice mammography facilities in central Illinois between April 1998 and June 2000. Patient volume at the mammography centers covered a wide range, from low to high volume. All facilities and radiologists were certified under the Mammography Quality Standards Act (MQSA).

Clinical records from each case were examined to determine whether the patients had had a negative screening mammogram within 2 years prior to the diagnosis of cancer. A total of 93 such cases were identified for which a prior mammogram was available.

Two MQSA-certified radiologists at the Susan G. Komen Breast Center, Peoria, IL, examined the 93 prior mammograms to identify findings that warranted any type of follow-up, including comparison with prior films, acquisition of additional mammographic views, and biopsy. The radiologists, who worked independently of one another, were not informed about the specific clinical history associated with the prior mammograms, but were aware that, in each case, cancer had eventually been detected.

The mammographic films were also processed by two CAD systems: Second Look ® , version 4.0 (CADx Medical Systems, Quebec, Canada) and ImageChecker ® , version 3.1 (R2 Technology, Los Altos, CA). At the time of the study, version 4.0 of Second Look had just been approved by the Food and Drug Administration (FDA) for sale in the United States. Version 3.1 of ImageChecker had not yet been approved by the FDA, but was the latest version of the software in use in Canada.

One mammographer later reviewed the exams, along with imaging and clinical information from the time of cancer diagnosis, in order to identify whether there was evidence of breast cancer on prior mammography. The CAD output was also examined to identify cases in which CAD had accurately highlighted the site of subsequent cancer. The CAD output and the radiologists' interpretations of prior exams were correlated. These correlations were used to assess CAD-identified imaging features that would lead to further patient work-up.

Results

There was retrospective evidence of cancer in 56% (52 of 93) of the prior mammograms. Table 1 outlines the characteristics of cancers that could be detected on retrospective review (missed cancers) and those that could not (true negatives).

Cases were grouped according to the number of radiologists who indicated a need for further work-up after reviewing each mammogram. The degree of agreement between the two radiologists was considered an indication of how a single radiologist might respond to the identification by CAD of a suspicious mammographic finding.

When abnormalities were identified by both radiologists, the likelihood was considered to be high that identification of these same findings by CAD would prompt a single radiologist to recall the patient for additional follow-up.

Findings identified by only one radiologist signaled a medium-to-low likelihood that a similar finding on CAD would prompt recall of the patient for further work-up. Findings identified by neither radiologist were considered unlikely to prompt patient recall, regardless of the CAD results.

Overall, Second Look (SL) correctly identified 60% (31 of 52) of the missed cancers, and ImageChecker (IC) identified 52% (27 of 52) (Table 2). Of the high-likelihood findings, SL identified 12 of 12 calcifications, 4 of 11 masses, and 1 of 1 distortion. By comparison, IC identified 9 of 12 calcifications, 8 of 11 masses, and 1 of 1 distortion. Of the medium-to-low likelihood findings, SL identified 4 of 4 calcifications, 4 of 8 masses, and 1 of 2 distortions, whereas IC identified 1 of 4 calcifications, 4 of 8 masses, and 0 of 2 distortions.

Discussion

Particularly in the earliest stages of breast cancer, abnormalities on screening mammography can be very subtle. Mammographic interpretation by more than one radiologist has been shown to improve sensitivity, but at a substantially increased cost and investment of radiologist time. 10-20

Computer-aided detection systems represent an alternative approach to improving the diagnostic accuracy and consistency of mammographic interpretation. In essence, the CAD system offers an automated "second opinion" that the radiologist can consider when reviewing the mammogram and arriving at a diagnosis.

Recent research suggests that CAD systems offer information that can influence the interpretation of mammograms in clinically important ways. A study by Freer and Ulissey 21 compared the interpretation of nearly 13,000 screening mammograms by radiologists alone and with the assistance of CAD. They found that nearly 20% more cancers were detected with the assistance of CAD and that a larger percentage were early-stage malignancies (78%, as compared with 73% of those identified by radiologists alone). 21

A study by Burhenne et al 22 examined whether the use of CAD could reduce the false-negative rate in screening mammography. Investigators from 13 mammographic facilities identified a total of 427 patients whose screening mammograms had led to the detection of biopsy-proven cancer, and who had a prior mammogram on file. Blinded retrospective review showed that mammographic interpretation by the original radiologists had a false-negative rate of 21%, and that CAD prompting had the potential to reduce that rate by 77%. 22

In a related study, Birdwell et al 23 found that of the 115 cancers missed on screening mammography, 30% were calcifications, of which 49% were clustered or pleomorphic. The remaining 70% were mass lesions, of which 40% were spiculated or irregular. The CAD software marked 86% of the missed calcifications and 73% of the missed masses.

The study reported here compares the performance of two different CAD systems for the detection of breast cancers missed by conventional evaluation of mammographic films. Because commercially available CAD systems use proprietary mathematical algorithms to distinguish normal from abnormal breast tissue, it is possible they differ in their abilities to detect breast cancer.

The study's findings confirm previous evidence that CAD markedly improves the detection of suspicious breast lesions that warrant recall of patients for further work-up. Computer-aided detection was particularly effective in highlighting calcifications, which are often the earliest signs of breast cancer. The study's findings also suggest, however, that there may be clinically meaningful differences in the accuracy with which CAD systems detect both calcifications and masses.

Overall, the performance of Second Look was superior to that of ImageChecker. Second Look correctly identified 60% of the missed cancers, and ImageChecker identified 52%. Second Look was also better at detecting calcifications in "high-likelihood" cases, where such information would be most likely to prompt further work-up of patients. ImageChecker detected more masses in this high-likelihood group, and the two systems detected the same number of distortions in breast architecture. In the medium-to-low likelihood cases, Second Look identified more distortions and calcifications than ImageChecker. The two CAD systems were equally effective in highlighting masses.

In the case of high-likelihood lesions, CAD prompting could be expected to result in the recall of all patients for further work-up. Only a portion of patients with medium-to-low likelihood findings would be recalled, however. Thus, of the 52 "missed" cancers, prompting by Second Look would have contributed to the recall of 33% (17 of 52) to 50% (26 of 52) of patients for further work-up. By comparison, ImageChecker would have contributed to the recall of between 35% (18 of 52) to 44% (23 of 52) of patients.

Conclusion

Computer-aided detection systems improve the accuracy and consistency of mammographic interpretation. In a side-by-side comparison of two such systems, Second Look's overall performance was better than that of ImageChecker. Additional research could further improve the mathematical algorithms that CAD systems use to distinguish normal from abnormal breast tissue.