Skip to main content

CDD-CESM | Categorized Digital Database for Low energy and Subtracted Contrast Enhanced Spectral Mammography images

DOI: 10.7937/29kw-ae92 | Page Accessibility: Public | Collection

Collection Snapshot
Location Species Subjects Data Types Cancer Types Supporting Data Status Updated
Breast Human 326 MG Breast Cancer Clinical, Image Analyses Public, Complete 2023/09/13

Summary

Deep learning (DL) has a promising potential in reducing the workload of radiologists and helping them provide a more accurate diagnosis. However, fully annotated and large-sized datasets are required. This dataset is a collection of 2,006 high-resolution Contrast-enhanced spectral mammography (CESM) images with annotations and medical reports. 

Acquisition protocol: 

CESM is done using the standard digital mammography equipment, with additional software that performs dual-energy image acquisition. Two minutes after intravenously injecting the patient with non-ionic low-osmolar iodinated contrast material (dose: 1.5 mL/kg), craniocaudal (CC) and mediolateral oblique (MLO) views are obtained. Each view comprises two exposures, one with low energy (peak kilo-voltage values ranging from 26 to 31kVp) and one with high energy (45 to 49 kVp). Low and high-energy images are then recombined and subtracted through appropriate image processing to suppress the background breast parenchyma. A complete examination is carried out in about 5-6 minutes.

Image preprocessing:

The images were converted from DICOM to JPEG using RadiAnt with best 100% image quality (lossless).  They have an average of 2355 x 1315 pixels.

Supporting data:

Full medical reports are also provided for each case (DOCX) along with manual segmentation annotation for the abnormal findings in each image (CSV file).  Each image with its corresponding manual annotation (breast composition, mass shape, mass margin, mass density, architectural distortion, asymmetries, calcification type, calcification distribution, mass enhancement pattern, non-mass enhancement pattern, non-mass enhancement distribution, and overall BIRADS assessment) is compiled into 1 Excel file. 

https://www.robots.ox.ac.uk/~vgg/software/via/via.html was used for the segmentation annotation.  It can be used to show the annotations on the images by clicking on Annotation--> import annotations (from csv), and then proceeding to upload any image to view the annotations drawn over it.  Moreover, a helper repository is created to help with pre-processing, model training, model evaluation, and segmentation annotation loading: https://github.com/omar-mohamed/CDD-CESM-Dataset

Regarding the tabs on the annotations Excel file, these are commonly used radiological descriptors as defined by the American College of Radiology 2013 lexicon.

Acknowledgements

We would like to acknowledge the individuals and institutions that have provided data for this collection:

  • National Cancer Institute, Cairo University, Cairo, Egypt : Special thanks to Dr. Rana Khaled, M.Sc, Prof. Maha Helal, MD, Prof. Omnia Mokhtar, MD and Dr. Hebatalla El Kassas, MD from the Department of Radiology.
  • Faculty of Computers and Artificial Intelligence, Cairo University, Cairo, Egypt – Special thanks to Omar Alfarghaly, Prof. Abeer Elkorany, and Prof. Aly Fahmy from the Department of Computer Science.

Data Access

Click the Versions tab for more info about data releases.

Title Data Type Format Access Points Studies Series Images License
Low Energy Images and Subtracted Images JPG and ZIP CC BY 4.0
Clinical data DOCX and ZIP CC BY 4.0
Radiology hand drawn segmentations v2 CC BY 4.0
Radiology manual annotations CC BY 4.0

Additional Resources for this Dataset

The following external resources have been made available by the data submitters.  These are not hosted or supported by TCIA, but may be useful to researchers utilizing this collection.

Please contact [email protected]  with any questions regarding usage.

Citations & Data Usage Policy

Data Citation

Khaled R., Helal M., Alfarghaly O., Mokhtar O., Elkorany A., El Kassas H., Fahmy A. Categorized Digital Database for Low energy and Subtracted Contrast Enhanced Spectral Mammography images [Dataset]. (2021) The Cancer Imaging Archive. DOI:  10.7937/29kw-ae92 

Publication Citation

Khaled, R., Helal, M., Alfarghaly, O., Mokhtar, O., Elkorany, A., El Kassas, H., & Fahmy, A. Categorized contrast enhanced mammography dataset for diagnostic and artificial intelligence research. (2022) Scientific Data, Volume 9, Issue 1. DOI: 10.1038/s41597-022-01238-0

TCIA Citation

Clark, K., Vendt, B., Smith, K., Freymann, J., Kirby, J., Koppel, P., Moore, S., Phillips, S., Maffitt, D., Pringle, M., Tarbox, L., & Prior, F. (2013). The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository. In Journal of Digital Imaging (Vol. 26, Issue 6, pp. 1045–1057). Springer Science and Business Media LLC. https://doi.org/10.1007/s10278-013-9622-7

Other Publications Using This Data

TCIA maintains a list of publications which leverage TCIA data. If you have a manuscript you’d like to add please contact the TCIA Helpdesk.