{"id":3016,"date":"2023-08-31T18:52:43","date_gmt":"2023-08-31T18:52:43","guid":{"rendered":"https:\/\/cm.vastapps.dev\/tcia-collection\/her2-tumor-rois\/"},"modified":"2023-09-13T11:59:30","modified_gmt":"2023-09-13T11:59:30","slug":"her2-tumor-rois","status":"publish","type":"tcia_collection","link":"https:\/\/cm.vastapps.dev\/tcia-collection\/her2-tumor-rois\/","title":{"rendered":"HER2-TUMOR-ROIS"},"featured_media":7884,"template":"","citation-tax":[],"cancer_types":["HER2+ Breast Cancer"],"citations":[2946,2947,2925],"collection_doi":"10.7937\/E65C-AM96","collection_download_info":"Click the Versions tab for more info about data releases.","collection_downloads":[3148,3149],"full_export":"<h1 id=\"HER2andtrastuzumabtreatmentresponseH&amp;EslideswithtumorROIannotations(HER2tumorROIs)-Summary\">Summary<\/h1><span style=\"color: rgb(33,37,41);\"><span class=\"confluence-embedded-file-wrapper image-right-wrapper confluence-embedded-manual-size\"><img class=\"confluence-embedded-image image-right\" draggable=\"false\" height=\"250\" src=\"https:\/\/wiki.cancerimagingarchive.net\/download\/attachments\/embedded-page\/Public\/HER2%20and%20trastuzumab%20treatment%20response%20H&amp;E%20slides%20with%20tumor%20ROI%20annotations%20(HER2%20tumor%20ROIs)\/her2-roi.png?api=v2\"><\/span>The current standard of care for many patients with HER2-positive breast cancer is neoadjuvant chemotherapy in combination with anti-HER2 agents, based on HER2 amplification as detected by in situ hybridization (ISH) or protein immunohistochemistry (IHC). However, hematoxylin &amp; eosin (H&amp;E) tumor stains are more commonly available, and accurate prediction of HER2 status and anti-HER2 treatment response from H&amp;E would reduce costs and increase the speed of treatment selection. Computational algorithms for H&amp;E have been effective in predicting a variety of cancer features and clinical outcomes, including moderate success in predicting HER2 status. We trained a CNN classifier on 188 H&amp;E whole slide images (WSIs) manually annotated for tumor regions of interest (ROIs) by our pathology team. Our classifier achieved an area under the curve (AUC) of 0.90 in cross-validation of slide-level HER2 status and 0.81 on an independent TCGA test set. Moreover, we trained our classifier on pre-treatment samples from 187 HER2+ patients that subsequently received trastuzumab therapy. Our classifier achieved an AUC of 0.80 in a five-fold cross validation. Our work provides an H&amp;E-based algorithm that can predict HER2 status and trastuzumab response in breast cancer at an accuracy that may benefit clinical evaluations. Here, we are providing the datasets used in the study to facilitate development of other HER2+ diagnosis and trastuzumab response applications.<\/span><h3 style=\"text-align: left;\" id=\"HER2andtrastuzumabtreatmentresponseH&amp;EslideswithtumorROIannotations(HER2tumorROIs)-Dataannotation\"><strong>Data annotation<\/strong><\/h3><p style=\"text-align: left;\">Annotation of digital slides was performed, circling areas of invasive carcinoma (Region of Interests, ROIs). <span style=\"color: rgb(32,33,36);\">The manual annotation of ROIs significantly enhances the prediction accuracy and reduces the need for extensively large datasets.\u00a0 <\/span>Regions of necrosis, in situ carcinoma or benign stroma and epithelium were excluded. The images were annotated with ROIs associated to HER2+\/- tumor area (TA) by a senior breast pathologist.\u00a0 The annotations were marked tumor boundaries and annotated by Aperio ImageScope software. The annotations were exported from the Aperio software in The Extensible Markup Language (XML) format, including X and Y coordinates corresponding to the annotated regions. We used these coordinates for each slide image to tile these regions separately from the rest of the image, labeled as HER2+ or HER2- class.\u00a0\u00a0<\/p><h3 style=\"text-align: left;\" id=\"HER2andtrastuzumabtreatmentresponseH&amp;EslideswithtumorROIannotations(HER2tumorROIs)-Descriptionofdatasets\"><strong>Description of data sets<\/strong><\/h3><p class=\"p1\"><strong>Yale HER2 cohort:<\/strong>\u00a0This dataset presents 192 cases of HER2 positive and negative invasive breast carcinomas H&amp;E slides from the Yale Pathology electronic database. All tissues and data were retrieved under permission from the Yale Human Investigation Committee protocol #9505008219 to DLR. HER2 positive cases defined as those with 3+ score by immunohistochemistry (IHC) or an equivocal (2+) IHC score with subsequent amplification by fluorescence in situ hybridization (FISH) as defined by American Society of Clinical Oncology\/College of American Pathologists (ASCO\/CAP) clinical practice guidelines. H&amp;E slides generated at Yale School of Medicine include 93 HER2+ and 99 HER2- slides. The slides were scanned at Yale Pathology Tissue Services and underwent a slide quality check before they went into the scanner. The tissue samples were scanned using Vectra Polaris by Perkin-Elmer scanner using bright field whole slides scanning at 20\u00d7 magnification at Brady Memorial Laboratory Rimm\u2019s lab.<\/p><p style=\"text-align: left;\"><strong>Yale trastuzumab response cohort:<\/strong><span> 85<\/span>\u00a0response cohort cases were identified also by retrospective search of the Yale Pathology electronic database. Cases included those patients with a pre-treatment breast core biopsy with HER2 positive invasive breast carcinoma who then received neoadjuvant targeted therapy with trastuzumab +\/- pertuzumab prior to definitive surgery. HER2 positivity was defined as previously described for the HER2 negative\/positive cohort. The response to targeted therapy was obtained from the pathology reports of the surgical resection specimens and dichotomized into responders or non-responders. Those with a complete pathologic response, defined as no residual invasive, lymphovascular invasion or metastatic carcinoma, were designated as responders (n=36). Cases with only residual in situ carcinoma were included in the responder category. Those cases with any amount of residual invasive carcinoma, lymphovascular invasion or metastatic carcinoma were categorized as non-responders (n=49).<\/p><p class=\"p1\"><strong>TCGA HER2 cohort:<\/strong>\u00a0A total of 668 TCGA-BRCA HER2+\/- samples with available HER2 status were downloaded from the GDC portal (see &quot;<strong>Additional Resources<\/strong>&quot; below). Slides were visually inspected by our pathology team to exclude low quality samples with tissue folding or those that appeared to be from frozen tissue. A total of 182 samples (90 HER2- and 92 HER2+) were retained for use as independent test set.\u00a0 Information about which specific samples were retained can be found the\u00a0<a href=\"https:\/\/faspex.cancerimagingarchive.net\/aspera\/faspex\/external_deliveries\/278?passcode=4ee5d71f5adb4f116b72e3cab18abc6c4a037e5b\" class=\"external-link\" rel=\"nofollow\"><span class=\"s1\">TCGA_BRCA_Filtered<\/span><\/a>\u00a0folder of the dataset.<\/p><p class=\"p1\"><br\/><\/p><div class=\"tab-style-builtin\"><div class=\"localtabs-macro\"><div class=\"aui-tabs horizontal-tabs\" role=\"application\" data-aui-responsive=\"true\"><ul class=\"tabs-menu\"><li class=\"menu-item bv-localtab  active-tab \"><a href=\"#119702524aceef3fce2344b60b43de4c69f8459de\"><strong>Data Access<\/strong><\/a> <\/li><li class=\"menu-item bv-localtab \"><a href=\"#11970252411ac6411da9b4f119125d0c7afaa3837\"><strong>Detailed Description<\/strong><\/a> <\/li><li class=\"menu-item bv-localtab \"><a href=\"#119702524e8f91b671f9e42808ba9bd0b7fd551e0\"><strong>Citations &amp; Data Usage Policy<\/strong><\/a> <\/li><li class=\"menu-item bv-localtab \"><a href=\"#119702524aedf980875a944bba64b7656cb5142c4\"><strong>Versions<\/strong><\/a> <\/li><\/ul><div class=\"tabs-pane  active-pane \" id=\"119702524aceef3fce2344b60b43de4c69f8459de\" active=\"true\" name=\"Data Access\" ><h3 id=\"HER2andtrastuzumabtreatmentresponseH&amp;EslideswithtumorROIannotations(HER2tumorROIs)-DataAccess\">Data Access<\/h3><div class=\"table-wrap\"><table class=\"wrapped relative-table confluenceTable\" style=\"width: 65.0234%;\"><colgroup><col style=\"width: 20.447%;\"\/><col style=\"width: 51.6284%;\"\/><col style=\"width: 27.8822%;\"\/><\/colgroup><tbody><tr><th class=\"confluenceTh\">Data Type<\/th><th class=\"confluenceTh\">Download all or Query\/Filter<\/th><th class=\"confluenceTh\">License<\/th><\/tr><tr><td class=\"confluenceTd\"><p><span style=\"color: rgb(33,37,41);\">Tissue Slide Images (SVS,40GB)<\/span><\/p><p><span style=\"color: rgb(33,37,41);\">ROI Annotations (XML)<\/span><\/p><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/faspex.cancerimagingarchive.net\/aspera\/faspex\/external_deliveries\/311?passcode=dabde05cb596923338b717013b335f5258a9adb3\" class=\"external-link\" rel=\"nofollow\"><button class=\"tcia-btn tcia-download-color\"><i class=\"fa fa-cloud-download\" \/> Download<\/button><\/a>\u00a0\n<\/span>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/pathdb.cancerimagingarchive.net\/imagesearch?f[0]=collection:her2_roi\" class=\"external-link\" rel=\"nofollow\"><button class=\"tcia-btn tcia-search-color\"><i class=\"fa fa-search\" \/> Search<\/button><\/a>\u00a0\n<\/span><\/p>\u00a0<p>(Download and apply the\u00a0<a href=\"https:\/\/www.ibm.com\/aspera\/connect\/\" class=\"external-link\" rel=\"nofollow\">IBM-Aspera-Connect plugin\u00a0<\/a>to your browser to retrieve this faspex package)\u00a0<\/p><\/div><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p>\n<a href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\" class=\"external-link\" rel=\"nofollow\">CC BY 4.0<\/a><\/p><\/div><\/td><\/tr><tr><td class=\"confluenceTd\">Clinical data (XLSX)<\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/wiki.cancerimagingarchive.net\/download\/attachments\/119702524\/Yale_trastuzumab_response_cohort_metadata_clean.xlsx?api=v2\" rel=\"nofollow\"><button class=\"tcia-btn tcia-download-color\"><i class=\"fa fa-cloud-download\" \/> Download<\/button><\/a>\u00a0\n<\/span><\/p><\/div><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p>\n<a href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\" class=\"external-link\" rel=\"nofollow\">CC BY 4.0<\/a><\/p><\/div><\/td><\/tr><\/tbody><\/table><\/div><p>Click the Versions tab for more info about data releases.<\/p><h3 id=\"HER2andtrastuzumabtreatmentresponseH&amp;EslideswithtumorROIannotations(HER2tumorROIs)-AdditionalResources\">Additional Resources<\/h3><p style=\"text-align: left;\"><span style=\"color: rgb(0,0,0);\">The NCI Cancer Research Data Commons (CRDC) provides access to additional data and a cloud-based data science infrastructure that connects data sets with analytics tools to allow users to share, integrate, analyze, and visualize cancer research data.<\/span><\/p><ul style=\"text-align: left;\"><li class=\"auto-cursor-target\"><a href=\"https:\/\/portal.gdc.cancer.gov\/legacy-archive\/search\/f?filters=%7B%22op%22:%22and%22,%22content%22:%5B%7B%22op%22:%22in%22,%22content%22:%7B%22field%22:%22files.data_format%22,%22value%22:%5B%22SVS%22%5D%7D%7D,%7B%22op%22:%22in%22,%22content%22:%7B%22field%22:%22cases.project.program.name%22,%22value%22:%5B%22TCGA%22%5D%7D%7D,%7B%22op%22:%22in%22,%22content%22:%7B%22field%22:%22cases.project.project_id%22,%22value%22:%5B%22TCGA-BRCA%22%5D%7D%7D%5D%7D\" class=\"external-link\" rel=\"nofollow\">Genomic Data Commons Legacy Archive<\/a><span>\u00a0<\/span>(Tissue Slide Images)<ul style=\"text-align: left;\"><li><strong>TCGA HER2 cohort:<\/strong><span>\u00a0<\/span>A total of 668 TCGA-BRCA HER2+\/- samples with available HER2 status were downloaded from the GDC portal. Slides were visually inspected by our pathology team to exclude low quality samples with tissue folding or those that appeared to be from frozen tissue. A total of 182 samples (90 HER2- and 92 HER2+) were retained for use as independent test set. Information about which specific samples were retained from the GDC can be found the <a style=\"text-decoration: underline;\" href=\"https:\/\/faspex.cancerimagingarchive.net\/aspera\/faspex\/external_deliveries\/278?passcode=4ee5d71f5adb4f116b72e3cab18abc6c4a037e5b\" class=\"external-link\" rel=\"nofollow\">TCGA_BRCA_Filtered<\/a> folder of the dataset. To download the slides, GDC recommends the following: \u201cThe TCGA case (patient) barcode can be extracted from the first 12 characters of the file names in the manifest (i.e. TCGA-BH-A0EE, TCGA-D8-A27W etc.) which can be used to match the data in the clinical files to the slide images.\u201d Contact <a href=\"https:\/\/gdc.cancer.gov\/support\" class=\"external-link\" rel=\"nofollow\">https:\/\/gdc.cancer.gov\/support<\/a> with any questions about downloading the corresponding slides.<\/li><\/ul><\/li><\/ul><\/div><div class=\"tabs-pane \" id=\"11970252411ac6411da9b4f119125d0c7afaa3837\" name=\"Detailed Description\" ><h3 id=\"HER2andtrastuzumabtreatmentresponseH&amp;EslideswithtumorROIannotations(HER2tumorROIs)-DetailedDescription\">Detailed Description<\/h3><div class=\"table-wrap\"><table class=\"wrapped confluenceTable\"><colgroup><col\/><col\/><\/colgroup><tbody><tr><th class=\"confluenceTh\"><p>Image Statistics<\/p><\/th><th class=\"confluenceTh\"><br\/><\/th><\/tr><tr><td class=\"confluenceTd\"><p>Modalities<\/p><\/td><td class=\"confluenceTd\"><p>Pathology<\/p><\/td><\/tr><tr><td class=\"confluenceTd\"><p>Number of Patients<\/p><\/td><td class=\"confluenceTd\"><p>273<\/p><\/td><\/tr><tr><td class=\"confluenceTd\"><p>Number of Images<\/p><\/td><td class=\"confluenceTd\"><p>273<\/p><\/td><\/tr><tr><td class=\"confluenceTd\">Images Size (GB)<\/td><td class=\"confluenceTd\">40<\/td><\/tr><\/tbody><\/table><\/div><p style=\"text-align: left;\"><br\/><\/p><p style=\"text-align: left;\"><br\/><\/p><\/div><div class=\"tabs-pane \" id=\"119702524e8f91b671f9e42808ba9bd0b7fd551e0\" name=\"Citations BITVOODOO_ANDamp; Data Usage Policy\" ><h3 id=\"HER2andtrastuzumabtreatmentresponseH&amp;EslideswithtumorROIannotations(HER2tumorROIs)-Citations&amp;DataUsagePolicy\">Citations &amp; Data Usage Policy<\/h3><p><span>\n<p>\nUsers must abide by the <a href=\"https:\/\/wiki.cancerimagingarchive.net\/x\/c4hF\" class=\"external-link\" rel=\"nofollow\">TCIA Data Usage Policy and Restrictions<\/a>. Attribution should include references to the following citations:\n<\/p><\/span><\/p><div class=\"confluence-information-macro confluence-information-macro-information\"><p class=\"title\">Data Citation<\/p><span class=\"aui-icon aui-icon-small aui-iconfont-info confluence-information-macro-icon\"><\/span><div class=\"confluence-information-macro-body\"><p><span style=\"color: rgb(52,73,94);\">Farahmand, Saman, Fernandez, Aileen I, Ahmed, Fahad Shabbir, Rimm, David L., Chuang, Jeffrey H., Reisenbichler, Emily, &amp; Zarringhalam, Kourosh. (2022).<span>\u00a0<\/span><\/span><em>HER2 and trastuzumab treatment response H&amp;E slides with tumor ROI annotations<\/em><span style=\"color: rgb(52,73,94);\"><span>\u00a0<\/span>(Version 3) [Data set]. The Cancer Imaging Archive. <a href=\"https:\/\/doi.org\/10.7937\/E65C-AM96\" class=\"external-link\" rel=\"nofollow\">https:\/\/doi.org\/10.7937\/E65C-AM96<\/a><\/span><\/p><\/div><\/div><div class=\"confluence-information-macro confluence-information-macro-information\"><p class=\"title\">Publication Citation<\/p><span class=\"aui-icon aui-icon-small aui-iconfont-info confluence-information-macro-icon\"><\/span><div class=\"confluence-information-macro-body\"><p><span style=\"color: rgb(32,33,36);\">Farahmand, S., Fernandez, A.I., Ahmed, F.S. et al. Deep learning trained on hematoxylin and eosin tumor region of Interest predicts HER2 status and trastuzumab treatment response in HER2+ breast cancer. Mod Pathol (2021).<span>\u00a0<\/span><\/span><a rel=\"nofollow\" style=\"text-decoration: none;text-align: left;\" href=\"https:\/\/doi.org\/10.1038\/s41379-021-00911-w\" class=\"external-link\">https:\/\/doi.org\/10.1038\/s41379-021-00911-w<\/a><\/p><\/div><\/div><div class=\"confluence-information-macro confluence-information-macro-information\"><p class=\"title\">TCIA Citation<\/p><span class=\"aui-icon aui-icon-small aui-iconfont-info confluence-information-macro-icon\"><\/span><div class=\"confluence-information-macro-body\"><p>Clark, K., Vendt, B., Smith, K., Freymann, J., Kirby, J., Koppel, P., Moore, S., Phillips, S., Maffitt, D., Pringle, M., Tarbox, L., &amp; Prior, F. (2013). The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository. In Journal of Digital Imaging (Vol. 26, Issue 6, pp. 1045\u20131057). Springer Science and Business Media LLC. <a href=\"https:\/\/doi.org\/10.1007\/s10278-013-9622-7\" class=\"external-link\" rel=\"nofollow\">https:\/\/doi.org\/10.1007\/s10278-013-9622-7<\/a><\/p><\/div><\/div><h3 id=\"HER2andtrastuzumabtreatmentresponseH&amp;EslideswithtumorROIannotations(HER2tumorROIs)-OtherPublicationsUsingThisData\">Other Publications Using This Data<\/h3><p><span>TCIA maintains\u00a0<\/span><a href=\"https:\/\/www.cancerimagingarchive.net\/publications\/\" class=\"external-link\" rel=\"nofollow\">a list of publications<\/a><span> which leverage TCIA data. <\/span> If you have a manuscript you'd like to add please<a href=\"http:\/\/www.cancerimagingarchive.net\/support\/\" class=\"external-link\" rel=\"nofollow\"> contact the TCIA Helpdesk<\/a>.<\/p><\/div><div class=\"tabs-pane \" id=\"119702524aedf980875a944bba64b7656cb5142c4\" name=\"Versions\" ><h3 id=\"HER2andtrastuzumabtreatmentresponseH&amp;EslideswithtumorROIannotations(HER2tumorROIs)-Version3(Current):Updated2022\/08\/01\">Version 3 (Current): Updated 2022\/08\/01<\/h3><div class=\"table-wrap\"><table class=\"wrapped confluenceTable\"><colgroup><col\/><col\/><col\/><\/colgroup><tbody><tr><th class=\"confluenceTh\">Data Type<\/th><th class=\"confluenceTh\">Download all or Query\/Filter<\/th><th class=\"confluenceTh\">License<\/th><\/tr><tr><td class=\"confluenceTd\"><p><span style=\"color: rgb(33,37,41);\">Tissue Slide Images (SVS,40GB)<\/span><\/p><p><span style=\"color: rgb(33,37,41);\">ROI Annotations (XML)<\/span><\/p><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/faspex.cancerimagingarchive.net\/aspera\/faspex\/external_deliveries\/311?passcode=dabde05cb596923338b717013b335f5258a9adb3\" class=\"external-link\" rel=\"nofollow\"><button class=\"tcia-btn tcia-download-color\"><i class=\"fa fa-cloud-download\" \/> Download<\/button><\/a>\u00a0\n<\/span>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/pathdb.cancerimagingarchive.net\/imagesearch?f[0]=collection:her2_roi\" class=\"external-link\" rel=\"nofollow\"><button class=\"tcia-btn tcia-search-color\"><i class=\"fa fa-search\" \/> Search<\/button><\/a>\u00a0\n<\/span><br\/><\/p><p>(Download and apply the\u00a0<a href=\"https:\/\/www.ibm.com\/aspera\/connect\/\" class=\"external-link\" rel=\"nofollow\">IBM-Aspera-Connect plugin\u00a0<\/a>to your browser to retrieve this faspex package)\u00a0<\/p><\/div><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p>\n<a href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\" class=\"external-link\" rel=\"nofollow\">CC BY 4.0<\/a><\/p><\/div><\/td><\/tr><tr><td class=\"confluenceTd\">Clinical data (XLSX)<\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/wiki.cancerimagingarchive.net\/download\/attachments\/119702524\/Yale_trastuzumab_response_cohort_metadata_clean.xlsx?api=v2\" rel=\"nofollow\"><button class=\"tcia-btn tcia-download-color\"><i class=\"fa fa-cloud-download\" \/> Download<\/button><\/a>\u00a0\n<\/span><\/p><\/div><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p>\n<a href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\" class=\"external-link\" rel=\"nofollow\">CC BY 4.0<\/a><\/p><\/div><\/td><\/tr><\/tbody><\/table><\/div><p>changes made:<\/p><p>macros removed from all SVS images. older versions of the data no longer available for download.<\/p><h3 id=\"HER2andtrastuzumabtreatmentresponseH&amp;EslideswithtumorROIannotations(HER2tumorROIs)-Version2:updated2022\/06\/28\">Version 2: updated 2022\/06\/28<\/h3><div class=\"table-wrap\"><table class=\"wrapped confluenceTable\"><colgroup><col\/><col\/><col\/><\/colgroup><tbody><tr><th class=\"confluenceTh\">Data Type<\/th><th class=\"confluenceTh\">Download all or Query\/Filter<\/th><th class=\"confluenceTh\">License<\/th><\/tr><tr><td class=\"confluenceTd\"><p><span style=\"color: rgb(33,37,41);\">Tissue Slide Images (SVS,40GB)<\/span><\/p><p><span style=\"color: rgb(33,37,41);\">ROI Annotations (XML)<\/span><\/p><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\" \/><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p>\n<a href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\" class=\"external-link\" rel=\"nofollow\">CC BY 4.0<\/a><\/p><\/div><\/td><\/tr><tr><td class=\"confluenceTd\">Clinical data (XLSX)<\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/wiki.cancerimagingarchive.net\/download\/attachments\/119702524\/Yale_trastuzumab_response_cohort_metadata_clean.xlsx?api=v2\" rel=\"nofollow\"><button class=\"tcia-btn tcia-download-color\"><i class=\"fa fa-cloud-download\" \/> Download<\/button><\/a>\u00a0\n<\/span><\/p><\/div><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p>\n<a href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\" class=\"external-link\" rel=\"nofollow\">CC BY 4.0<\/a><\/p><\/div><\/td><\/tr><\/tbody><\/table><\/div><p>changes made:\u00a0<\/p><p class=\"p1\">5 files were removed (&quot;TCGA-A2-AOEQ-01Z-00-DX1&quot;, &quot;TCGA-AO-AOJG-01Z-00-DX1&quot;, &quot;TCGA-B6-A019-01Z-00-DX1\u201d, &quot;TCGA-EW-A1OZ-01Z-00-DX1.svs&quot; &amp; TCGA_BRCA_Filtered\/meta.txt), 2 files were cleaned up and replaced (gdc_manifest.2022-0420.txt &amp; HER2_TCGA.csv) and 1 file was added (case&amp;annotation_counts_clean.xlsx). The 'HER2 Yale cohort' paragraph was updated to correct patient counts (99 negative &amp; 93 positive for a total of 192 cases). The 'TCGA HER2 cohort' paragraph was updated to correct patient counts (92 positive &amp; 90 negative for a total of 182 cases). The metadata CSV file was replaced with an updated XLSX file to remove duplicate patient rows.<\/p><h3 id=\"HER2andtrastuzumabtreatmentresponseH&amp;EslideswithtumorROIannotations(HER2tumorROIs)-Version1:updated2022\/03\/25\">Version 1: updated 2022\/03\/25<\/h3><div class=\"table-wrap\"><table class=\"wrapped confluenceTable\"><colgroup><col\/><col\/><col\/><\/colgroup><tbody><tr><th class=\"confluenceTh\">Data Type<\/th><th class=\"confluenceTh\">Download all or Query\/Filter<\/th><th class=\"confluenceTh\">License<\/th><\/tr><tr><td class=\"confluenceTd\"><p><span style=\"color: rgb(33,37,41);\">Tissue Slide Images (SVS,40GB)<\/span><\/p><p><span style=\"color: rgb(33,37,41);\">ROI Annotations (XML)<\/span><\/p><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\" \/><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p>\n<a href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\" class=\"external-link\" rel=\"nofollow\">CC BY 4.0<\/a><\/p><\/div><\/td><\/tr><tr><td class=\"confluenceTd\">Clinical data (CSV)<\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p><a href=\"https:\/\/wiki.cancerimagingarchive.net\/download\/attachments\/119702524\/HER2_ROI.csv?version=1&amp;modificationDate=1648579391951&amp;api=v2\" data-linked-resource-id=\"119702916\" data-linked-resource-version=\"1\" data-linked-resource-type=\"attachment\" data-linked-resource-default-alias=\"HER2_ROI.csv\" data-linked-resource-content-type=\"text\/csv\" data-linked-resource-container-id=\"119702524\" data-linked-resource-container-version=\"27\"><span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\"><img class=\"confluence-embedded-image confluence-thumbnail\" draggable=\"false\" height=\"30\" src=\"https:\/\/wiki.cancerimagingarchive.net\/download\/attachments\/embedded-page\/Public\/HER2%20and%20trastuzumab%20treatment%20response%20H&amp;E%20slides%20with%20tumor%20ROI%20annotations%20(HER2%20tumor%20ROIs)\/tcia_wiki_download_button.png?api=v2\"><\/span><\/a><\/p><\/div><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p>\n<a href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\" class=\"external-link\" rel=\"nofollow\">CC BY 4.0<\/a><\/p><\/div><\/td><\/tr><\/tbody><\/table><\/div><\/div><\/div><\/div><\/div><p><br\/><\/p><p><br\/><\/p>","versions":false,"additional_resources":"The NCI Cancer Research Data Commons (CRDC) provides access to additional data and a cloud-based data science infrastructure that connects data sets with analytics tools to allow users to share, integrate, analyze, and visualize cancer research data.\n<ul><li><a href=\"https:\/\/portal.gdc.cancer.gov\/legacy-archive\/search\/f?filters=%7B%22op%22:%22and%22,%22content%22:%5B%7B%22op%22:%22in%22,%22content%22:%7B%22field%22:%22files.data_format%22,%22value%22:%5B%22SVS%22%5D%7D%7D,%7B%22op%22:%22in%22,%22content%22:%7B%22field%22:%22cases.project.program.name%22,%22value%22:%5B%22TCGA%22%5D%7D%7D,%7B%22op%22:%22in%22,%22content%22:%7B%22field%22:%22cases.project.project_id%22,%22value%22:%5B%22TCGA-BRCA%22%5D%7D%7D%5D%7D\">Genomic Data Commons Legacy Archive<\/a>\u00a0(Tissue Slide Images)<ul><li><strong>TCGA HER2 cohort:<\/strong>\u00a0A total of 668 TCGA-BRCA HER2+\/- samples with available HER2 status were downloaded from the GDC portal. Slides were visually inspected by our pathology team to exclude low quality samples with tissue folding or those that appeared to be from frozen tissue. A total of 182 samples (90 HER2- and 92 HER2+) were retained for use as independent test set. Information about which specific samples were retained from the GDC can be found the <a href=\"https:\/\/faspex.cancerimagingarchive.net\/aspera\/faspex\/external_deliveries\/278?passcode=4ee5d71f5adb4f116b72e3cab18abc6c4a037e5b\">TCGA_BRCA_Filtered<\/a> folder of the dataset. To download the slides, GDC recommends the following: \u201cThe TCGA case (patient) barcode can be extracted from the first 12 characters of the file names in the manifest (i.e. TCGA-BH-A0EE, TCGA-D8-A27W etc.) which can be used to match the data in the clinical files to the slide images.\u201d Contact <a href=\"https:\/\/gdc.cancer.gov\/support\">https:\/\/gdc.cancer.gov\/support<\/a> with any questions about downloading the corresponding slides.<\/li><\/ul><\/li><\/ul>","cancer_locations":["Breast"],"collection_page_accessibility":"Public","publications_related":"","version_change_log":"","version_change_log_archived":"","analysis_results":"","collection_status":"Complete","publications_using":"TCIA maintains\u00a0<a href=\"https:\/\/www.cancerimagingarchive.net\/publications\/\">a list of publications<\/a> which leverage TCIA data.  If you have a manuscript you'd like to add please<a href=\"http:\/\/www.cancerimagingarchive.net\/support\/\"> contact the TCIA Helpdesk<\/a>.","species":["Human"],"collection_title":"HER2 and trastuzumab treatment response H&E slides with tumor ROI annotations","detailed_description":"<br\/>\n<br\/>","related_analysis_results":false,"subjects":"273","collection_short_title":"HER2 tumor ROIs","data_types":["Pathology"],"date_updated":"2023-09-13","collection_browse_title":"","supporting_data":["Image Analyses"],"collection_featured_image":{"ID":"7884","post_author":"6","post_date":"2023-09-13 03:46:57","post_date_gmt":"2023-09-13 03:46:57","post_content":"","post_title":"her2-roi","post_excerpt":"","post_status":"inherit","comment_status":"open","ping_status":"closed","post_password":"","post_name":"her2-roi","to_ping":"","pinged":"","post_modified":"2023-09-13 11:59:30","post_modified_gmt":"2023-09-13 11:59:30","post_content_filtered":"","post_parent":"3016","guid":"https:\/\/cm.vastapps.dev\/wp-content\/uploads\/her2-roi.png","menu_order":"0","post_type":"attachment","post_mime_type":"image\/png","comment_count":"0","pod_item_id":"7884"},"collection_summary":"The current standard of care for many patients with HER2-positive breast cancer is neoadjuvant chemotherapy in combination with anti-HER2 agents, based on HER2 amplification as detected by in situ hybridization (ISH) or protein immunohistochemistry (IHC). However, hematoxylin &amp; eosin (H&amp;E) tumor stains are more commonly available, and accurate prediction of HER2 status and anti-HER2 treatment response from H&amp;E would reduce costs and increase the speed of treatment selection. Computational algorithms for H&amp;E have been effective in predicting a variety of cancer features and clinical outcomes, including moderate success in predicting HER2 status. We trained a CNN classifier on 188 H&amp;E whole slide images (WSIs) manually annotated for tumor regions of interest (ROIs) by our pathology team. Our classifier achieved an area under the curve (AUC) of 0.90 in cross-validation of slide-level HER2 status and 0.81 on an independent TCGA test set. Moreover, we trained our classifier on pre-treatment samples from 187 HER2+ patients that subsequently received trastuzumab therapy. Our classifier achieved an AUC of 0.80 in a five-fold cross validation. Our work provides an H&amp;E-based algorithm that can predict HER2 status and trastuzumab response in breast cancer at an accuracy that may benefit clinical evaluations. Here, we are providing the datasets used in the study to facilitate development of other HER2+ diagnosis and trastuzumab response applications.\n<h3><strong>Data annotation<\/strong><\/h3>\nAnnotation of digital slides was performed, circling areas of invasive carcinoma (Region of Interests, ROIs). The manual annotation of ROIs significantly enhances the prediction accuracy and reduces the need for extensively large datasets.\u00a0 Regions of necrosis, in situ carcinoma or benign stroma and epithelium were excluded. The images were annotated with ROIs associated to HER2+\/- tumor area (TA) by a senior breast pathologist.\u00a0 The annotations were marked tumor boundaries and annotated by Aperio ImageScope software. The annotations were exported from the Aperio software in The Extensible Markup Language (XML) format, including X and Y coordinates corresponding to the annotated regions. We used these coordinates for each slide image to tile these regions separately from the rest of the image, labeled as HER2+ or HER2- class.\u00a0\u00a0\n<h3><strong>Description of data sets<\/strong><\/h3>\n<strong>Yale HER2 cohort:<\/strong>\u00a0This dataset presents 192 cases of HER2 positive and negative invasive breast carcinomas H&amp;E slides from the Yale Pathology electronic database. All tissues and data were retrieved under permission from the Yale Human Investigation Committee protocol #9505008219 to DLR. HER2 positive cases defined as those with 3+ score by immunohistochemistry (IHC) or an equivocal (2+) IHC score with subsequent amplification by fluorescence in situ hybridization (FISH) as defined by American Society of Clinical Oncology\/College of American Pathologists (ASCO\/CAP) clinical practice guidelines. H&amp;E slides generated at Yale School of Medicine include 93 HER2+ and 99 HER2- slides. The slides were scanned at Yale Pathology Tissue Services and underwent a slide quality check before they went into the scanner. The tissue samples were scanned using Vectra Polaris by Perkin-Elmer scanner using bright field whole slides scanning at 20\u00d7 magnification at Brady Memorial Laboratory Rimm\u2019s lab.\n<strong>Yale trastuzumab response cohort:<\/strong> 85\u00a0response cohort cases were identified also by retrospective search of the Yale Pathology electronic database. Cases included those patients with a pre-treatment breast core biopsy with HER2 positive invasive breast carcinoma who then received neoadjuvant targeted therapy with trastuzumab +\/- pertuzumab prior to definitive surgery. HER2 positivity was defined as previously described for the HER2 negative\/positive cohort. The response to targeted therapy was obtained from the pathology reports of the surgical resection specimens and dichotomized into responders or non-responders. Those with a complete pathologic response, defined as no residual invasive, lymphovascular invasion or metastatic carcinoma, were designated as responders (n=36). Cases with only residual in situ carcinoma were included in the responder category. Those cases with any amount of residual invasive carcinoma, lymphovascular invasion or metastatic carcinoma were categorized as non-responders (n=49).\n<strong>TCGA HER2 cohort:<\/strong>\u00a0A total of 668 TCGA-BRCA HER2+\/- samples with available HER2 status were downloaded from the GDC portal (see \"<strong>Additional Resources<\/strong>\" below). Slides were visually inspected by our pathology team to exclude low quality samples with tissue folding or those that appeared to be from frozen tissue. A total of 182 samples (90 HER2- and 92 HER2+) were retained for use as independent test set.\u00a0 Information about which specific samples were retained can be found the\u00a0<a href=\"https:\/\/faspex.cancerimagingarchive.net\/aspera\/faspex\/external_deliveries\/278?passcode=4ee5d71f5adb4f116b72e3cab18abc6c4a037e5b\">TCGA_BRCA_Filtered<\/a>\u00a0folder of the dataset.\n<br\/>","collection_acknowledgements":"","collection_funding":"","hide_from_browse_table":[],"_links":{"self":[{"href":"https:\/\/cm.vastapps.dev\/api\/v1\/collections\/3016"}],"collection":[{"href":"https:\/\/cm.vastapps.dev\/api\/v1\/collections"}],"about":[{"href":"https:\/\/cm.vastapps.dev\/api\/wp\/v2\/types\/tcia_collection"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/cm.vastapps.dev\/api\/wp\/v2\/media\/7884"}],"wp:attachment":[{"href":"https:\/\/cm.vastapps.dev\/api\/wp\/v2\/media?parent=3016"}],"wp:term":[{"taxonomy":"tcia_citation_tax","embeddable":true,"href":"https:\/\/cm.vastapps.dev\/api\/v1\/citation-tax?post=3016"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}