{"id":5526,"date":"2023-09-04T03:01:28","date_gmt":"2023-09-04T03:01:28","guid":{"rendered":"https:\/\/cm.vastapps.dev\/tcia-collection\/pseudo-phi-dicom-data\/"},"modified":"2023-09-13T11:55:51","modified_gmt":"2023-09-13T11:55:51","slug":"pseudo-phi-dicom-data","status":"publish","type":"tcia_collection","link":"https:\/\/cm.vastapps.dev\/tcia-collection\/pseudo-phi-dicom-data\/","title":{"rendered":"PSEUDO-PHI-DICOM-DATA"},"featured_media":0,"template":"","citation-tax":[],"cancer_types":["Various"],"citations":[4329,4330,2925],"collection_doi":"10.7937\/s17z-r072","collection_download_info":"Click the Versions tab for more info about data releases.\nPlease contact <a href=\"mailto:help@cancerimagingarchive.net\">help@cancerimagingarchive.net<\/a>\u00a0 with any questions regarding usage.","collection_downloads":[4880,4881,4882,4883],"full_export":"<h1 id=\"ADICOMdatasetforevaluationofmedicalimagedeidentification(PseudoPHIDICOMData)-Summary\">Summary<\/h1>Open access or shared research data must comply with (HIPAA) patient privacy regulations. These regulations require the de-identification of datasets before they can be placed in the public domain.\u00a0 The process of image de-identification is time consuming, requires significant human resources, and is prone to human error. \u00a0Automated image de-identification algorithms have been developed but the research community requires some method of evaluation before such tools can be widely accepted.\u00a0 This evaluation requires a robust dataset that can be used as part of an evaluation process for de-identification algorithms. 
\u00a0<\/p><p>We developed a DICOM dataset that can be used to evaluate the performance of de-identification algorithms.\u00a0DICOM image information objects were selected from datasets published in TCIA.\u00a0 Synthetic Protected Health Information (PHI) was generated and inserted into selected DICOM data elements to mimic typical clinical imaging exams.\u00a0 The evaluation dataset was de-identified by a TCIA curation team using standard TCIA tools and procedures. We are publishing the evaluation dataset (containing synthetic PHI) and the de-identified evaluation dataset (result of TCIA curation) in advance of a potential competition, sponsored by the National Cancer Institute (NCI), for the evaluation of de-identification algorithms and the de-identification of medical image datasets. The evaluation dataset published here is a subset of a larger evaluation dataset that was created under contract for the National Cancer Institute. This subset is being published to allow researchers to test their de-identification algorithms and to promote standardized procedures for validating automated de-identification.<\/p><h3 id=\"ADICOMdatasetforevaluationofmedicalimagedeidentification(PseudoPHIDICOMData)-Acknowledgements\"><span>Acknowledgements<\/span><\/h3><p>We would like to acknowledge the National Cancer Institute for funding and actively participating in the project that generated the evaluation datasets being published here, and the TCIA curation team, led by Ms. 
Geri Blake, who curated this data.\u00a0 Original data came from multiple institutions and multiple TCIA image collections.<\/p><div class=\"tab-style-builtin\"><div class=\"localtabs-macro\"><div class=\"aui-tabs horizontal-tabs\" role=\"application\" data-aui-responsive=\"true\"><ul class=\"tabs-menu\"><li class=\"menu-item bv-localtab  active-tab \"><a href=\"#80969777d92ba6242b3e43148b7d46f9a84e8628\"><strong>Data Access<\/strong><\/a> <\/li><li class=\"menu-item bv-localtab \"><a href=\"#80969777dc3b7facffbc49ed877284fe9b33ef2e\"><strong>Detailed Description<\/strong><\/a> <\/li><li class=\"menu-item bv-localtab \"><a href=\"#80969777afad29920e1c4c5cbe1d676290052820\"><strong>Citations &amp; Data Usage Policy<\/strong><\/a> <\/li><li class=\"menu-item bv-localtab \"><a href=\"#8096977715ad9b8139f049779cc36917a41a4198\"><strong>Versions<\/strong><\/a> <\/li><\/ul><div class=\"tabs-pane  active-pane \" id=\"80969777d92ba6242b3e43148b7d46f9a84e8628\" active=\"true\" name=\"Data Access\" ><h3 id=\"ADICOMdatasetforevaluationofmedicalimagedeidentification(PseudoPHIDICOMData)-DataAccess\">Data Access<\/h3><div class=\"table-wrap\"><table class=\"wrapped relative-table confluenceTable\" style=\"width: 58.0963%;\"><colgroup><col style=\"width: 24.1214%;\"\/><col style=\"width: 38.5622%;\"\/><col style=\"width: 37.2829%;\"\/><\/colgroup><tbody><tr><th class=\"confluenceTh\">Data Type<\/th><th class=\"confluenceTh\">Download all or Query\/Filter<\/th><th class=\"confluenceTh\">License<\/th><\/tr><tr><td class=\"confluenceTd\"><p>Images,\u00a0 (DICOM, 609 MB)<\/p><p>Evaluation dataset<\/p><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p><br\/>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/wiki.cancerimagingarchive.net\/download\/attachments\/80969777\/Pseudo-Phi-DICOM%20Evaluation%20dataset%20April%207%202021.tcia?api=v2\" rel=\"nofollow\"><button class=\"tcia-btn tcia-download-color\"><i 
class=\"fa fa-cloud-download\" \/> Download<\/button><\/a>\u00a0\n<\/span>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/nbia.cancerimagingarchive.net\/nbia-search\/?MinNumberOfStudiesCriteria=1&amp;PatientCriteria=6670427471,9189822998,9894340694,8989193730,8155012288,571403367,292821506,339833062,3642991663,6774825273,8732322741,7255997752,7361647728,6451050561,292821506,8834647487,6774825273,6451050561,8548156246,4025360156,6614238035,9894340694,6415974217,3209648408,9894340694,8189244869\" class=\"external-link\" rel=\"nofollow\"><button class=\"tcia-btn tcia-search-color\"><i class=\"fa fa-search\" \/> Search<\/button><\/a>\u00a0\n<\/span><br\/><\/p><p><span style=\"color: rgb(33,37,41);text-decoration: none;\">(Download requires\u00a0<\/span><span style=\"color: rgb(33,37,41);text-decoration: none;\">the<span>\u00a0<\/span><\/span><a style=\"text-decoration: none;text-align: left;\" href=\"https:\/\/wiki.cancerimagingarchive.net\/display\/NBIA\/Downloading+TCIA+Images\" rel=\"nofollow\">NBIA Data Retriever<\/a>)<\/p><\/div><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p>\n<a href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\" class=\"external-link\" rel=\"nofollow\">CC BY 4.0<\/a><\/p><\/div><\/td><\/tr><tr><td class=\"confluenceTd\"><p>Images,\u00a0 (DICOM, 606 MB)<\/p><p>De-identified Evaluation dataset<\/p><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p><br\/>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/wiki.cancerimagingarchive.net\/download\/attachments\/80969777\/Pseudo-PHI-DICOM%20De-id%20Evaluation%20dataset%20April%207%202021.tcia?api=v2\" rel=\"nofollow\"><button class=\"tcia-btn tcia-download-color\"><i class=\"fa fa-cloud-download\" \/> Download<\/button><\/a>\u00a0\n<\/span>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a 
href=\"https:\/\/nbia.cancerimagingarchive.net\/nbia-search\/?MinNumberOfStudiesCriteria=1&amp;PatientCriteria=Pseudo-PHI-005,Pseudo-PHI-015,Pseudo-PHI-019,Pseudo-PHI-001,Pseudo-PHI-010,Pseudo-PHI-014,Pseudo-PHI-018,Pseudo-PHI-002,Pseudo-PHI-013,Pseudo-PHI-012,Pseudo-PHI-020,Pseudo-PHI-011,Pseudo-PHI-006,Pseudo-PHI-011,Pseudo-PHI-016,Pseudo-PHI-008,Pseudo-PHI-017,Pseudo-PHI-007,Pseudo-PHI-021,Pseudo-PHI-001,Pseudo-PHI-003,Pseudo-PHI-009,Pseudo-PHI-008,Pseudo-PHI-021,Pseudo-PHI-021,Pseudo-PHI-004\" class=\"external-link\" rel=\"nofollow\"><button class=\"tcia-btn tcia-search-color\"><i class=\"fa fa-search\" \/> Search<\/button><\/a>\u00a0\n<\/span><br\/><\/p><p><span style=\"color: rgb(33,37,41);text-decoration: none;\">(Download requires\u00a0<\/span><span style=\"color: rgb(33,37,41);text-decoration: none;\">the<span>\u00a0<\/span><\/span><a style=\"text-decoration: none;text-align: left;\" href=\"https:\/\/wiki.cancerimagingarchive.net\/display\/NBIA\/Downloading+TCIA+Images\" rel=\"nofollow\">NBIA Data Retriever<\/a>)<\/p><\/div><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p>\n<a href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\" class=\"external-link\" rel=\"nofollow\">CC BY 4.0<\/a><\/p><\/div><\/td><\/tr><tr><td class=\"confluenceTd\"><p>Patient Mapping (csv)<\/p><p><span style=\"color: rgb(33,33,33);\">Evaluation\/De-identified<\/span><\/p><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p><br\/>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/wiki.cancerimagingarchive.net\/download\/attachments\/80969777\/Pseudo-PHI-DICOM-Dataset%20patid_crosswalk.csv?api=v2\" rel=\"nofollow\"><button class=\"tcia-btn tcia-download-color\"><i class=\"fa fa-cloud-download\" \/> Download<\/button><\/a>\u00a0\n<\/span><\/p><\/div><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p>\n<a href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\" 
class=\"external-link\" rel=\"nofollow\">CC BY 4.0<\/a><\/p><\/div><\/td><\/tr><tr><td class=\"confluenceTd\"><p>UID Mapping (csv)<\/p><p><span style=\"color: rgb(33,33,33);\">Evaluation\/De-identified<\/span><\/p><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p><br\/>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/wiki.cancerimagingarchive.net\/download\/attachments\/80969777\/Pseudo-PHI-DICOM-Dataset%20uid_crosswalk.csv?api=v2\" rel=\"nofollow\"><button class=\"tcia-btn tcia-download-color\"><i class=\"fa fa-cloud-download\" \/> Download<\/button><\/a>\u00a0\n<\/span><\/p><\/div><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p>\n<a href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\" class=\"external-link\" rel=\"nofollow\">CC BY 4.0<\/a><\/p><\/div><\/td><\/tr><\/tbody><\/table><\/div><p>Click the Versions tab for more info about data releases.<\/p><p><span style=\"color: rgb(23,43,77);\">Please contact <a rel=\"nofollow\" class=\"external-link\" href=\"mailto:help@cancerimagingarchive.net\">help@cancerimagingarchive.net<\/a>\u00a0 with any questions regarding usage.<\/span><\/p><h3 style=\"text-align: left;\" id=\"ADICOMdatasetforevaluationofmedicalimagedeidentification(PseudoPHIDICOMData)-AdditionalResourcesforthisDataset\">Additional Resources for this Dataset<\/h3><p style=\"text-align: left;\"><span style=\"color: rgb(0,0,0);\">The NCI Cancer Research Data Commons (CRDC) provides access to additional data and a cloud-based data science infrastructure that connects data sets with analytics tools to allow users to share, integrate, analyze, and visualize cancer research data.<\/span><\/p><ul style=\"text-align: left;\"><li class=\"auto-cursor-target\"><a href=\"https:\/\/portal.imaging.datacommons.cancer.gov\/explore\/filters\/?collection_id=pseudo_phi_dicom_data\" class=\"external-link\" rel=\"nofollow\">Imaging Data Commons 
(IDC)<\/a><span>\u00a0<\/span>(Imaging Data)<\/li><\/ul><\/div><div class=\"tabs-pane \" id=\"80969777dc3b7facffbc49ed877284fe9b33ef2e\" name=\"Detailed Description\" ><h3 id=\"ADICOMdatasetforevaluationofmedicalimagedeidentification(PseudoPHIDICOMData)-DetailedDescription\">Detailed Description<\/h3><div class=\"table-wrap\"><table class=\"wrapped confluenceTable\"><colgroup><col\/><col\/><\/colgroup><tbody><tr><th class=\"confluenceTh\"><p>Image Statistics<\/p><\/th><th class=\"confluenceTh\"><br\/><\/th><\/tr><tr><td class=\"confluenceTd\"><p>Modalities<\/p><\/td><td class=\"confluenceTd\"><p>CR, CT, DX, MG, MR, PT<\/p><\/td><\/tr><tr><td class=\"confluenceTd\"><p>Number of Patients<\/p><\/td><td class=\"confluenceTd\"><p>42<\/p><\/td><\/tr><tr><td class=\"confluenceTd\"><p>Number of Studies<\/p><\/td><td class=\"confluenceTd\"><p>44<\/p><\/td><\/tr><tr><td class=\"confluenceTd\"><p>Number of Series<\/p><\/td><td class=\"confluenceTd\"><p>52<\/p><\/td><\/tr><tr><td class=\"confluenceTd\"><p>Number of Images<\/p><\/td><td class=\"confluenceTd\"><p>3386<\/p><\/td><\/tr><tr><td colspan=\"1\" class=\"confluenceTd\">Images Size (GB)<\/td><td colspan=\"1\" class=\"confluenceTd\">1.2<\/td><\/tr><\/tbody><\/table><\/div><p class=\"auto-cursor-target\">There are 21 patients, 22 studies, and 26 series, but the patient IDs, Study Instance UIDs, and Series Instance UIDs differ between the two datasets, resulting in a double count.<\/p><\/div><div class=\"tabs-pane \" id=\"80969777afad29920e1c4c5cbe1d676290052820\" name=\"Citations &amp; Data Usage Policy\" ><h3 id=\"ADICOMdatasetforevaluationofmedicalimagedeidentification(PseudoPHIDICOMData)-Citations&amp;DataUsagePolicy\">Citations &amp; Data Usage Policy<\/h3><p><span>\n<p>\nUsers must abide by the <a href=\"https:\/\/wiki.cancerimagingarchive.net\/x\/c4hF\" class=\"external-link\" rel=\"nofollow\">TCIA Data Usage Policy and Restrictions<\/a>. 
Attribution should include references to the following citations:\n<\/p><\/span><\/p><div class=\"confluence-information-macro confluence-information-macro-information\"><p class=\"title\">Data Citation<\/p><span class=\"aui-icon aui-icon-small aui-iconfont-info confluence-information-macro-icon\"><\/span><div class=\"confluence-information-macro-body\"><p>Rutherford, M., Mun, S.K., Levine, B., Bennett, W.C., Smith, K., Farmer, P., Jarosz, J., Wagner, U., Farahani, K., Prior, F. (2021). <span style=\"color: rgb(102,102,102);\">A DICOM dataset for evaluation of medical image de-identification (Pseudo-PHI-DICOM-Data) [Data set]<\/span><strong>.\u00a0<\/strong><span style=\"color: rgb(0,0,0);\">The Cancer Imaging Archive. DOI:\u00a0<a href=\"https:\/\/doi.org\/10.7937\/s17z-r072\" class=\"external-link\" rel=\"nofollow\"><span class=\"nolink\">https:\/\/doi.org\/10.7937\/s17z-r072<\/span><\/a><\/span><\/p><\/div><\/div><div class=\"confluence-information-macro confluence-information-macro-information\"><p class=\"title\">Publication Citation<\/p><span class=\"aui-icon aui-icon-small aui-iconfont-info confluence-information-macro-icon\"><\/span><div class=\"confluence-information-macro-body\"><p>Rutherford, M., Mun, S.K., Levine, B., Bennett, W.C., Smith, K., Farmer, P., Jarosz, J., Wagner, U., Freyman, J., Blake, G., Tarbox, L., Farahani, K., Prior, F. (2021). A DICOM dataset for evaluation of medical image de-identification,\u00a0Nature Scientific Data. 
DOI: <a href=\"https:\/\/doi.org\/10.1038\/s41597-021-00967-y\" class=\"external-link\" rel=\"nofollow\">10.1038\/s41597-021-00967-y<\/a>.\u00a0<\/p><\/div><\/div><div class=\"confluence-information-macro confluence-information-macro-information\"><p class=\"title\">TCIA Citation<\/p><span class=\"aui-icon aui-icon-small aui-iconfont-info confluence-information-macro-icon\"><\/span><div class=\"confluence-information-macro-body\"><p>Clark K, Vendt B, Smith K, Freymann J, Kirby J, Koppel P, Moore S, Phillips S, Maffitt D, Pringle M, Tarbox L, Prior F.\u00a0The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository, Journal of Digital Imaging, Volume 26, Number 6, December, 2013, pp 1045-1057. DOI: <a href=\"https:\/\/doi.org\/10.1007\/s10278-013-9622-7\" class=\"external-link\" rel=\"nofollow\">10.1007\/s10278-013-9622-7<\/a><\/p><\/div><\/div><h3 id=\"ADICOMdatasetforevaluationofmedicalimagedeidentification(PseudoPHIDICOMData)-OtherPublicationsUsingThisData\">Other Publications Using This Data<\/h3><p><span>TCIA maintains\u00a0<\/span><a href=\"https:\/\/www.cancerimagingarchive.net\/publications\/\" class=\"external-link\" rel=\"nofollow\">a list of publications<\/a><span> which leverage TCIA data. 
<\/span> If you have a manuscript you'd like to add please<a href=\"http:\/\/www.cancerimagingarchive.net\/support\/\" class=\"external-link\" rel=\"nofollow\"> contact the TCIA Helpdesk<\/a>.<\/p><\/div><div class=\"tabs-pane \" id=\"8096977715ad9b8139f049779cc36917a41a4198\" name=\"Versions\" ><h3 id=\"ADICOMdatasetforevaluationofmedicalimagedeidentification(PseudoPHIDICOMData)-Version2(Current):Updated2021\/04\/07\">Version 2 (Current): Updated 2021\/04\/07<\/h3><div class=\"table-wrap\"><table class=\"wrapped confluenceTable\"><colgroup><col\/><col\/><\/colgroup><tbody><tr><th class=\"confluenceTh\"><span>Data Type<\/span><\/th><th class=\"confluenceTh\"><span>Download all or Query\/Filter<\/span><\/th><\/tr><tr><td class=\"confluenceTd\"><p>Images,\u00a0 (DICOM, 609 MB)<\/p><p>Evaluation dataset<\/p><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p><br\/>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/wiki.cancerimagingarchive.net\/download\/attachments\/80969777\/Pseudo-Phi-DICOM%20Evaluation%20dataset%20April%207%202021.tcia?api=v2\" rel=\"nofollow\"><button class=\"tcia-btn tcia-download-color\"><i class=\"fa fa-cloud-download\" \/> Download<\/button><\/a>\u00a0\n<\/span>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/nbia.cancerimagingarchive.net\/nbia-search\/?MinNumberOfStudiesCriteria=1&amp;PatientCriteria=6670427471,9189822998,9894340694,8989193730,8155012288,571403367,292821506,339833062,3642991663,6774825273,8732322741,7255997752,7361647728,6451050561,292821506,8834647487,6774825273,6451050561,8548156246,4025360156,6614238035,9894340694,6415974217,3209648408,9894340694,8189244869\" class=\"external-link\" rel=\"nofollow\"><button class=\"tcia-btn tcia-search-color\"><i class=\"fa fa-search\" \/> Search<\/button><\/a>\u00a0\n<\/span><br\/><\/p><p><span style=\"color: rgb(33,37,41);text-decoration: 
none;\">(Download requires\u00a0<\/span><span style=\"color: rgb(33,37,41);text-decoration: none;\">the<span>\u00a0<\/span><\/span><a rel=\"nofollow\" href=\"https:\/\/wiki.cancerimagingarchive.net\/display\/NBIA\/Downloading+TCIA+Images\" style=\"text-decoration: none;text-align: left;\">NBIA Data Retriever<\/a>)<\/p><\/div><\/td><\/tr><tr><td class=\"confluenceTd\"><p>Images,\u00a0 (DICOM, 606 MB)<\/p><p>De-identified Evaluation dataset<\/p><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p><br\/>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/wiki.cancerimagingarchive.net\/download\/attachments\/80969777\/Pseudo-PHI-DICOM%20De-id%20Evaluation%20dataset%20April%207%202021.tcia?api=v2\" rel=\"nofollow\"><button class=\"tcia-btn tcia-download-color\"><i class=\"fa fa-cloud-download\" \/> Download<\/button><\/a>\u00a0\n<\/span>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/nbia.cancerimagingarchive.net\/nbia-search\/?MinNumberOfStudiesCriteria=1&amp;PatientCriteria=Pseudo-PHI-005,Pseudo-PHI-015,Pseudo-PHI-019,Pseudo-PHI-001,Pseudo-PHI-010,Pseudo-PHI-014,Pseudo-PHI-018,Pseudo-PHI-002,Pseudo-PHI-013,Pseudo-PHI-012,Pseudo-PHI-020,Pseudo-PHI-011,Pseudo-PHI-006,Pseudo-PHI-011,Pseudo-PHI-016,Pseudo-PHI-008,Pseudo-PHI-017,Pseudo-PHI-007,Pseudo-PHI-021,Pseudo-PHI-001,Pseudo-PHI-003,Pseudo-PHI-009,Pseudo-PHI-008,Pseudo-PHI-021,Pseudo-PHI-021,Pseudo-PHI-004\" class=\"external-link\" rel=\"nofollow\"><button class=\"tcia-btn tcia-search-color\"><i class=\"fa fa-search\" \/> Search<\/button><\/a>\u00a0\n<\/span><br\/><\/p><p><span style=\"color: rgb(33,37,41);text-decoration: none;\">(Download requires\u00a0<\/span><span style=\"color: rgb(33,37,41);text-decoration: none;\">the<span>\u00a0<\/span><\/span><a href=\"https:\/\/wiki.cancerimagingarchive.net\/display\/NBIA\/Downloading+TCIA+Images\" rel=\"nofollow\" style=\"text-decoration: 
none;text-align: left;\">NBIA Data Retriever<\/a>)<\/p><\/div><\/td><\/tr><tr><td class=\"confluenceTd\"><p>Patient Mapping (csv)<\/p><p><span style=\"color: rgb(33,33,33);\">Evaluation\/De-identified<\/span><\/p><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p><br\/>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/wiki.cancerimagingarchive.net\/download\/attachments\/80969777\/Pseudo-PHI-DICOM-Dataset%20patid_crosswalk.csv?api=v2\" rel=\"nofollow\"><button class=\"tcia-btn tcia-download-color\"><i class=\"fa fa-cloud-download\" \/> Download<\/button><\/a>\u00a0\n<\/span><\/p><\/div><\/td><\/tr><tr><td class=\"confluenceTd\"><p>UID Mapping (csv)<\/p><p><span style=\"color: rgb(33,33,33);\">Evaluation\/De-identified<\/span><\/p><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p><br\/>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/wiki.cancerimagingarchive.net\/download\/attachments\/80969777\/Pseudo-PHI-DICOM-Dataset%20uid_crosswalk.csv?api=v2\" rel=\"nofollow\"><button class=\"tcia-btn tcia-download-color\"><i class=\"fa fa-cloud-download\" \/> Download<\/button><\/a>\u00a0\n<\/span><\/p><\/div><\/td><\/tr><\/tbody><\/table><\/div><p>Note: Removed head imaging from 8 series.<\/p><h3 id=\"ADICOMdatasetforevaluationofmedicalimagedeidentification(PseudoPHIDICOMData)-Version1:Updated2021\/01\/31\">Version 1: Updated 2021\/01\/31<\/h3><div class=\"table-wrap\"><table class=\"wrapped fixed-table confluenceTable\"><colgroup><col style=\"width: 210.0px;\"\/><col style=\"width: 663.0px;\"\/><\/colgroup><tbody><tr><th class=\"confluenceTh\"><span>Data Type<\/span><\/th><th class=\"confluenceTh\"><span>Download all or Query\/Filter<\/span><\/th><\/tr><tr><td class=\"confluenceTd\"><p>Images,\u00a0 (DICOM, 653 MB)<\/p><p>Evaluation dataset<\/p><\/td><td class=\"confluenceTd\"><div 
class=\"content-wrapper\"><p><br\/>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/nbia.cancerimagingarchive.net\/nbia-search\/?MinNumberOfStudiesCriteria=1&amp;PatientCriteria=6670427471,9189822998,9894340694,8989193730,8155012288,571403367,292821506,339833062,3642991663,6774825273,8732322741,7255997752,7361647728,6451050561,292821506,8834647487,6774825273,6451050561,8548156246,4025360156,6614238035,9894340694,6415974217,3209648408,9894340694,8189244869\" class=\"external-link\" rel=\"nofollow\"><button class=\"tcia-btn tcia-search-color\"><i class=\"fa fa-search\" \/> Search<\/button><\/a>\u00a0\n<\/span><br\/><\/p><p><span style=\"color: rgb(33,37,41);text-decoration: none;\">(Download requires\u00a0<\/span><span style=\"color: rgb(33,37,41);text-decoration: none;\">the<span>\u00a0<\/span><\/span><a rel=\"nofollow\" style=\"text-decoration: none;text-align: left;\" href=\"https:\/\/wiki.cancerimagingarchive.net\/display\/NBIA\/Downloading+TCIA+Images\">NBIA Data Retriever<\/a>)<\/p><\/div><\/td><\/tr><tr><td class=\"confluenceTd\"><p>Images,\u00a0 (DICOM, 648 MB)<\/p><p>De-identified Evaluation dataset<\/p><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p><br\/>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/nbia.cancerimagingarchive.net\/nbia-search\/?MinNumberOfStudiesCriteria=1&amp;PatientCriteria=Pseudo-PHI-005,Pseudo-PHI-015,Pseudo-PHI-019,Pseudo-PHI-001,Pseudo-PHI-010,Pseudo-PHI-014,Pseudo-PHI-018,Pseudo-PHI-002,Pseudo-PHI-013,Pseudo-PHI-012,Pseudo-PHI-020,Pseudo-PHI-011,Pseudo-PHI-006,Pseudo-PHI-011,Pseudo-PHI-016,Pseudo-PHI-008,Pseudo-PHI-017,Pseudo-PHI-007,Pseudo-PHI-021,Pseudo-PHI-001,Pseudo-PHI-003,Pseudo-PHI-009,Pseudo-PHI-008,Pseudo-PHI-021,Pseudo-PHI-021,Pseudo-PHI-004\" class=\"external-link\" rel=\"nofollow\"><button class=\"tcia-btn tcia-search-color\"><i class=\"fa fa-search\" \/> 
Search<\/button><\/a>\u00a0\n<\/span><br\/><\/p><p><span style=\"color: rgb(33,37,41);text-decoration: none;\">(Download requires\u00a0<\/span><span style=\"color: rgb(33,37,41);text-decoration: none;\">the<span>\u00a0<\/span><\/span><a href=\"https:\/\/wiki.cancerimagingarchive.net\/display\/NBIA\/Downloading+TCIA+Images\" style=\"text-decoration: none;text-align: left;\" rel=\"nofollow\">NBIA Data Retriever<\/a>)<\/p><\/div><\/td><\/tr><tr><td class=\"confluenceTd\"><p>Patient Mapping (csv)<\/p><p><span style=\"color: rgb(33,33,33);\">Evaluation\/De-identified<\/span><\/p><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p><br\/>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/wiki.cancerimagingarchive.net\/download\/attachments\/80969777\/Pseudo-PHI-DICOM-Dataset%20patid_crosswalk.csv?api=v2\" rel=\"nofollow\"><button class=\"tcia-btn tcia-download-color\"><i class=\"fa fa-cloud-download\" \/> Download<\/button><\/a>\u00a0\n<\/span><\/p><\/div><\/td><\/tr><tr><td class=\"confluenceTd\"><p>UID Mapping (csv)<\/p><p><span style=\"color: rgb(33,33,33);\">Evaluation\/De-identified<\/span><\/p><\/td><td class=\"confluenceTd\"><div class=\"content-wrapper\"><p><br\/>\n\n<span class=\"confluence-embedded-file-wrapper confluence-embedded-manual-size\">\n   <a href=\"https:\/\/wiki.cancerimagingarchive.net\/download\/attachments\/80969777\/Pseudo-PHI-DICOM-Dataset%20uid_crosswalk.csv?api=v2\" rel=\"nofollow\"><button class=\"tcia-btn tcia-download-color\"><i class=\"fa fa-cloud-download\" \/> Download<\/button><\/a>\u00a0\n<\/span><\/p><\/div><\/td><\/tr><\/tbody><\/table><\/div><\/div><\/div><\/div><\/div><p><br\/><\/p><p><br\/><\/p>","versions":false,"additional_resources":"The NCI Cancer Research Data Commons (CRDC) provides access to additional data and a cloud-based data science infrastructure that connects data sets with analytics tools to allow users to share, integrate, analyze, and 
visualize cancer research data.\n<ul><li><a href=\"https:\/\/portal.imaging.datacommons.cancer.gov\/explore\/filters\/?collection_id=pseudo_phi_dicom_data\">Imaging Data Commons (IDC)<\/a>\u00a0(Imaging Data)<\/li><\/ul>","cancer_locations":["Various"],"collection_page_accessibility":"Public","publications_related":"","version_change_log":"","version_change_log_archived":"","analysis_results":"","collection_status":"Complete","publications_using":"TCIA maintains\u00a0<a href=\"https:\/\/www.cancerimagingarchive.net\/publications\/\">a list of publications<\/a> that leverage TCIA data. If you have a manuscript you'd like to add, please <a href=\"http:\/\/www.cancerimagingarchive.net\/support\/\">contact the TCIA Helpdesk<\/a>.","species":["Human"],"collection_title":"A DICOM dataset for evaluation of medical image de-identification","detailed_description":"There are 21 patients, 22 studies, and 26 series, but the patient IDs, Study Instance UIDs, and Series Instance UIDs differ between the two datasets, resulting in a double count.","related_analysis_results":false,"subjects":"21","collection_short_title":"Pseudo-PHI-DICOM-Data","data_types":["CR","CT","DX","MG","MR","PT"],"date_updated":"2023-09-13","collection_browse_title":"","supporting_data":false,"collection_featured_image":false,"collection_summary":"Open access or shared research data must comply with (HIPAA) patient privacy regulations. These regulations require the de-identification of datasets before they can be placed in the public domain.\u00a0 The process of image de-identification is time consuming, requires significant human resources, and is prone to human error. \u00a0Automated image de-identification algorithms have been developed but the research community requires some method of evaluation before such tools can be widely accepted.\u00a0 This evaluation requires a robust dataset that can be used as part of an evaluation process for de-identification algorithms. 
\u00a0<p>We developed a DICOM dataset that can be used to evaluate the performance of de-identification algorithms.\u00a0DICOM image information objects were selected from datasets published in TCIA.\u00a0 Synthetic Protected Health Information (PHI) was generated and inserted into selected DICOM data elements to mimic typical clinical imaging exams.\u00a0 The evaluation dataset was de-identified by a TCIA curation team using standard TCIA tools and procedures. We are publishing the evaluation dataset (containing synthetic PHI) and the de-identified evaluation dataset (result of TCIA curation) in advance of a potential competition, sponsored by the National Cancer Institute (NCI), for the evaluation of de-identification algorithms and the de-identification of medical image datasets. The evaluation dataset published here is a subset of a larger evaluation dataset that was created under contract for the National Cancer Institute. This subset is being published to allow researchers to test their de-identification algorithms and to promote standardized procedures for validating automated de-identification.<\/p>","collection_acknowledgements":"We would like to acknowledge the National Cancer Institute for funding and actively participating in the project that generated the evaluation datasets being published here, and the TCIA curation team, led by Ms. 
Geri Blake, who curated this data.\u00a0 Original data came from multiple institutions and multiple TCIA image collections.","collection_funding":"","hide_from_browse_table":[],"_links":{"self":[{"href":"https:\/\/cm.vastapps.dev\/api\/v1\/collections\/5526"}],"collection":[{"href":"https:\/\/cm.vastapps.dev\/api\/v1\/collections"}],"about":[{"href":"https:\/\/cm.vastapps.dev\/api\/wp\/v2\/types\/tcia_collection"}],"wp:attachment":[{"href":"https:\/\/cm.vastapps.dev\/api\/wp\/v2\/media?parent=5526"}],"wp:term":[{"taxonomy":"tcia_citation_tax","embeddable":true,"href":"https:\/\/cm.vastapps.dev\/api\/v1\/citation-tax?post=5526"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}