Skip to main content

HUNGARIAN-COLORECTAL-SCREENING

Hungarian-Colorectal-Screening | Digital pathological slides from Hungarian colorectal cancer screening

DOI: 10.7937/TCIA.9CJF-0127 | Page Accessibility: Public | Collection

Collection Snapshot
Size Status Updated
784GB Public, 2023/09/13

Acknowledgements

The research was financed by the Thematic Excellence Programme (Tématerületi Kiválósági Program, 2020-4.1.1.-TKP2020) of the Ministry for Innovation and Technology in Hungary, within the framework of the DigitalBiomarker thematic programme of the Semmelweis University. This work was supported by the National Research, Development and Innovation Office of Hungary grants OTKA 128881 and K128780, the National Quantum Technologies Program and the Hungarian Artificial Intelligence National Laboratory.

Data Access

Title Data Type Format Access Points Studies Series Images License
Tissue Slide Images Pathology MRXS 200 CC BY 4.0
Clinical data CSV CC BY 4.0

Detailed Description

Note about the data: 

From the article, these data include hematoxylin- and eosin- (H&E) stained whole slide imaging (WSI), with resolution 0.1213 μm/pixel, as acquired by 3DHistech Pannoramic 1000 Digital Slide Scanner. 

ICD10 explanations
The supporting csv metadata contains ICD10 codes for each slide. Below are some helpful links about this standard and the differences you might see depending on if you use the international ICD10 codes (which are coded with 4 characters), the Hungarian ICD10 codes (which are coded with 5 characters), or the Institute of 2.Pathology and 1.Pathology at the Semmelweis University which use an extended version of ICD10 and has 6 characters. 
Introduction to ICD10:
https://www.cdc.gov/nchs/data/dvs/icd10fct.pdf
https://datadictionary.nhs.uk/supporting_information/international_classification_of_diseases__icd_.html
https://icd.who.int/browse10/Content/statichtml/ICD10Volume2_en_2019.pdf
Code structure, restricted characters, and code length (note that the Hungarian variant does not use * and – ):
https://datadictionary.nhs.uk/data_elements/icd-10_code.html
https://www.cdc.gov/nchs/icd/icd10cm_pcs_background.htm
Further notes:
Some ICD10 codes in the list are shorter than 6 characters. These are older samples from when the institute used only 5 characters.

Citations & Data Usage Policy

Data Citation

Pataki, B. A., Olar, A., Ribli, D., Pesti, A., Kontsek, E., Gyongyosi, B., Bilecz, A., Kovács, T., Kovács, K. A., Zsofia, Kiss, A., Szócska, M., Pollner, P., & Csabai, I. (2021). Digital pathological slides from Hungarian (Europe) colorectal cancer screening (Version 2) [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/TCIA.9CJF-0127

Publication Citation

Pataki, B. Á., Olar, A., Ribli, D., Pesti, A., Kontsek, E., Gyöngyösi, B., Bilecz, Á., Kovács, T., Kovács, K. A., Kramer, Z., Kiss, A., Szócska, M., Pollner, P., & Csabai, I. (2022). HunCRC: annotated pathological slides to enhance deep learning applications in colorectal cancer screening. In Scientific Data (Vol. 9, Issue 1). https://doi.org/10.1038/s41597-022-01450-y

TCIA Citation

Clark, K., Vendt, B., Smith, K., Freymann, J., Kirby, J., Koppel, P., Moore, S., Phillips, S., Maffitt, D., Pringle, M., Tarbox, L., & Prior, F. (2013). The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository. In Journal of Digital Imaging (Vol. 26, Issue 6, pp. 1045–1057). Springer Science and Business Media LLC. https://doi.org/10.1007/s10278-013-9622-7

Other Publications Using This Data

TCIA maintains a list of publications which leverage TCIA data. If you have a manuscript you’d like to add please contact the TCIA Helpdesk.