The TCR cancer registry repository for annotating cancer data

Shin Bo Chen, Chien Yeh Hsu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Huge amounts of biomedical data are stored in various formats and accessed through numerous interfaces, which makes data integration and exchange a crucial task in cancer research. Data elements play an important role in data integration. The NCI supports a broad initiative to standardize the common data elements (CDEs) used in cancer research data capture and reporting. The Taiwan Cancer Registry (TCR), established in 1979, is organized and funded by the Health Department of the central government. The TCR's primary goal is to survey the incidence of cancer in Taiwan. The aim of the Taiwan Cancer Common Data Element Project (TCCDEP) is to facilitate convergence towards a common metadata standard in Taiwan cancer registry data. The project is implemented using a set of open source software and tools developed by the NCI, such as the caCORE SDK and caGrid. We report our experience of building, learning and using the open toolkit Cancer Data Standards Repository (caDSR), developed by the National Cancer Institute's Center for Bioinformatics (NCICB) in the USA. The caDSR is a metadata repository that includes the CDEs used by NCI-sponsored organizations. The objective of this work is to develop a database of metadata for medical data elements, referred to as the TCCDEP, and to establish a common classification of the data elements used in cancer registries. In this manuscript, we develop the common data elements using vocabulary standards, ontology and semantic modeling methodology. The CDEs include demographic data, clinical history, pathology data, and clinical outcome data including treatment, recurrence and vital status. These CDEs will be further extended to data sets across the participating cancer institutes to facilitate and supplement translational research. The Taiwan Cancer Registry (TCR) model and standard will be used as the basis for an electronic data standard repository of metadata or data descriptors.
The TCCDEP developed 40 data elements to annotate the cancer registry data collected. In this paper, we describe the process required to develop the model, the caDSR CDEs, and the results of the modeling effort. We also address the difficulties we encountered and the modifications made to resolve them. The caBIG(TM) grid project, the grid model of the Taiwan Cancer Registry (girdTCR), which uses the caCORE tools to define data elements for the cancer registry, has been presented to the caBIG(TM) UML model project. Currently, the Taiwan cancer registry CDEs are released and available in the CDE Browser for reuse. Furthermore, we will extend our CDEs to daily clinical practice and trials, and describe how the methods can be fully implemented in hospitals and cancer research centers in Taiwan.

Original language: English
Title of host publication: ICEMMS 2011 - Proceedings: 2011 2nd IEEE International Conference on Emergency Management and Management Sciences
Pages: 297-300
Number of pages: 4
DOIs: https://doi.org/10.1109/ICEMMS.2011.6015680
Publication status: Published - 2011
Externally published: Yes
Event: 2011 2nd IEEE International Conference on Emergency Management and Management Sciences, ICEMMS 2011 - Beijing, China
Duration: Aug 8 2011 to Aug 10 2011

Conference

Conference: 2011 2nd IEEE International Conference on Emergency Management and Management Sciences, ICEMMS 2011
Country: China
City: Beijing
Period: 8/8/11 to 8/10/11

Fingerprint

Cancer
Taiwan
Registry
Repository
Metadata
Data integration
Ontology
Pathology
Toolkit
Research center
Bioinformatics
Modeling
Health
Demographics
Data base
Modeling methodology
Grid
Central government
Open source software
Data exchange

Keywords

  • caBIG
  • caDSR
  • Cancer Registry
  • Common Data Elements

ASJC Scopus subject areas

  • Strategy and Management
  • Management Science and Operations Research

Cite this

Chen, S. B., & Hsu, C. Y. (2011). The TCR cancer registry repository for annotating cancer data. In ICEMMS 2011 - Proceedings: 2011 2nd IEEE International Conference on Emergency Management and Management Sciences (pp. 297-300). [6015680] https://doi.org/10.1109/ICEMMS.2011.6015680

@inproceedings{56c744aa1dfa459d88c0fb52ec099f0d,
title = "The TCR cancer registry repository for annotating cancer data",
keywords = "caBIG, caDSR, Cancer Registry, Common Data Elements",
author = "Chen, {Shin Bo} and Hsu, {Chien Yeh}",
year = "2011",
doi = "10.1109/ICEMMS.2011.6015680",
language = "English",
isbn = "9781424496631",
pages = "297--300",
booktitle = "ICEMMS 2011 - Proceedings: 2011 2nd IEEE International Conference on Emergency Management and Management Sciences",

}

TY - GEN

T1 - The TCR cancer registry repository for annotating cancer data

AU - Chen, Shin Bo

AU - Hsu, Chien Yeh

PY - 2011

Y1 - 2011

KW - caBIG

KW - caDSR

KW - Cancer Registry

KW - Common Data Elements

UR - http://www.scopus.com/inward/record.url?scp=80053323001&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=80053323001&partnerID=8YFLogxK

U2 - 10.1109/ICEMMS.2011.6015680

DO - 10.1109/ICEMMS.2011.6015680

M3 - Conference contribution

AN - SCOPUS:80053323001

SN - 9781424496631

SP - 297

EP - 300

BT - ICEMMS 2011 - Proceedings: 2011 2nd IEEE International Conference on Emergency Management and Management Sciences

ER -