APIII - Advancing Practice, Instruction & Innovation Through Informatics

Marriott City Center, Pittsburgh, PA | September 20 - 23, 2009

Oncologic Pathology in Biomedical Terminologies: Challenges for Data Integration

Anita, Burgun MD; Rennes University; Olivier, Bodenreider MD; National Library of Medicine;

Content:

The domain of oncologic pathology is represented in several biomedical terminologies developed for epidemiology (e.g., ICD-O), clinical practice (e.g., SNOMED CT  SNCT) and research (e.g., NCI Thesaurus  NCIt). The two major dimensions in the description of neoplasms are topography and morphology. These two dimensions are present in ICD-O and SNCT. The neoplasms themselves are represented in SNCT and NCIt as pre-coordinated terms, but not in ICD-O. The anatomy component (topography) is present in all three terminologies..

Technology:

ICD-O, SNCT and NCIt are all integrated in the NCI Metathesaurus and pairwise mappings have been created between some of these terminologies.

Design:

The objective of this paper is to explore the degree to which the representations provided by the three terminologies are consistent. We present examples of inconsistencies collected from the three terminologies. The consequences of such inconsistencies on data integration are discussed.

Results:

Ovarian gynandroblastoma (OG) is represented in SNCT with links to the morphology Gynandroblastoma and the topography Ovary. In ICD-O, Ovary and Gynandroblastoma are represented explicitly, but OG is present only through the coordination of the topography and morphology concepts. In NCIt, OG and Ovary are present, but not the morphology concept. However, NCIt provides a mapping between OG and the morphology concept Gynandroblastoma in ICD-O. The concepts for OG (disorder) and Gynandroblastoma (morphology) are distinct in the NCI Metathesaurus. In contrast, the representation of Renal cell carcinoma (RCC) is more problematic as the Metathesaurus rearranges the morphology and disease terms for RCC in a way that is inconsistent with the original organization in SNCT (e.g., the terms renal cell carcinoma (morphologic abnormality) and adenocarcinoma of kidney are in the same concept C0007134). Moreover, the topology for RCC (Structure of parenchyma of kidney) in SNCT is finer grained than in ICD-O and NCIt (Kidney).

Conclusion:

Aggregating coded data from epidemiology, clinical and research sources through integrated terminologies remains challenging due to differences in the representation of oncologic pathology in these terminologies and terminology integration resources, such as the Metathesaurus. However, the existence of shared codes between ICD-O and SNCT and mappings between NCIt and ICD-O can help assess consistency and suggest corrections.

Search