DATASET

Multiple Primary Tumour Validation Ontology

Collection: ECIS : European Cancer Information System 

Description

OWL ontology for validating the international rules for multiple-primary malignant tumours

Contact

Email
nicholas.nicholson (at) ec.europa.eu

Contributors

  • Carmen Martos
  • Francesco Giusti

How to cite

Martos, Carmen; Giusti, Francesco (2021): Multiple Primary Tumour Validation Ontology. European Commission, Joint Research Centre (JRC) [Dataset] PID: http://data.europa.eu/89h/2a110a2e-d1e2-439d-9b9f-7e2e5436cc49

Keywords

Cancer registries Data validation Multiple primary tumours OWL ontology

Data access

Multiple Prmary Tumour Validation Ontology v1
Download 
  • OWL ontology containing the axioms for determining multiple primary tumours according to the associated international rules. Primarily intended for validating European cancer-registry data for inclusion into the European Cancer Information System.

Publications

Publication
An Ontology to Model the International Rules for Multiple Primary Malignant Tumours in Cancer Registration
Nicholson, N., Giusti, F., Bettio, M., Negrao De Carvalho, R., Dimitrova, N., Dyba, T.A., Flego, M., Neamtiu, L., Randi, G. and Martos Jimenez, M.D.C., An Ontology to Model the International Rules for Multiple Primary Malignant Tumours in Cancer Registration, APPLIED SCIENCES-BASEL, ISSN 2076-3417 (online), 11 (16), 2021, p. 7233, JRC124149.
  • MDPI, BASEL, SWITZERLAND
Publication page 
  • Abstract

    Population-based cancer registry data provide a key epidemiological resource for monitoring cancer in defined populations. Validation of the data variables contributing to a common data set is necessary to remove statistical bias; the process is currently performed centrally. An ontology-based approach promises advantages in devolving the validation process to the registry level but the checks regarding multiple primary tumours have presented a hurdle. This work presents a solution by modelling the international rules for multiple primary cancers in description logic. Topography groupings described in the rules had to be further categorised in order to simplify the axioms. Description logic expressivity was constrained as far as possible for reasons of automatic reasoning performance. The axioms were consistently able to trap all the different types of scenarios signalling violation of the rules. Batch processing of many records were performed using the Web Ontology Language application programme interface. Performance issues were circumvented for large data sets using the software interface to perform the reasoning operations on the basis of the axioms encoded in the ontology. These results remove one remaining hurdle in developing a purely ontology-based solution for performing the European harmonised data-quality checks, with a number of inherent advantages including the formalisation and integration of the validation rules within the domain data model itself.

Additional information

Published by
European Commission, Joint Research Centre
Created date
2021-02-22
Modified date
2022-09-06
Issued date
2021-02-22
Language(s)
English
Data theme(s)
Health, Science and technology
Update frequency
irregular
Identifier
http://data.europa.eu/89h/2a110a2e-d1e2-439d-9b9f-7e2e5436cc49
Popularity
06 Dec 2022: 1 visits