Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
517115 | Journal of Biomedical Informatics | 2014 | 11 Pages |
•Definition and a novel mining approach for characteristic phenotypes.•Class association rule mining algorithm on labelled data.•Automatic and human validation of the algorithm on skeletal dysplasia data.•Discussion on standard vs. class association rule mining algorithms.
Finding, capturing and describing characteristic features represents a key aspect in disorder definition, diagnosis and management. This process is particularly challenging in the case of rare disorders, due to the sparse nature of data and expertise. From a computational perspective, finding characteristic features is associated with some additional major challenges, such as formulating a computationally tractable definition, devising appropriate inference algorithms or defining sound validation mechanisms. In this paper we aim to deal with each of these problems in the context provided by the skeletal dysplasia domain. We propose a clear definition for characteristic phenotypes, we experiment with a novel, class association rule mining algorithm and we discuss our lessons learned from both an automatic and human-based validation of our approach.
Graphical abstractFigure optionsDownload full-size imageDownload high-quality image (101 K)Download as PowerPoint slide