| Article ID | Journal | Published Year | Pages | File Type | 
|---|---|---|---|---|
| 517111 | Journal of Biomedical Informatics | 2014 | 10 Pages | 
•We focus on latent clinical factors that can only be indirectly observed.•We propose a methodology of developing BNs that reason with latent variables.•A series of expert reviews reveal the relation between data and latent variables.•The method is illustrated by a medical case study on trauma care.•The case study displays significant predictive improvements from the expert reviews.
Many medical conditions are only indirectly observed through symptoms and tests. Developing predictive models for such conditions is challenging since they can be thought of as ‘latent’ variables. They are not present in the data and often get confused with measurements. As a result, building a model that fits data well is not the same as making a prediction that is useful for decision makers. In this paper, we present a methodology for developing Bayesian network (BN) models that predict and reason with latent variables, using a combination of expert knowledge and available data. The method is illustrated by a case study into the prediction of acute traumatic coagulopathy (ATC), a disorder of blood clotting that significantly increases the risk of death following traumatic injuries. There are several measurements for ATC and previous models have predicted one of these measurements instead of the state of ATC itself. Our case study illustrates the advantages of models that distinguish between an underlying latent condition and its measurements, and of a continuing dialogue between the modeller and the domain experts as the model is developed using knowledge as well as data.
Graphical abstractFigure optionsDownload full-size imageDownload high-quality image (132 K)Download as PowerPoint slide
