کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4966857 1449296 2017 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Finding patients using similarity measures in a rare diseases-oriented clinical data warehouse: Dr. Warehouse and the needle in the needle stack
ترجمه فارسی عنوان
پیدا کردن بیماران با استفاده از اقدامات تشابه در یک انبار داده های بالینی مبتنی بر بیماری: دکتر انبار و سوزن در سوزن پشته
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی


- In the rare diseases context, clinicians look for similar patients to find new cases.
- Narrative reports represent a valuable source for automated phenotyping.
- VSM has been successfully used to compute similarity on narrative reports.
- Adding TF-IDF features and polarity improves the VSM similarity measures.

ObjectiveIn the context of rare diseases, it may be helpful to detect patients with similar medical histories, diagnoses and outcomes from a large number of cases with automated methods. To reduce the time to find new cases, we developed a method to find similar patients given an index case leveraging data from the electronic health records.Materials and methodsWe used the clinical data warehouse of a children academic hospital in Paris, France (Necker-Enfants Malades), containing about 400,000 patients. Our model was based on a vector space model (VSM) to compute the similarity distance between an index patient and all the patients of the data warehouse. The dimensions of the VSM were built upon Unified Medical Language System concepts extracted from clinical narratives stored in the clinical data warehouse. The VSM was enhanced using three parameters: a pertinence score (TF-IDF of the concepts), the polarity of the concept (negated/not negated) and the minimum number of concepts in common. We evaluated this model by displaying the most similar patients for five different rare diseases: Lowe Syndrome (LOWE), Dystrophic Epidermolysis Bullosa (DEB), Activated PI3K delta Syndrome (APDS), Rett Syndrome (RETT) and Dowling Meara (EBS-DM), from the clinical data warehouse representing 18, 103, 21, 84 and 7 patients respectively.ResultsThe percentages of index patients returning at least one true positive similar patient in the Top30 similar patients were 94% for LOWE, 97% for DEB, 86% for APDS, 71% for EBS-DM and 99% for RETT. The mean number of patients with the exact same genetic diseases among the 30 returned patients was 51%.ConclusionThis tool offers new perspectives in a translational context to identify patients for genetic research. Moreover, when new molecular bases are discovered, our strategy will help to identify additional eligible patients for genetic screening.

247

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Biomedical Informatics - Volume 73, September 2017, Pages 51-61
نویسندگان
, , , , , , , , , , ,