کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
8408410 1545070 2017 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A random forest classifier for detecting rare variants in NGS data from viral populations
موضوعات مرتبط
علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی بیوتکنولوژی یا زیست‌فناوری
پیش نمایش صفحه اول مقاله
A random forest classifier for detecting rare variants in NGS data from viral populations
چکیده انگلیسی
We evaluate MultiRes on simulated and real viral population datasets, which consist of many low frequency variants, and compare it to the error detection methods used in correction tools known in the literature. MultiRes has 4 to 500 times less false positives k-mer predictions compared to other methods, essential for accurate estimation of viral population diversity and their de-novo assembly. It has high recall of the true k-mers, comparable to other error correction methods. MultiRes also has greater than 95% recall for detecting single nucleotide polymorphisms (SNPs) and fewer false positive SNPs, while detecting higher number of rare variants compared to other variant calling methods for viral populations. The software is available freely from the GitHub link https://github.com/raunaq-m/MultiRes.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computational and Structural Biotechnology Journal - Volume 15, 2017, Pages 388-395
نویسندگان
, , , ,