Article ID Journal Published Year Pages File Type
5675508 Virus Research 2017 16 Pages PDF
Abstract

•Statistical methods for local diversity estimation in virus populations, as well as computational approaches for global reconstruction of viral haplotypes are described.•Strategies for read mapping are briefly described, as well as limitations of current aligners.•We describe experimental protocols developed to overcome limitations associated with short and error prone reads.

Rapidly evolving RNA viruses prevail within a host as a collection of closely related variants, referred to as viral quasispecies. Advances in high-throughput sequencing (HTS) technologies have facilitated the assessment of the genetic diversity of such virus populations at an unprecedented level of detail. However, analysis of HTS data from virus populations is challenging due to short, error-prone reads. In order to account for uncertainties originating from these limitations, several computational and statistical methods have been developed for studying the genetic heterogeneity of virus population. Here, we review methods for the analysis of HTS reads, including approaches to local diversity estimation and global haplotype reconstruction. Challenges posed by aligning reads, as well as the impact of reference biases on diversity estimates are also discussed. In addition, we address some of the experimental approaches designed to improve the biological signal-to-noise ratio. In the future, computational methods for the analysis of heterogeneous virus populations are likely to continue being complemented by technological developments.

Related Topics
Life Sciences Immunology and Microbiology Virology
Authors
, , ,