Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
37224 | Trends in Biotechnology | 2011 | 9 Pages |
Abstract
Nearly 2200 genomes that encode around 6 million proteins have now been sequenced. Around 40% of these proteins are of unknown function, even when function is loosely and minimally defined as ‘belonging to a superfamily’. In addition to in silico methods, the swelling stream of high-throughput experimental data can give valuable clues for linking these unknowns with precise biological roles. The goal is to develop integrative data-mining platforms that allow the scientific community at large to access and utilize this rich source of experimental knowledge. To this end, we review recent advances in generating whole-genome experimental datasets, where this data can be accessed, and how it can be used to drive prediction of gene function.
Related Topics
Physical Sciences and Engineering
Chemical Engineering
Bioengineering
Authors
Crysten E. Blaby-Haas, Valérie de Crécy-Lagard,