Article code: 2831796
Journal code: 1163818
Publication year: 2011
English article: 8 pages, PDF
Full-text version: free download
English title of the ISI article
A bioinformatics pipeline to build a knowledge database for in silico antibody engineering
Related topics
Life Sciences and Biotechnology; Biochemistry, Genetics and Molecular Biology; Molecular Biology
English abstract

A challenge to antibody engineering is the large number of positions and nature of variation and opposing concerns of introducing unfavorable biochemical properties. While large libraries are quite successful in identifying antibodies with improved binding or activity, still only a fraction of possibilities can be explored and that would require considerable effort. The vast array of natural antibody sequences provides a potential wealth of information on (1) selecting hotspots for variation, and (2) designing mutants to mimic natural variations seen in hotspots.

The human immune system can generate an enormous diversity of immunoglobulins against an almost unlimited range of antigens by gene rearrangement of a limited number of germline variable, diversity and joining genes followed by somatic hypermutation and antigen selection. All the antibody sequences in NCBI database can be assigned to different germline genes. As a result, a position specific scoring matrix for each germline gene can be constructed by aligning all its member sequences and calculating the amino acid frequencies for each position. The position specific scoring matrix for each germline gene characterizes "hotspots" and the nature of variations, and thus reduces the sequence space of exploration in antibody engineering.

We have developed a bioinformatics pipeline to conduct analysis of human antibody sequences, and generated a comprehensive knowledge database for in silico antibody engineering. The pipeline is fully automatic and the knowledge database can be refreshed anytime by re-running the pipeline. The refresh process is fast, typically taking 1 min on a Lenovo ThinkPad T60 laptop with 3G memory.

Our knowledge database consists of (1) the individual germline gene usage in generation of natural antibodies; (2) the CDR length distributions; and (3) the position specific scoring matrix for each germline gene. The knowledge database provides comprehensive support for antibody engineering, including de novo library design in selection of favorable germline V gene scaffolds and CDR lengths. In addition, we have also developed a web application framework to present our knowledge database, and the web interface can help people to easily retrieve a variety of information from the knowledge database.
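
The abstract describes building a position specific scoring matrix by aligning the member sequences of a germline gene and computing amino acid frequencies at each position. The following is a minimal Python sketch of that idea, not the authors' pipeline. The function names, the toy sequences, the gap handling, and the 0.8 threshold used to flag a "hotspot" are all assumptions made for illustration.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def position_frequencies(aligned_seqs):
    """Return one {amino_acid: frequency} dict per alignment column.

    Assumes the sequences are already aligned to equal length; any
    character outside the 20 standard amino acids (e.g. a gap '-')
    is ignored when counting.
    """
    if not aligned_seqs:
        raise ValueError("need at least one sequence")
    length = len(aligned_seqs[0])
    if any(len(s) != length for s in aligned_seqs):
        raise ValueError("sequences must be aligned to equal length")

    matrix = []
    for col in range(length):
        counts = Counter(s[col] for s in aligned_seqs if s[col] in AMINO_ACIDS)
        total = sum(counts.values())
        # An all-gap column yields an empty dict.
        matrix.append({aa: n / total for aa, n in counts.items()} if total else {})
    return matrix


def hotspots(matrix, max_major_freq=0.8):
    """Return 0-based columns whose most common residue is below the threshold.

    The threshold is a hypothetical choice for this sketch; the source
    does not state how hotspots are called.
    """
    return [i for i, col in enumerate(matrix)
            if col and max(col.values()) < max_major_freq]


# Toy, made-up member sequences for a single germline gene (not real data).
members = ["QVQLVQSG", "QVQLVESG", "EVQLVESG", "QVQLVESG"]
pssm = position_frequencies(members)
variable_positions = hotspots(pssm)
print(variable_positions)
```

With these toy sequences, only the first and sixth columns vary, so they are the ones flagged. A real analysis would need a proper multiple alignment and a consistent numbering scheme, neither of which is attempted here.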

Publisher
Database: Elsevier - ScienceDirect
Journal: Molecular Immunology - Volume 48, Issue 8, April 2011, Pages 1019–1026