Article code: 6452325
Journal code: 1417010
Publication year: 2016
English article: 11-page PDF
Full-text version: free download
English title of the ISI article
Refined Pichia pastoris reference genome sequence
Related topics
Engineering and Basic Sciences, Chemical Engineering, Bioengineering
English abstract


- The work leading to this paper identifies insertions, deletions, gaps, rearrangements and incorrectly annotated open reading frames in the previously published draft genome sequences.
- RNA-seq data from several P. pastoris strains were used to correctly predict intron splicing events.
- For the first time, the P. pastoris killer plasmid sequences and putative centromere sequences are reported.

Strains of the species Komagataella phaffii are the most frequently used “Pichia pastoris” strains employed for recombinant protein production as well as studies on peroxisome biogenesis, autophagy and secretory pathway analyses. Genome sequencing of several different P. pastoris strains has provided the foundation for understanding these cellular functions in recent genomics, transcriptomics and proteomics experiments. This experimentation has identified mistakes, gaps and incorrectly annotated open reading frames in the previously published draft genome sequences. Here, a refined reference genome is presented, generated with genome and transcriptome sequencing data from multiple P. pastoris strains. Twelve major sequence gaps from 20 to 6000 base pairs were closed and 5111 out of 5256 putative open reading frames were manually curated and confirmed by RNA-seq and published LC-MS/MS data, including the addition of new open reading frames (ORFs) and a reduction in the number of spliced genes from 797 to 571. One chromosomal fragment of 76 kbp between two previous gaps on chromosome 1 and another 134 kbp fragment at the end of chromosome 4, as well as several shorter fragments needed re-orientation. In total more than 500 positions in the genome have been corrected. This reference genome is presented with new chromosomal numbering, positioning ribosomal repeats at the distal ends of the four chromosomes, and includes predicted chromosomal centromeres as well as the sequence of two linear cytoplasmic plasmids of 13.1 and 9.5 kbp found in some strains of P. pastoris.

Publisher
Database: Elsevier - ScienceDirect
Journal: Journal of Biotechnology - Volume 235, 10 October 2016, Pages 121-131