Article code: 377957
Journal code: 658857
Publication year: 2009
English article: 7 pages, PDF
Full-text version: Free download
English title of the ISI article
Virtual genetic coding and time series analysis for alternative splicing prediction in C. elegans
Related topics
Engineering and Basic Sciences, Computer Engineering, Artificial Intelligence
Abstract

Summary

Motivation: Prediction of alternative splicing has traditionally been based on the study of expressed sequences, helped by homology considerations and the analysis of local discriminative features. More recently, machine learning algorithms have been developed that try to avoid or reduce the use of a priori information, with partial success.

Objective and method: With the aim of developing a fully automatic procedure for recognizing alternative splicing events based only on the genomic sequence, we first introduce a virtual genetic coding scheme to numerically model the information content of sequences in an effective way. We then use time series analysis to extract a fixed-length set of features from each sequence. Finally, we adopt a supervised learning method, namely the support vector machine, to predict alternative splicing events.

Results: On the basis of real C. elegans data, we show that it is possible within this purely numeric framework to obtain results better than the state of the art, without any explicit modeling of homology or positions in the splice site, and without any use of other local features.

Conclusion: The virtual genetic coding, together with time series analysis, allows us to introduce an effective and powerful sequence coding scheme that may be useful in various areas of genomics and transcriptomics.
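
The first two steps of the method (numeric coding of a sequence, then fixed-length feature extraction by time series analysis) can be illustrated with a minimal Python sketch. The abstract does not specify the virtual genetic coding scheme or the time series features used, so everything here is an assumption made for illustration: the `NUCLEOTIDE_CODE` mapping is hypothetical, and autocorrelation coefficients stand in for whatever features the authors actually extract. The support vector machine step is not shown.

```python
from statistics import fmean

# Hypothetical numeric code for each nucleotide. The paper's actual
# virtual genetic coding scheme is not given in the abstract.
NUCLEOTIDE_CODE = {"A": 1.0, "C": 2.0, "G": 3.0, "T": 4.0}


def encode(sequence):
    """Turn a DNA string into a numeric series (unknown symbols map to 0.0)."""
    return [NUCLEOTIDE_CODE.get(base, 0.0) for base in sequence.upper()]


def autocorrelation_features(series, max_lag=5):
    """Return autocorrelation coefficients for lags 1..max_lag.

    The output length depends only on max_lag, not on the series length,
    so sequences of different lengths yield feature vectors of equal size.
    This is an assumed stand-in for the paper's time series features.
    """
    n = len(series)
    if n == 0:
        return [0.0] * max_lag
    mean = fmean(series)
    centered = [x - mean for x in series]
    variance = sum(c * c for c in centered)
    features = []
    for lag in range(1, max_lag + 1):
        if variance == 0.0 or lag >= n:
            # Constant series or lag beyond the series: no usable signal.
            features.append(0.0)
            continue
        cov = sum(centered[i] * centered[i + lag] for i in range(n - lag))
        features.append(cov / variance)
    return features


# Two sequences of different lengths map to vectors of the same length.
short_features = autocorrelation_features(encode("ACGTACGTAC"))
long_features = autocorrelation_features(encode("ACGTTGCAACGTTGCAGGCTA"))
```

Feature vectors of this kind, paired with labels marking known alternative splicing events, could then be passed to any support vector machine implementation for training.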

Publisher
Database: Elsevier - ScienceDirect
Journal: Artificial Intelligence in Medicine - Volume 45, Issues 2–3, February–March 2009, Pages 109–115
Authors
, ,