کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
494579 862799 2016 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Gene discretization based on EM clustering and adaptive sequential forward gene selection for molecular classification
ترجمه فارسی عنوان
گسسته سازی ژن در خوشه بندی EM و انتخاب ژن رو به جلو متوالی تطبیقی برای طبقه بندی بر اساس مولکولی
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی


• Boost gene discrimination capability by feature discretization with EM clustering.
• Explore subsets of informative genes by an adaptive sequential forward search algorithm.
• Cancer classification based solely on the discretized gene expression monitoring.
• Predict distinction between multiple subclasses without previous biological knowledge.

The mismatch in gene dimension as opposed to sample dimension poses a great challenge for many modelling problems in bioinformatics. Feature selection in immense quantities of high-dimensional data for molecular classification renews the tasks to the modern data mining techniques. The advent of microarray datasets pushed research in bioinformatics to a new boundary in the last decade. Many bioinformatics applications necessiate feature selection or dimensionality reduction techniques for identifying informative genes or selecting subset of genes with discrimination power. Here, gene discretization based on EM clustering for complexity simplification and better discrimination capability is employed. Then, an adaptive sequential forward search algorithm for the exploration of distinct subsets of genes with discrimination power is proposed. By monitoring the information gain acquired from a collection of selected features, we are able to predict distinction between multiple subclasses without previous knowledge of these subclasses. Experimental results demonstrate the feasibility of cancer classification based solely on the discretized gene expression monitoring, completely independent of previous biological knowledge.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Applied Soft Computing - Volume 48, November 2016, Pages 683–690
نویسندگان
,