کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
2814953 1159840 2016 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Application of Euclidean distance measurement and principal component analysis for gene identification
ترجمه فارسی عنوان
استفاده از اندازه گیری فاصله اقلیدس و تجزیه و تحلیل مولفه اصلی برای شناسایی ژن
موضوعات مرتبط
علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی ژنتیک
چکیده انگلیسی


• Gene is predicted based on similarity search approach using least Euclidean distance and PCA based distance method.
• Discrete Fourier transform technique is used for better and accurate representation of the signal.
• Binary mapping technique is considered for simplicity of computation.
• Principal component analysis method is used for dimensionality reduction of genes.
• ROC curve is used for performance analysis of the methods.

Gene systems are extremely complex, heterogeneous, and noisy in nature. Many statistical tools which are used to extract relevant feature from genes provide fuzzy and ambiguous information. High-dimensional gene expression database available in public domain usually contains thousands of genes. Efficient prediction method is demanding nowadays for accurate identification of such database. Euclidean distance measurement and principal component analysis methods are applied on such databases to identify the genes. In both methods, prediction algorithm is based on homology search approach. Digital Signal Processing technique along with statistical method is used for analysis of genes in both cases. A two-level decision logic is used for gene classification as healthy or cancerous. This binary logic minimizes the prediction error and improves prediction accuracy. Superiority of the method is judged by receiver operating characteristic curve.

Figure optionsDownload high-quality image (215 K)Download as PowerPoint slide

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Gene - Volume 583, Issue 2, 1 June 2016, Pages 112–120
نویسندگان
, ,