An efficient similarity search based on indexing in large DNA databases

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
15345	1405	2010	6 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Approximate string matching - تطابق رشته تقریبی Similarity search - جستجوی مشابهی DNA sequence - دنباله DNA Indexing - نمایه سازی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی شیمی بیو مهندسی (مهندسی زیستی)

پیش نمایش صفحه اول مقاله

An efficient similarity search based on indexing in large DNA databases

چکیده انگلیسی

Index-based search algorithms are an important part of a genomic search, and how to construct indices is the key to an index-based search algorithm to compute similarities between two DNA sequences. In this paper, we propose an efficient query processing method that uses special transformations to construct an index. It uses small storage and it rapidly finds the similarity between two sequences in a DNA sequence database. At first, a sequence is partitioned into equal length windows. We select the likely subsequences by computing Hamming distance to query sequence. The algorithm then transforms the subsequences in each window into a multidimensional vector space by indexing the frequencies of the characters, including the positional information of the characters in the subsequences. The result of our experiments shows that the algorithm has faster run time than other heuristic algorithms based on index structure. Also, the algorithm is as accurate as those heuristic algorithms.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computational Biology and Chemistry - Volume 34, Issue 2, April 2010, Pages 131–136

نویسندگان

In-Seon Jeong, Kyoung-Wook Park, Seung-Ho Kang, Hyeong-Seok Lim,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

An efficient similarity search based on indexing in large DNA databases

دسترسی سریع

ارتباط

English Website