Efficient index-based KNN join processing for high-dimensional data

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
552089	873174	2007	13 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

KNN High-dimensional data - داده های با ابعاد بزرگ Similarity join - پیوستن به شباهت

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر تعامل انسان و کامپیوتر

پیش نمایش صفحه اول مقاله

Efficient index-based KNN join processing for high-dimensional data

چکیده انگلیسی

In many advanced database applications (e.g., multimedia databases), data objects are transformed into high-dimensional points and manipulated in high-dimensional space. One of the most important but costly operations is the similarity join that combines similar points from multiple datasets. In this paper, we examine the problem of processing K-nearest neighbor similarity join (KNN join). KNN join between two datasets, R and S, returns for each point in R its K most similar points in S. We propose a new index-based KNN join approach using the iDistance as the underlying index structure. We first present its basic algorithm and then propose two different enhancements. In the first enhancement, we optimize the original KNN join algorithm by using approximation bounding cubes. In the second enhancement, we exploit the reduced dimensions of data space. We conducted an extensive experimental study using both synthetic and real datasets, and the results verify the performance advantage of our schemes over existing KNN join algorithms.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information and Software Technology - Volume 49, Issue 4, April 2007, Pages 332–344

نویسندگان

Cui Yu, Bin Cui, Shuguang Wang, Jianwen Su,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Efficient index-based KNN join processing for high-dimensional data

دسترسی سریع

ارتباط

English Website