کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
527351 869315 2015 14 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A framework for live and cross platform fingerspelling recognition using modified shape matrix variants on depth silhouettes
ترجمه فارسی عنوان
چارچوبی برای شناسایی انگشتان پا و زنده با استفاده از فرمهای ماتریس شکل اصلاح شده در شبح عمق
کلمات کلیدی
عمق سنجش، ماتریس شکل، خط مرجع، محور اصلی، تشخیص انگشتی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
چکیده انگلیسی


• Live fingerspelling recognition using modified interpretation of shape matrix.
• Cross platform solution for ISL, ASL, NTU datasets.
• Region, contour, and depth based shape matrix variants.
• Recognizes both one and two handed postures.

Automatic recognition of fingerspelling postures in a live environment is a challenging task primarily due to the complex computation of popular moment-based and spectral descriptors. Shape matrix offers a time-efficient alternative that samples the shape region through the intersection points of adjacent log-polar sections. However, sparse sampling of the region by discrete log-polar intersection points cannot capture salience of the shape. This manuscript proposes modified forms of the shape matrix which can capture salience of the fingerspelling postures by the precise sampling of contours and regions. For effective segmentation and subsequent description, hand postures are acquired through the depth sensor. Proposed shape matrix variants are evaluated for fingerspelling recognition with one-handed and two-handed postures. Experiments are rigorously performed on three datasets including one-handed signs of American Sign Language (ASL), NTU hand digits, and both one-handed and two-handed signs of Indian Sign Language (ISL). Proposed shape matrix variants supersede the benchmark shape context and Gabor features by obtaining 94.15% accuracy on ISL dataset with minimum mean running time of 0.029 s. On ASL and NTU datasets, 91.86% and 95.11% accuracies are obtained with 0.0172 and 0.0483 s mean running times, respectively.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Vision and Image Understanding - Volume 141, December 2015, Pages 138–151
نویسندگان
, ,