کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
10322220 660850 2015 14 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A new Histogram Oriented Moments descriptor for multi-oriented moving text detection in video
ترجمه فارسی عنوان
یک توصیفگر هیستوگرام جهت لحظه ای جدید برای تشخیص متن متحرک چند گانه در ویدیو
کلمات کلیدی
لحظات مرکزی، هیستوگرام لحظات پایدار، گرادیان گرادیان هیستوگرام، تشخیص متن ویدئو، جریان نوری، حرکت تشخیص متن عنوان،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
Developing an expert text detection system for video indexing and retrieving is a challenging task due to low resolution, complex background, non-illumination and movement of text present in a video. Besides, text detection is vital for several real time applications, such as license plate recognition, assisting a blind person and other surveillance applications. In this paper, we introduce a new descriptor called Histogram Oriented Moments (HOM) for text detection in video, which is invariant to rotation, scaling, font, and font size variations. The HOM finds orientations with the second order geometrical moments for each sliding window (overlapped block) of the input frame. The proposed method performs histogram operations on the orientations of each window to identify the dominant orientation (as a representative). Then, a new hypothesis is defined based on the dominant orientations of a connected component as the numbers of orientations, which point towards centroid of the connected components are larger than the number of dominant orientations which point away from the centroid of the connected components. The components that satisfy the above hypothesis are considered as text candidates, or else as non-text candidates. Further, to detect a moving text- we explore optical flow properties, such as velocity of text candidates to estimate the motions between temporal frames. The components which move with constant velocity and uniform direction are considered as text candidates otherwise non-text candidates. We demonstrate the proposed method's dominance over state of the art methods by testing on benchmark database, namely ICDAR 2013 and our own video datasets in terms of recall, precision and F-measure.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 42, Issue 21, 30 November 2015, Pages 7627-7640
نویسندگان
, , ,