کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
537762 870871 2011 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
An efficient VLSI processor chip for variable block size integer motion estimation in H.264/AVC
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
An efficient VLSI processor chip for variable block size integer motion estimation in H.264/AVC
چکیده انگلیسی

Motion estimation (ME) is the most critical component of a video coding standard. H.264/AVC adopts the variable block size motion estimation (VBSME) to obtain excellent coding efficiency, but the high computational complexity makes design difficult. This paper presents an effective processor chip for integer motion estimation (IME) in H264/AVC based on the full-search block-matching algorithm (FSBMA). It uses architecture with a configurable 2D systolic array to obtain a high data reuse of search area. This systolic array supports a three-direction scan format in which only one row of pixels is changed between the two adjacent subblocks, thus reducing the memory accesses and saving clock cycles. A computing array of 64 PEs calculates the SAD of basic 4×4 subblocks and a modified Lagrangian cost is used as matching criterion to find the best 41 variable-size blocks by means of a tree pipeline parallel architecture. Finally, a mode decision module uses serial data flow to find the best mode by comparing the total minimum Lagrangian costs. The IME processor chip was designed in UMC 0.18 μm technology resulting in a circuit with only 32.3 k gates and 6 RAMs (total 59kBits on-chip memory). In typical working conditions (25 °C, 1.8 V), a clock frequency of 300 MHz can be estimated with a processing capacity for HDTV (1920×1088 @ 30 fps) and a search range of 32×32.


► Full search VLSI processor chip of Integer Motion Estimation in H.264/AVC for 1080HD real-time video.
► Configurable 2D systolic array with high data reuse for variable search area.
► Modified Lagragian cost for parallel architecture.
► Best mode and best motion vectors computed in a parallel pipeline architecture.
► Reduced area (32.3 k gates and 4.4 kBytes RAM) operating at 300 MHz

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Signal Processing: Image Communication - Volume 26, Issue 6, July 2011, Pages 289–303
نویسندگان
, ,