کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
527695 869346 2014 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A visualization framework for team sports captured using multiple static cameras
ترجمه فارسی عنوان
یک چارچوب تجسم برای ورزش های گروهی با استفاده از دوربین های چند استاتیک گرفته شده است
کلمات کلیدی
استدلال ادراکی تجزیه و تحلیل ویدئو، تجسم ورزش، ردیابی چند دوربین همجوشی داده ها، بینش کامپیوتری
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
چکیده انگلیسی


• K-partite graph is a useful model for multi-source matching problems.
• It is useful to have mid-level representations to fuse data from multiple sources.
• Alternate background models can help characterize the background scenes accurately.
• Shadow pixels can be accurately removed by using their planner view-invariance.
• Appearance based Bhattacharyya distance is a robust blob-similarity measure.

We present a novel approach for robust localization of multiple people observed using a set of static cameras. We use this location information to generate a visualization of the virtual offside line in soccer games. To compute the position of the offside line, we need to localize players’ positions, and identify their team roles. We solve the problem of fusing corresponding players’ positional information by finding minimum weight K-length cycles in a complete K-partite graph. Each partite of the graph corresponds to one of the K cameras, whereas each node of a partite encodes the position and appearance of a player observed from a particular camera. To find the minimum weight cycles in this graph, we use a dynamic programming based approach that varies over a continuum from maximally to minimally greedy in terms of the number of graph-paths explored at each iteration. We present proofs for the efficiency and performance bounds of our algorithms. Finally, we demonstrate the robustness of our framework by testing it on 82,000 frames of soccer footage captured over eight different illumination conditions, play types, and team attire. Our framework runs in near-real time, and processes video from 3 full HD cameras in about 0.4 s for each set of corresponding 3 frames.

Figure optionsDownload high-quality image (212 K)Download as PowerPoint slide

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Vision and Image Understanding - Volume 118, January 2014, Pages 171–183
نویسندگان
, , , ,