کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
524890 868868 2015 16 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Robust causal dependence mining in big data network and its application to traffic flow predictions
ترجمه فارسی عنوان
معادله وابستگی مقاومتی علیه در شبکه داده بزرگ و کاربرد آن در پیش بینی جریان ترافیک
کلمات کلیدی
اطلاعات بزرگ، پیش بینی جریان ترافیک، وابستگی علمی، رگرسیون کاسو، قدرتمند
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی


• Build traffic flow models based on massive available data.
• Consider both temporal characteristics and spatial dependence of traffic time series.
• Design robust algorithms to handle bursts in traffic time series.
• Design fast algorithms for big data.
• Discuss the relationship between Granger causality and prediction.

In this paper, we focus on a special problem in transportation studies that concerns the so called “Big Data” challenge, which is: how to build concise yet accurate traffic flow prediction models based on the massive data collected by different sensors? The size of the data, the hidden causal dependence and the complexity of traffic time series are some of the obstacles that affect making reliable forecast at a reasonable cost, both time-wise and computation-wise. To better prepare the data for traffic modeling, we introduce a multiple-step strategy to process the raw “Big Data” into compact time series that are better suited for regression and causality analysis. First, we use the Granger causality to define and determine the potential dependence among data, and produce a much condensed set of times series who are also highly dependent. Next, we deploy a decomposition algorithm to separate daily-similar trend and nonstationary bursts components from the traffic flow time series yielded by the Granger test. The decomposition results are then treated by two rounds of Lasso regression: the standard Lasso method is first used to quickly filter out most of the irrelevant data, followed by a robust Lasso method to further remove the disturbance caused by bursts components and recover the strongest dependence among the remaining data. Test results show that the proposed method significantly reduces the costs of building prediction models. Moreover, the obtained causal dependence graph reveals the relationship between the structure of road networks and the correlations among traffic time series. All these findings are useful for building better traffic flow prediction models.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Transportation Research Part C: Emerging Technologies - Volume 58, Part B, September 2015, Pages 292–307
نویسندگان
, , , , , ,