Article ID Journal Published Year Pages File Type
379633 Electronic Commerce Research and Applications 2015 14 Pages PDF
Abstract

•We propose a MIDAS framework which incorporates Google search data to predict Chinese inflation index.•We extract five groups of keywords, which include both subject terms and trends terms to construct search data.•The high frequency part of Google search data is directly used in MIDAS model to predict monthly CPI.•The MIDAS model outperforms other benchmark models in CPI forecast.

Increased internet penetration makes it possible for user generated content (UGC) to reflect people’s insights and expectations on economic activities. As representative and easily accessible UGC data that reflect public opinions on economic issues, Google search data have been used to forecast macroeconomic indicators in existing literatures. However, very little empirical research has directly used Google search data to improve the forecast accuracy. This paper proposes an integrated framework, which constructs keywords base and extracts search data accordingly, and then incorporates the search data into a mixed data sampling (MIDAS) model. Five groups of search data are extracted based on the constructed keywords and are then used in MIDAS model to forecast Chinese consumer price index (CPI) from 2004 to 2012. The empirical results indicate that the search data are strongly correlated with CPI, which is officially released by the Statistic Bureau of China; the MIDAS model including the search data outperforms the benchmark models, with the average reduction of root mean square error (RMSE) being 32.9%. This research provides a rigorous and generalizable framework for macroeconomic trend prediction using Google search data, and would have great potential in supporting business decisions by eliciting relevant information from UGC data in the Internet.

Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, , , ,