Article ID Journal Published Year Pages File Type
1110956 Procedia - Social and Behavioral Sciences 2015 5 Pages PDF
Abstract

Big data is a broad term for data sets so large and complex that traditional data processing applications are inadequate. A new field, Predictive Analytics, is trying to extract value from those big (unstructured) data. In Corpus Linguistics, researchers usually deal with small data. In this paper, we compare the amount and the quality of information with respect to a single topic (flu) in Twitter and in MultiMedica (a corpus of medicine texts).

Related Topics
Social Sciences and Humanities Arts and Humanities Arts and Humanities (General)