Article ID Journal Published Year Pages File Type
4942983 Expert Systems with Applications 2017 24 Pages PDF
Abstract
This paper studies discriminant learning for guessing one of these demographic facets: the gender of an asker. In so doing, it capitalizes on a large-scale corpus automatically constructed from the integration of Yahoo! Search and Yahoo! Answer profiles. Then, this corpus is utilized for examining the impact of numerous features extracted from assorted sources: texts, demographics, meta-data, social interactions and web search. In brief, good non-linguistic gender indicators were age, industry and second-level question categories. If these are inaccessible, our outcomes indicate that models can still infer them, to some extent, from textual sources by means of semantic analysis and dependency relations. Overall, our best configuration reached an accuracy of 74.50%.
Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
,