Article ID Journal Published Year Pages File Type
427239 Information Processing Letters 2015 4 Pages PDF
Abstract

•We define the Google Scholar Merge Problem after the user interface used on the profile pages of Google Scholar.•We prove that the Google Scholar Merge Problem is NP-complete by reduction from 3-partition.•We list open problems that are variations of the Google Scholar Merge Problem.

With Google Scholar, scientists can maintain their publications on personal profile pages, while the citations to these works are automatically collected and counted. Maintenance of publications is done manually by the researcher herself, and involves deleting erroneous ones, merging ones that are the same but which were not recognized as the same, adding forgotten co-authors, and correcting titles of papers and venues. The publications are presented on pages with 20 or 100 papers in the web page interface from 2012–2014. (Since mid 2014, Google Scholar's profile pages allow any number of papers on a single page.) The interface does not allow a scientist to merge two versions of a paper if they appear on different pages. This not only implies that a scientist who wants to merge certain subsets of publications will sometimes be unable to do so, but also, we show in this note that the decision problem to determine if it is possible to merge given subsets of papers is NP-complete.

Related Topics
Physical Sciences and Engineering Computer Science Computational Theory and Mathematics
Authors
, ,