Article ID Journal Published Year Pages File Type
515601 Information Processing & Management 2012 14 Pages PDF
Abstract

An automatic patent categorization system would be invaluable to individual inventors and patent attorneys, saving them time and effort by quickly identifying conflicts with existing patents. In recent years, it has become more and more common to classify all patent documents using the International Patent Classification (IPC), a complex hierarchical classification system comprised of eight sections, 128 classes, 648 subclasses, about 7200 main groups, and approximately 72,000 subgroups. So far, however, no patent categorization method has been developed that can classify patents down to the subgroup level (the bottom level of the IPC). Therefore, this paper presents a novel categorization method, the three phase categorization (TPC) algorithm, which classifies patents down to the subgroup level with reasonable accuracy. The experimental results for the TPC algorithm, using the WIPO-alpha collection, indicate that our classification method can achieve 36.07% accuracy at the subgroup level. This is approximately a 25,764-fold improvement over a random guess.

► So far no patent categorization method can classify patents down to the bottom level of IPC. ► This paper presents a novel categorization method named with the three phase categorization (TPC) algorithm. ► The TPC algorithm can classify patents down to the bottom level with a reasonable accuracy. ► The three phase approach provides a good framework to develop efficient patent classification algorithms for future research.

Related Topics
Physical Sciences and Engineering Computer Science Computer Science Applications
Authors
, ,