Article ID Journal Published Year Pages File Type
711894 IFAC-PapersOnLine 2015 6 Pages PDF
Abstract

We present a highly visual process for creating and combining elementary information extraction rules, based on their results, in order to find the rules combination that produces the most accurate information extraction results. A rule's accuracy is determined by its F-Score which is the harmonic mean of the precision and the recall of that rule. Rules are combined using logical OR and AND operators. Running a few hundreds rules combinations over a corpus, in order to determine their accuracies, can take days. Using our approach, millions of rules combinations can be tested and their accuracies (F-Score) can be calculated in few seconds. A prototype was created to demonstrate the effectiveness of our approach

Related Topics
Physical Sciences and Engineering Engineering Computational Mechanics