کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
472662 | 698737 | 2011 | 15 صفحه PDF | دانلود رایگان |

Many applications today need to manage uncertain data, such as information extraction (IE), data integration, sensor RFID networks, and scientific experiments. Top-kk queries are often natural and useful in analyzing uncertain data in those applications. In this paper, we study the problem of answering top-kk queries in a probabilistic framework from a state-of-the-art statistical IE model—semi-conditional random fields (CRFs)—in the setting of probabilistic databases that treat statistical models as first-class data objects. We investigate the problem of ranking the answers to probabilistic database queries. We present an efficient algorithm for finding the best approximating parameters in such a framework for efficiently retrieving the top-kk ranked results. An empirical study using real data sets demonstrates the effectiveness of probabilistic top-kk queries and the efficiency of our method.
Journal: Computers & Mathematics with Applications - Volume 62, Issue 7, October 2011, Pages 2755–2769