کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
394558 | 665813 | 2009 | 13 صفحه PDF | دانلود رایگان |
![عکس صفحه اول مقاله: Contingency matrix theory: Statistical dependence in a contingency table Contingency matrix theory: Statistical dependence in a contingency table](/preview/png/394558.png)
Chance discovery aims at understanding the meaning of functional dependency from the viewpoint of unexpected relations. One of the most important observations is that such a chance is hidden under a huge number of coocurrencies extracted from a given data. On the other hand, conventional data-mining methods are strongly dependent on frequencies and statistics rather than interestingness or unexpectedness. This paper discusses some limitations of ideas of statistical dependence, especially focusing on the formal characteristics of Simpson’s paradox from the viewpoint of linear algebra. Theoretical results show that such a Simpson’s paradox can be observed when a given contingency table as a matrix is not regular, in other words, the rank of a contingency matrix is not full. Thus, data-ordered evidence gives some limitations, which should be compensated by human-oriented reasoning.
Journal: Information Sciences - Volume 179, Issue 11, 13 May 2009, Pages 1615–1627