On "one of the few" objects

7 years 12 months ago
On "one of the few" objects
Objects with multiple numeric attributes can be compared within any “subspace” (subset of attributes). In applications such as computational journalism, users are interested in claims of the form: Karl Malone is one of the only two players in NBA history with at least 25,000 points, 12,000 rebounds, and 5,000 assists in one’s career. One challenge in identifying such “one-of-the-k” claims (k = 2 above) is ensuring their “interestingness.” A small k is not a good indicator for interestingness, as one can often make such claims for many objects by increasing the dimensionality of the subspace considered. We propose a uniqueness-based interestingness measure for one-of-the-few claims that is intuitive for nontechnical users, and we design algorithms for finding all interesting claims (across all subspaces) from a dataset. Sometimes, users are interested primarily in the objects appearing in these claims. Building on our notion of interesting claims, we propose a scheme for...
You Wu, Pankaj K. Agarwal, Chengkai Li, Jun Yang 0
Added 28 Sep 2012
Updated 28 Sep 2012
Type Journal
Year 2012
Where KDD
Authors You Wu, Pankaj K. Agarwal, Chengkai Li, Jun Yang 0001, Cong Yu
Comments (0)