Sciweavers

967 search results - page 97 / 194
» Text Mining
Sort
View
CASCON
2006
150views Education» more  CASCON 2006»
15 years 4 months ago
Exploring a new space of features for document classification: figure clustering
Automatic document classification is an important step in organizing and mining documents. Information in documents is often conveyed using both text and images that complement ea...
Nawei Chen, Hagit Shatkay, Dorothea Blostein
108
Voted
SDM
2004
SIAM
142views Data Mining» more  SDM 2004»
15 years 4 months ago
Learning to Read Between the Lines: The Aspect Bernoulli Model
We present a novel probabilistic multiple cause model for binary observations. In contrast to other approaches, the model is linear and it infers reasons behind both observed and ...
Ata Kabán, Ella Bingham, T. Hirsimäki
125
Voted
IJCAI
2003
15 years 4 months ago
Web Page Cleaning for Web Mining through Feature Weighting
Unlike conventional data or text, Web pages typically contain a large amount of information that is not part of the main contents of the pages, e.g., banner ads, navigation bars, ...
Lan Yi, Bing Liu
WWW
2009
ACM
16 years 4 months ago
Deducing trip related information from flickr
Uploading tourist photos is a popular activity on photo sharing platforms. These photographs and their associated metadata (tags, geo-tags, and temporal information) should be use...
Adrian Popescu, Gregory Grefenstette
CIKM
2008
Springer
15 years 5 months ago
Information shared by many objects
If Kolmogorov complexity [25] measures information in one object and Information Distance [4, 23, 24, 42] measures information shared by two objects, how do we measure information...
Chong Long, Xiaoyan Zhu, Ming Li, Bin Ma