Sciweavers

214 search results - page 42 / 43
» Text Categorization for Multi-label Documents and Many Categ...
Sort
View
SIGIR
2008
ACM
13 years 5 months ago
Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization
Multi-document summarization aims to create a compressed summary while retaining the main characteristics of the original set of documents. Many approaches use statistics and mach...
Dingding Wang, Tao Li, Shenghuo Zhu, Chris H. Q. D...
WWW
2011
ACM
13 years 10 days ago
Geographical topic discovery and comparison
This paper studies the problem of discovering and comparing geographical topics from GPS-associated documents. GPSassociated documents become popular with the pervasiveness of loc...
Zhijun Yin, Liangliang Cao, Jiawei Han, Chengxiang...
SDM
2003
SIAM
184views Data Mining» more  SDM 2003»
13 years 6 months ago
Finding Clusters of Different Sizes, Shapes, and Densities in Noisy, High Dimensional Data
The problem of finding clusters in data is challenging when clusters are of widely differing sizes, densities and shapes, and when the data contains large amounts of noise and out...
Levent Ertöz, Michael Steinbach, Vipin Kumar
WWW
2005
ACM
14 years 6 months ago
Extracting context to improve accuracy for HTML content extraction
Web pages contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article, which distracts a user from actual content. Extraction of "use...
Suhit Gupta, Gail E. Kaiser, Salvatore J. Stolfo
WWW
2004
ACM
14 years 6 months ago
Building a companion website in the semantic web
A problem facing many textbook authors (including one of the authors of this paper) is the inevitable delay between new advances in the subject area and their incorporation in a n...
Timothy Miles-Board, Christopher Bailey, Wendy Hal...