Sciweavers

6103 search results - page 973 / 1221
» Multimedia Retrieval Algorithmics
Sort
View
120
Voted
WWW
2010
ACM
15 years 9 months ago
A scalable machine-learning approach for semi-structured named entity recognition
Named entity recognition studies the problem of locating and classifying parts of free text into a set of predefined categories. Although extensive research has focused on the de...
Utku Irmak, Reiner Kraft
142
Voted
WWW
2010
ACM
15 years 9 months ago
A pattern tree-based approach to learning URL normalization rules
Duplicate URLs have brought serious troubles to the whole pipeline of a search engine, from crawling, indexing, to result serving. URL normalization is to transform duplicate URLs...
Tao Lei, Rui Cai, Jiang-Ming Yang, Yan Ke, Xiaodon...
112
Voted
WSDM
2009
ACM
161views Data Mining» more  WSDM 2009»
15 years 9 months ago
Predicting the readability of short web summaries
Readability is a crucial presentation attribute that web summarization algorithms consider while generating a querybaised web summary. Readability quality also forms an important ...
Tapas Kanungo, David Orr
113
Voted
WSDM
2009
ACM
115views Data Mining» more  WSDM 2009»
15 years 9 months ago
Discovering and using groups to improve personalized search
Personalized Web search takes advantage of information about an individual to identify the most relevant results for that person. A challenge for personalization lies in collectin...
Jaime Teevan, Meredith Ringel Morris, Steve Bush
122
Voted
KDD
2009
ACM
193views Data Mining» more  KDD 2009»
15 years 9 months ago
Category detection using hierarchical mean shift
Many applications in surveillance, monitoring, scientific discovery, and data cleaning require the identification of anomalies. Although many methods have been developed to iden...
Pavan Vatturi, Weng-Keen Wong