Sciweavers

161 search results - page 13 / 33
» Improving Similarity Measures for Short Segments of Text
Sort
View
IJCAI
2003
14 years 10 months ago
Employing Trainable String Similarity Metrics for Information Integration
The problem of identifying approximately duplicate objects in databases is an essential step for the information integration process. Most existing approaches have relied on gener...
Mikhail Bilenko, Raymond J. Mooney
TOIS
2008
145views more  TOIS 2008»
14 years 9 months ago
Classification-aware hidden-web text database selection
Many valuable text databases on the web have non-crawlable contents that are "hidden" behind search interfaces. Metasearchers are helpful tools for searching over multip...
Panagiotis G. Ipeirotis, Luis Gravano
69
Voted
AIMSA
2008
Springer
15 years 3 months ago
Thematic Segment Retrieval Revisited
Documents, especially long ones, may contain very diverse passages related to different topics. Passages Retrieval approaches have shown that, in most cases, there is a great pote...
Sylvain Lamprier, Tassadit Amghar, Bernard Levrat,...
KCAP
2005
ACM
15 years 3 months ago
Extracting knowledge from evaluative text
Capturing knowledge from free-form evaluative texts about an entity is a challenging task. New techniques of feature extraction, polarity determination and strength evaluation hav...
Giuseppe Carenini, Raymond T. Ng, Ed Zwart
WWW
2009
ACM
15 years 10 months ago
A class-feature-centroid classifier for text categorization
Automated text categorization is an important technique for many web applications, such as document indexing, document filtering, and cataloging web resources. Many different appr...
Hu Guan, Jingyu Zhou, Minyi Guo