Sciweavers

538 search results - page 42 / 108
» Mining Relevant Text from Unlabelled Documents
Sort
View
ICDM
2008
IEEE
182views Data Mining» more  ICDM 2008»
15 years 4 months ago
Multiple-Instance Regression with Structured Data
We present a multiple-instance regression algorithm that models internal bag structure to identify the items most relevant to the bag labels. Multiple-instance regression (MIR) op...
Kiri L. Wagstaff, Terran Lane, Alex Roper
84
Voted
WSDM
2010
ACM
204views Data Mining» more  WSDM 2010»
15 years 4 months ago
Learning URL patterns for webpage de-duplication
Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...
Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...
ECIR
1998
Springer
14 years 11 months ago
Independence of Contributing Retrieval Strategies in Data Fusion for Effective Information Retrieval
: In information retrieval, data fusion is a technique for combining the outputs of more than one retrieval strategy which rank documents for retrieval. One of the observations oft...
Alan F. Smeaton
ICDM
2009
IEEE
109views Data Mining» more  ICDM 2009»
15 years 4 months ago
Knowledge Discovery from Citation Networks
—Knowledge discovery from scientific articles has received increasing attentions recently since huge repositories are made available by the development of the Internet and digit...
Zhen Guo, Zhongfei Zhang, Shenghuo Zhu, Yun Chi, Y...
AI
2000
Springer
15 years 2 months ago
Using Noun Phrase Heads to Extract Document Keyphrases
Automatically extracting keyphrases from documents is a task with many applications in information retrieval and natural language processing. Document retrieval can be biased towar...
Ken Barker, Nadia Cornacchia