Sciweavers

1319 search results - page 49 / 264
» Using the Structure of HTML Documents to Improve Retrieval
KDD
2002
ACM
16 years 3 months ago
Discovering informative content blocks from Web documents
In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...
Shian-Hua Lin, Jan-Ming Ho
CIKM
2005
Springer
15 years 8 months ago
Structural features in content oriented XML retrieval
The structural features of XML components are an extra source of information that should be used in a content-oriented retrieval task on this type of documents. This paper explores...
Georgina Ramírez, Thijs Westerveld, Arjen P...
SPIRE
2005
Springer
15 years 8 months ago
Retrieval Status Values in Information Retrieval Evaluation
Retrieval systems rank documents according to their retrieval status values (RSV) if these are monotonically increasing with the probability of relevance of documents. In this work,...
Amélie Imafouo, Xavier Tannier
SIGIR
2008
ACM
15 years 3 months ago
Detecting synonyms in social tagging systems to improve content retrieval
Collaborative tagging used in online social content systems is naturally characterized by many synonyms, causing low precision retrieval. We propose a mechanism based on user pref...
Maarten Clements, Arjen P. de Vries, Marcel J. T. ...
KES
2010
Springer
15 years 1 months ago
DOCODE-Lite: A Meta-Search Engine for Document Similarity Retrieval
The retrieval of similar documents from large-scale datasets has been one of the main concerns in knowledge management environments, such as plagiarism detection, news impact a...
Felipe Bravo-Marquez, Gaston L'Huillier, Sebasti&a...