Sciweavers

1319 search results - page 87 / 264
» Using the Structure of HTML Documents to Improve Retrieval
Sort
View
146
Voted
CLEF
2004
Springer
15 years 7 months ago
UB at CLEF2004: Cross Language Information Retrieval Using Statistical Language Models
This paper presents the results of the State University of New York at Buffalo (UB) in the Mono-lingual and Multi-lingual tasks at CLEF 2004. For these tasks we used an approach ba...
Miguel E. Ruiz, Munirathnam Srikanth
JCDL
2003
ACM
160views Education» more  JCDL 2003»
15 years 8 months ago
Automatic Document Metadata Extraction Using Support Vector Machines
Automatic metadata generation provides scalability and usability for digital libraries and their collections. Machine learning methods offer robust and adaptable automatic metadat...
Hui Han, C. Lee Giles, Eren Manavoglu, Hongyuan Zh...
JCDL
2006
ACM
237views Education» more  JCDL 2006»
15 years 9 months ago
Automatic extraction of table metadata from digital documents
Tables are used to present, list, summarize, and structure important data in documents. In scholarly articles, they are often used to present the relationships among data and high...
Ying Liu, Prasenjit Mitra, C. Lee Giles, Kun Bai
135
Voted
TREC
2003
15 years 4 months ago
Improving the Robustness of Language Models - UIUC TREC 2003 Robust and Genomics Experiments
In this paper, we report our experiments in the TREC 2003 Genomics Track and the Robust Track. A common theme that we explored is the robustness of a basic language modeling retri...
ChengXiang Zhai, Tao Tao, Hui Fang, Zhidi Shang
134
Voted
HT
2005
ACM
15 years 9 months ago
As we may perceive: inferring logical documents from hypertext
In recent years, many algorithms for the Web have been developed that work with information units distinct from individual web pages. These include segments of web pages or aggreg...
Pavel Dmitriev, Carl Lagoze, Boris Suchkov