Sciweavers

» Using the Structure of HTML Documents to Improve Retrieval
DGO
2011
Developing an ontology for the U.S. patent system
The past few years have experienced an explosive growth in scientific and regulatory documents related to the patent system. Relevant information is siloed into many heterogeneous...
Siddharth Taduri, Gloria T. Lau, Kincho H. Law, Ha...
IR
2006
Table extraction for answer retrieval
The ability to find tables and extract information from them is a necessary component of many information retrieval tasks. Documents often contain tables in order to communicate d...
Xing Wei, W. Bruce Croft, Andrew McCallum
TDM
2004
Combining Indexing Schemes to Accelerate Querying XML on Content and Structure
This paper presents the advantages of combining multiple document representation schemes for query processing of XML queries on content and structure. We show how extending the Te...
Georgina Ramírez, Arjen P. de Vries
WWW
2009
ACM
Tag-oriented document summarization
Social annotations on a Web document are highly generalized descriptions of the topics contained in that page. Their tagged frequency indicates the user attentions with various degrees...
Junyan Zhu, Can Wang, Xiaofei He, Jiajun Bu, Chun ...
SIGIR
2004
ACM
On scaling latent semantic indexing for large peer-to-peer systems
The exponential growth of data demands scalable infrastructures capable of indexing and searching rich content such as text, music, and images. A promising direction is to combine...
Chunqiang Tang, Sandhya Dwarkadas, Zhichen Xu