Sciweavers

1319 search results - page 145 / 264
» Using the Structure of HTML Documents to Improve Retrieval
Sort
View
142
Voted
PDIS
1996
IEEE
15 years 7 months ago
Querying the World Wide Web
The World Wide Web is a large, heterogeneous, distributedcollectionof documents connected by hypertext links. The most common technologycurrently used for searching the Web depend...
Alberto O. Mendelzon, George A. Mihaila, Tova Milo
228
Voted
SIGMOD
2008
ACM
122views Database» more  SIGMOD 2008»
16 years 3 months ago
Building query optimizers for information extraction: the SQoUT project
Text documents often embed data that is structured in nature. This structured data is increasingly exposed using information extraction systems, which generate structured relation...
Alpa Jain, Panagiotis G. Ipeirotis, Luis Gravano
153
Voted
CIKM
2010
Springer
15 years 1 months ago
Fast dimension reduction for document classification based on imprecise spectrum analysis
This paper proposes an algorithm called Imprecise Spectrum Analysis (ISA) to carry out fast dimension reduction for document classification. ISA is designed based on the one-sided...
Hu Guan, Bin Xiao, Jingyu Zhou, Minyi Guo, Tao Yan...
143
Voted
EACL
2006
ACL Anthology
15 years 5 months ago
Computing Term Translation Probabilities with Generalized Latent Semantic Analysis
Term translation probabilities proved an effective method of semantic smoothing in the language modelling approach to information retrieval. We use Generalized Latent Semantic Ana...
Irina Matveeva, Gina-Anne Levow
130
Voted
EVOW
2008
Springer
15 years 5 months ago
Improving Query Expansion with Stemming Terms: A New Genetic Algorithm Approach
Nowadays, searching information in the web or in any kind of document collection has become one of the most frequent activities. However, user queries can be formulated in a way th...
Lourdes Araujo, José R. Pérez-Ag&uum...