Sciweavers

1319 search results - page 131 / 264
» Using the Structure of HTML Documents to Improve Retrieval
Sort
View
128
Voted
ICDT
2003
ACM
127views Database» more  ICDT 2003»
15 years 8 months ago
Incremental Validation of XML Documents
We investigate the incremental validation of XML documents with respect to DTDs and XML Schemas, under updates consisting of element tag renamings, insertions and deletions. DTDs ...
Yannis Papakonstantinou, Victor Vianu
118
Voted
SIGIR
2006
ACM
15 years 9 months ago
Load balancing for term-distributed parallel retrieval
Large-scale web and text retrieval systems deal with amounts of data that greatly exceed the capacity of any single machine. To handle the necessary data volumes and query through...
Alistair Moffat, William Webber, Justin Zobel
KDD
1998
ACM
101views Data Mining» more  KDD 1998»
15 years 7 months ago
Probabilistic Modeling for Information Retrieval with Unsupervised Training Data
We apply a well-known Bayesian probabilistic model to textual information retrieval: the classification of documents based on their relevance to a query. This model was previously...
Ernest P. Chan, Santiago Garcia, Salim Roukos
119
Voted
ACL
2009
15 years 1 months ago
Optimizing Language Model Information Retrieval System with Expectation Maximization Algorithm
Statistical language modeling (SLM) has been used in many different domains for decades and has also been applied to information retrieval (IR) recently. Documents retrieved using...
Justin Liang-Te Chiu, Jyun-Wei Huang
141
Voted
BMCBI
2007
142views more  BMCBI 2007»
15 years 3 months ago
LinkHub: a Semantic Web system that facilitates cross-database queries and information retrieval in proteomics
nd: A key abstraction in representing proteomics knowledge is the notion of unique identifiers for individual entities (e.g. proteins) and the massive graph of relationships among...
Andrew K. Smith, Kei-Hoi Cheung, Kevin Y. Yip, Mar...