Sciweavers

3152 search results - page 201 / 631
» Retrieval of Partial Documents
NIPS
2000
15 years 6 months ago
The Missing Link - A Probabilistic Model of Document Content and Hypertext Connectivity
We describe a joint probabilistic model for modeling the contents and inter-connectivity of document collections such as sets of web pages or research paper archives. The model is...
David A. Cohn, Thomas Hofmann
JUCS
2008
15 years 5 months ago
Informatics for Historians: Tools for Medieval Document XML Markup, and their Impact on the History-Sciences
This article is a revised and extended version of [VBG, 07]. We conjecture that the digitalization of historical text documents as a basis of data mining and information retrieva...
Benjamin Burkard, Georg Vogeler, Stefan Gruner
WWW
2005
ACM
16 years 5 months ago
Web data extraction based on partial tree alignment
This paper studies the problem of extracting data from a Web page that contains several structured data records. The objective is to segment these data records, extract data items...
Yanhong Zhai, Bing Liu
CIKM
2009
Springer
15 years 9 months ago
Space-economical partial gram indices for exact substring matching
Exact substring matching queries on large data collections can be answered using q-gram indices that store for each occurring q-byte pattern an (ordered) posting list with the po...
Nan Tang, Lefteris Sidirourgos, Peter A. Boncz
EKAW
2008
Springer
15 years 6 months ago
Ontological Profiles in Enterprise Search
Ontology-driven search applications use ontological concepts either to index documents or to guide and understand the users. Since ontologies by nature are domain-dependent and app...
Geir Solskinnsbakk, Jon Atle Gulla