Sciweavers

Retrieval of Partial Documents
NIPS
2000
14 years 12 months ago
The Missing Link - A Probabilistic Model of Document Content and Hypertext Connectivity
We describe a joint probabilistic model for modeling the contents and inter-connectivity of document collections such as sets of web pages or research paper archives. The model is...
David A. Cohn, Thomas Hofmann
JUCS
2008
14 years 10 months ago
Informatics for Historians: Tools for Medieval Document XML Markup, and their Impact on the History-Sciences
This article is a revised and extended version of [VBG, 07]. We conjecture that the digitalization of historical text documents as a basis of data mining and information retrieva...
Benjamin Burkard, Georg Vogeler, Stefan Gruner
WWW
2005
ACM
15 years 11 months ago
Web data extraction based on partial tree alignment
This paper studies the problem of extracting data from a Web page that contains several structured data records. The objective is to segment these data records, extract data items...
Yanhong Zhai, Bing Liu
CIKM
2009
Springer
15 years 3 months ago
Space-economical partial gram indices for exact substring matching
Exact substring matching queries on large data collections can be answered using q-gram indices, which store for each occurring q-byte pattern an (ordered) posting list with the po...
Nan Tang, Lefteris Sidirourgos, Peter A. Boncz
EKAW
2008
Springer
15 years 8 days ago
Ontological Profiles in Enterprise Search
Ontology-driven search applications use ontological concepts either to index documents or to guide and understand the users. Since ontologies by nature are domain-dependent and app...
Geir Solskinnsbakk, Jon Atle Gulla