Sciweavers

3693 search results - page 63 / 739
» Network Processing of Documents, for Documents, by Documents
Sort
View
CORR
2006
Springer
71views Education» more  CORR 2006»
14 years 11 months ago
Using NLP to build the hypertextuel network of a back-of-the-book index
Relying on the idea that back-of-the-book indexes are traditional devices for navigation through large documents, we have developed a method to build a hypertextual network that h...
Touria Aït El Mekki, Adeline Nazarenko
WWW
2011
ACM
14 years 6 months ago
Inverted index compression via online document routing
Modern search engines are expected to make documents searchable shortly after they appear on the ever changing Web. To satisfy this requirement, the Web is frequently crawled. Due...
Gal Lavee, Ronny Lempel, Edo Liberty, Oren Somekh
ICDAR
1997
IEEE
15 years 3 months ago
Local Skew Angle Estimation from Background Space in Text Regions
Almost all document analysis approaches need to perform a global analysis of the page orientation as a separate process at an early stage. It would be preferable to estimate the o...
Apostolos Antonacopoulos
PVLDB
2008
85views more  PVLDB 2008»
14 years 10 months ago
Scalable ad-hoc entity extraction from text collections
Supporting entity extraction from large document collections is important for enabling a variety of important data analysis tasks. In this paper, we introduce the "ad-hoc&quo...
Sanjay Agrawal, Kaushik Chakrabarti, Surajit Chaud...
WWW
2010
ACM
15 years 6 months ago
LCA-based selection for XML document collections
In this paper, we address the problem of database selection for XML document collections, that is, given a set of collections and a user query, how to rank the collections based o...
Georgia Koloniari, Evaggelia Pitoura