Sciweavers

» Clustering Template Based Web Documents
ICDT
2007
ACM
15 years 3 months ago
Hierarchical Summarizing and Evaluating for Web Pages
In this investigation we propose a novel summarization method of Web pages using hierarchical expression. We discuss close relationship between summarization and hierarchical clust...
Kou Takahashi, Takao Miura, Isamu Shioya
CVPR
2011
IEEE
14 years 3 months ago
Registration of Camera Captured Documents Under Non-rigid Deformation
Document registration is a problem where the image of a template document whose layout is known is registered with a test document image. Given the registration parameters, layout...
Venkata Edupuganti, Suryaprakash Kompalli, Vinayak...
IADIS
2004
15 years 1 months ago
'surfing for knowledge' finding semantically similar Web clusters
In this paper we present our technique for finding semantically similar clusters within web documents obtained from a set of queries retrieved from the Google search engine. This ...
David Cleary, Diarmuid O'Donoghue
WWW
2008
ACM
16 years 13 days ago
As we may perceive: finding the boundaries of compound documents on the web
This paper considers the problem of identifying on the Web compound documents (cDocs): groups of web pages that in aggregate constitute semantically coherent information entities...
Pavel Dmitriev
COLING
2010
14 years 6 months ago
Open Entity Extraction from Web Search Query Logs
In this paper we propose a completely unsupervised method for open-domain entity extraction and clustering over query logs. The underlying hypothesis is that classes defined by mi...
Alpa Jain, Marco Pennacchiotti