Sciweavers

» Retrieval of Partial Documents
WWW
2006
ACM
16 years 5 months ago
Using graph matching techniques to wrap data from PDF documents
Wrapping is the process of navigating a data source, semiautomatically extracting data and transforming it into a form suitable for data processing applications. There are current...
Tamir Hassan, Robert Baumgartner
WWW
2005
ACM
16 years 5 months ago
Automatically learning document taxonomies for hierarchical classification
While several hierarchical classification methods have been applied to web content, such techniques invariably rely on a pre-defined taxonomy of documents. We propose a new techni...
Kunal Punera, Suju Rajan, Joydeep Ghosh
WACV
2007
IEEE
15 years 11 months ago
Warped Document Image Restoration Using Shape-from-Shading and Physically-Based Modeling
With the pervasive use of handheld digital devices such as camera phones and PDAs, people have started to capture images as a way of recording information. However, due to the non...
Li Zhang, Chew Lim Tan
CIKM
2004
Springer
15 years 10 months ago
Document clustering based on cluster validation
This paper presents a cluster validation based document clustering algorithm, which is capable of identifying both important feature words and true model order (cluster number). I...
Zheng-Yu Niu, Dong-Hong Ji, Chew Lim Tan
IRAL
2003
ACM
15 years 10 months ago
Extraction of user preferences from a few positive documents
In this work, we propose a new method for extracting user preferences from a few documents that might interest users. To this end, we first extract candidate terms and choose a n...
Byeong Man Kim, Qing Li, Jong-Wan Kim