Sciweavers

1071 search results - page 90 / 215
» A kernel-based approach to document retrieval
Sort
View
CIKM
2008
Springer
14 years 12 months ago
Winnowing-based text clustering
We present an approach to document clustering based on winnowing fingerprints that achieved good values of effectiveness with considerable save in memory space and computation tim...
Javier Parapar, Alvaro Barreiro
CLEF
2005
Springer
15 years 3 months ago
Dublin City University at CLEF 2005: Experiments with the ImageCLEF St Andrew's Collection
The aim of the Dublin City University’s participation in the CLEF 2005 ImageCLEF St Andrew’s Collection task was to explore an alternative approach to exploiting text annotatio...
Gareth J. F. Jones, Kieran McDonald
ICDAR
2009
IEEE
15 years 4 months ago
Keyword Spotting in Document Images through Word Shape Coding
With large databases of document images available, a method for users to find keywords in documents will be useful. One approach is to perform Optical Character Recognition (OCR) ...
Shuyong Bai, Linlin Li, Chew Lim Tan
DIAL
2004
IEEE
156views Image Analysis» more  DIAL 2004»
15 years 1 months ago
Xed: A New Tool for eXtracting Hidden Structures from Electronic Documents
PDF became a very common format for exchanging printable documents. Further, it can be easily generated from the major documents formats, which make a huge number of PDF documents...
Karim Hadjar, Maurizio Rigamonti, Denis Lalanne, R...
AAAI
1997
14 years 11 months ago
Template-Based Information Mining from HTML Documents
Tools for mining information from data can create added value for the Internet. As the majority of electronic documents available over the network are in unstructured textual form...
Jane Yung-jen Hsu, Wen-tau Yih