Sciweavers

462 search results - page 72 / 93
» Experiments with English-Persian text retrieval
Sort
View
83
Voted
CIKM
2011
Springer
13 years 9 months ago
Partial duplicate detection for large book collections
A framework is presented for discovering partial duplicates in large collections of scanned books with optical character recognition (OCR) errors. Each book in the collection is r...
Ismet Zeki Yalniz, Ethem F. Can, R. Manmatha
CICLING
2009
Springer
15 years 10 months ago
Business Specific Online Information Extraction from German Websites
This paper presents a system that uses the domain name of a German business website to locate its information pages (e.g. company profile, contact page, imprint) and then identifi...
Yeong Su Lee, Michaela Geierhos
72
Voted
ICDAR
2009
IEEE
15 years 4 months ago
Keyword Spotting in Document Images through Word Shape Coding
With large databases of document images available, a method for users to find keywords in documents will be useful. One approach is to perform Optical Character Recognition (OCR) ...
Shuyong Bai, Linlin Li, Chew Lim Tan
CIVR
2009
Springer
221views Image Analysis» more  CIVR 2009»
15 years 4 months ago
Movie segmentation into scenes and chapters using locally weighted bag of visual words
Movies segmentation into semantically correlated units is a quite tedious task due to ”semantic gap”. Low-level features do not provide useful information about the semantical...
Vasileios Chasanis, Argyris Kalogeratos, Aristidis...
ADMA
2008
Springer
240views Data Mining» more  ADMA 2008»
15 years 4 months ago
Automatic Web Tagging and Person Tagging Using Language Models
Abstract. Social bookmarking has become an important web2.0 application recently, which is concerned with the dual user behavior to search - tagging. Although social bookmarking we...
Qiaozhu Mei, Yi Zhang