Sciweavers

821 search results - page 13 / 165
» Retrieval from Document Image Collections
Sort
View
CIKM
2003
Springer
15 years 4 months ago
Extracting unstructured data from template generated web documents
We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...
Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...
ICDAR
2009
IEEE
15 years 6 months ago
Enhanced Text Extraction from Arabic Degraded Document Images Using EM Algorithm
This paper presents a new enhanced text extraction algorithm from degraded document images on the basis of the probabilistic models. The observed document image is considered as a...
Wafa Boussellaa, Aymen Bougacha, Abderrazak Zahour...
133
Voted
ICDAR
2011
IEEE
13 years 11 months ago
Towards Searchable Digital Urdu Libraries - A Word Spotting Based Retrieval Approach
—Libraries in South Asia hold huge collections of valuable printed documents in Urdu and it is of interest to digitize these collections to make them more accessible. The unavail...
Ali Abidi, Imran Siddiqi, Khurram Khurshid
ERCIMDL
2006
Springer
158views Education» more  ERCIMDL 2006»
15 years 3 months ago
A Content-Based Image Retrieval Service for Archaeology Collections
Archeological sites have heterogeneous information ranging from different artifacts, image data, geo-spatial information, chronological data, and other relevant metadata. ETANA-DL,...
Naga Srinivas Vemuri, Ricardo da Silva Torres, Rao...
IPM
2010
174views more  IPM 2010»
14 years 9 months ago
Managing structured queries in probabilistic XML retrieval systems
Focusing on the context of XML retrieval, in this paper we propose a general methodology for managing structured queries (involving both content and structure) within any given st...
Luis M. de Campos, Juan M. Fernández-Luna, ...