Sciweavers

CIKM
2003
Springer
13 years 8 months ago
Extracting unstructured data from template generated web documents
We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...
Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...
WEBDB
2005
Springer
88views Database» more  WEBDB 2005»
13 years 9 months ago
Vague Content and Structure (VCAS) Retrieval for Document-centric XML Collections
Querying document-centric XML collections with structure conditions improves retrieval precisions. The structures of such XML collections, however, are often too complex for users...
Shaorong Liu, Wesley W. Chu, Ruzan Shahinian
WWW
2007
ACM
14 years 4 months ago
Bayesian network based sentence retrieval model
This paper makes an intensive investigation of the application of Bayesian network in sentence retrieval and introduces three Bayesian network based sentence retrieval models with...
Keke Cai, Jiajun Bu, Chun Chen, Kangmiao Liu, Wei ...
CVPR
1999
IEEE
14 years 5 months ago
The Customized-Queries Approach to CBIR Using EM
This paper makes two contributions. The first contribution is an approach called the "customized-queries" approach (CQA) to content-based image retrieval. The second is ...
Jennifer G. Dy, Carla E. Brodley, Avinash C. Kak, ...