Sciweavers

57 search results - page 1 / 12
» Content Characterization Using Word Shape Tokens
Sort
View
CICLING
2007
Springer
13 years 11 months ago
On the Impact of Lexical and Linguistic Features in Genre- and Domain-Based Categorization
Abstract. Classification in genres and domains is a major field of research for Information Retrieval (scientific and technical watch, datamining, etc.) and the selection of app...
Guillaume Cleuziou, Céline Poudat
ANLP
1994
105views more  ANLP 1994»
13 years 6 months ago
Modeling Content Identification from Document Images
A new technique to locate content-representing words for a given document image using representation of character shapes is described. A character shape code representation define...
Takehiro Nakayama
ANLP
1994
104views more  ANLP 1994»
13 years 6 months ago
Language Determination: Natural Language Processing from Scanned Document Images
Many documents are available to a computer only as images from paper. However, most natural language processing systems expect their input as character-coded text, which may be di...
Penelope Sibun, A. Lawrence Spitz
PAKDD
2009
ACM
263views Data Mining» more  PAKDD 2009»
13 years 11 months ago
Spatial Weighting for Bag-of-Visual-Words and Its Application in Content-Based Image Retrieval
It is a challenging and important task to retrieve images from a large and highly varied image data set based on their visual contents. Problems like how to fill the semantic gap b...
Xin Chen, Xiaohua Hu, Xiajiong Shen