Sciweavers

» Indexing Text Documents Based on Topic Identification
BIRD
2007
Springer
15 years 3 months ago
Ontology-Based MEDLINE Document Classification
Abstract. An increasing and overwhelming amount of biomedical information is available in the research literature mainly in the form of free-text. Biologists need tools that automa...
Fabrice Camous, Stephen Blott, Alan F. Smeaton
ICDAR
2011
IEEE
13 years 11 months ago
A Handwritten Character Extraction Algorithm for Multi-language Document Image
In this paper, we propose a novel method for extracting handwritten characters from multi-language document images, which may contain various types of characters, e.g. Chinese, ...
Yonghong Song, Guilin Xiao, Yuanlin Zhang, Lei Yan...
ECIR
2008
Springer
15 years 1 months ago
Filaments of Meaning in Word Space
Word space models, in the sense of vector space models built on distributional data taken from texts, are used to model semantic relations between words. We argue that the high dim...
Jussi Karlgren, Anders Holst, Magnus Sahlgren
SYNASC
2007
IEEE
15 years 6 months ago
Wikipedia-Based Kernels for Text Categorization
In recent years several models have been proposed for text categorization. Within this, one of the widely applied models is the vector space model (VSM), where independence betwee...
Zsolt Minier, Zalan Bodo, Lehel Csató
IVC
2007
14 years 11 months ago
Colour text segmentation in web images based on human perception
There is a significant need to extract and analyse the text in images on Web documents, for effective indexing, semantic analysis and even presentation by non-visual means (e.g....
Dimosthenis Karatzas, Apostolos Antonacopoulos