Sciweavers

6258 search results - page 143 / 1252
» Applied Text Generation
Sort
View
ICPR
2002
IEEE
15 years 11 months ago
Word Segmentation of Printed Text Lines Based on Gap Clustering and Special Symbol Detection
This paper proposes a word segmentation method for machine-printed text lines. It utilizes gaps and special symbols as delimiters between words. A gap clustering technique is used...
Soo-Hyung Kim, Chang Bu Jeong, Hee K. Kwag, Ching ...
ICML
1997
IEEE
15 years 11 months ago
A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization
The Rocchio relevance feedback algorithm is one of the most popular and widely applied learning methods from information retrieval. Here, a probabilistic analysis of this algorith...
Thorsten Joachims
EUROPAR
2007
Springer
15 years 4 months ago
Parallel Nearest Neighbour Algorithms for Text Categorization
In this paper we describe the parallelization of two nearest neighbour classification algorithms. Nearest neighbour methods are well-known machine learning techniques. They have be...
Reynaldo Gil-García, José Manuel Bad...
VLDB
2007
ACM
141views Database» more  VLDB 2007»
15 years 4 months ago
BlogScope: A System for Online Analysis of High Volume Text Streams
We present BlogScope (www.blogscope.net), a system for online analysis of temporally ordered streaming text, currently applied to the analysis of the Blogosphere1 . The system cur...
Nilesh Bansal, Nick Koudas
ACSC
2002
IEEE
15 years 3 months ago
Enhanced Word-Based Block-Sorting Text Compression
The Block Sorting process of Burrows and Wheeler can be applied to any sequence in which symbols are (or might be) conditioned upon each other. In particular, it is possible to pa...
R. Yugo Kartono Isal, Alistair Moffat, A. C. H. Ng...