Sciweavers

2827 search results - page 25 / 566
» Marking Text Documents
Sort
View
67
Voted
PR
2002
129views more  PR 2002»
14 years 9 months ago
Text extraction in complex color documents
Text extraction in mixed-type documents is a pre-processing and necessary stage for many document applications. In mixed-type color documents, text, drawings and graphics appear w...
Charalambos Strouthopoulos, Nikos Papamarkos, Anto...
ICML
2002
IEEE
15 years 10 months ago
Partially Supervised Classification of Text Documents
We investigate the following problem: Given a set of documents of a particular topic or class ?, and a large set ? of mixed documents that contains documents from class ? and othe...
Bing Liu, Wee Sun Lee, Philip S. Yu, Xiaoli Li
CIS
2005
Springer
15 years 3 months ago
Concept Chain Based Text Clustering
Different from familiar clustering objects, text documents have sparse data spaces. A common way of representing a document is as a bag of its component words, but the semantic re...
Shaoxu Song, Jian Zhang, Chunping Li
79
Voted
ICA
2007
Springer
15 years 1 months ago
Text Clustering on Latent Thematic Spaces: Variants, Strengths and Weaknesses
Deriving a thematically meaningful partition of an unlabeled document corpus is a challenging task. In this context, the use of document representations based on latent thematic ge...
Xavier Sevillano, Germán Cobo, Francesc Al&...
SEKE
2007
Springer
15 years 3 months ago
Software Documents: Comparison and Measurement
Tom Arbuckle, Adam Balaban, Dennis K. Peters, Mark...