Sciweavers

850 search results - page 21 / 170
» Representing Text Chunks
Sort
View
ICNC
2005
Springer
15 years 3 months ago
Using SOFM to Improve Web Site Text Content
We introduce a new method to improve web site text content by identifying the most relevant free text in the web pages. In order to understand the variations in web page text, we c...
Sebastián A. Ríos, Juan D. Vel&aacut...
LREC
2010
162views Education» more  LREC 2010»
14 years 11 months ago
Text Cluster Trimming for Better Descriptions and Improved Quality
Text clustering is potentially very useful for exploration of text sets that are too large to study manually. The success of such a tool depends on whether the results can be expl...
Magnus Rosell
ICIP
1999
IEEE
15 years 11 months ago
Digipaper: A Versatile Color Document Image Representation
We describe a segmentation method and associated file format for storing images of color documents. We separate each page of the document into three layers, containing the backgro...
Daniel P. Huttenlocher, Pedro F. Felzenszwalb, Wil...
ECAI
2008
Springer
14 years 11 months ago
Author Identification Using a Tensor Space Representation
Author identification is a text categorization task with applications in intelligence, criminal law, computer forensics, etc. Usually, in such cases there is shortage of training t...
Spyridon Plakias, Efstathios Stamatatos
ERCIMDL
1997
Springer
130views Education» more  ERCIMDL 1997»
15 years 1 months ago
Modelling the Retrieval of Structured Documents Containing Texts and Images
Abstract. We present a model for complex documents possibly consisting of a hierarchically structured set of images or texts. Documents are represented both at the form level (as s...
Carlo Meghini, Fabrizio Sebastiani, Umberto Stracc...