Sciweavers

87 search results - page 2 / 18
» Document zone content classification and its performance eva...
Sort
View
AIRS
2006
Springer
13 years 8 months ago
Learning to Separate Text Content and Style for Classification
Many text documents naturally have two kinds of labels. For example, we may label web pages from universities according to their categories, such as "student" or "fa...
Dell Zhang, Wee Sun Lee
CIVR
2008
Springer
220views Image Analysis» more  CIVR 2008»
13 years 6 months ago
Web-based information content and its application to concept-based video retrieval
Semantic similarity between words or phrases is frequently used to find matching correlations between search queries and documents when straightforward matching of terms fails. Th...
Alexander Haubold, Apostol Natsev
ICONIP
2007
13 years 6 months ago
Classification of Documents Based on the Structure of Their DOM Trees
In this paper, we discuss kernels that can be applied for the classification of XML documents based on their DOM trees. DOM trees are ordered trees in which every node might be la...
Peter Geibel, Olga Pustylnikov, Alexander Mehler, ...
RIAO
2000
13 years 6 months ago
Language sensitive text classification
It is a traditional belief that in order to scale-up to more effective retrieval and access methods modern Information Retrieval has to consider more the text content. The modalit...
Roberto Basili, Alessandro Moschitti, Maria Teresa...
SIGIR
2005
ACM
13 years 10 months ago
Title extraction from bodies of HTML documents and its application to web page retrieval
This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...
Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...