Sciweavers

86 search results - page 1 / 18
» Type Classification of Semi-Structured Documents
Sort
View
VLDB
1995
ACM
112views Database» more  VLDB 1995»
13 years 7 months ago
Type Classification of Semi-Structured Documents
Semi-structured documents (e.g. journal art,icles, electronic mail, television programs, mail order catalogs, ...) a.re often not explicitly typed; the only available t,ype inform...
Markus Tresch, Neal Palmer, Allen Luniewski
CEAS
2006
Springer
13 years 8 months ago
An Adaptive, Semi-Structured Language Model Approach to Spam Filtering on a New Corpus
Motivated by current efforts to construct more realistic spam filtering experimental corpora, we present a newly assembled, publicly available corpus of genuine and unsolicited (s...
Ben Medlock
WEBI
2004
Springer
13 years 9 months ago
Semi-Structured Complex List Extraction
The semi-structured information available in HTML and similar documents provide valuable information that can be used for information extraction applications. This information tog...
Anders Arpteg
CICLING
2010
Springer
12 years 11 months ago
An Empirical Study on the Feature's Type Effect on the Automatic Classification of Arabic Documents
The Arabic language is a highly flexional and morphologically very rich language. It presents serious challenges to the automatic classification of documents, one of which is deter...
Saeed Raheel, Joseph Dichy
ICDAR
2009
IEEE
13 years 2 months ago
Image Classification to Improve Printing Quality of Mixed-Type Documents
Functional image classification is the assignment of different image types to separate classes to optimize their rendering for reading or other specific end task, and is an import...
Rafael Dueire Lins, Gabriel Pereira e Silva, Steve...