Sciweavers

583 search results - page 3 / 117
» Automatic extraction of titles from general documents using ...
Sort
View
CIKM
2010
Springer
13 years 3 months ago
Clickthrough-based translation models for web search: from word models to phrase models
Web search is challenging partly due to the fact that search queries and Web documents use different language styles and vocabularies. This paper provides a quantitative analysis ...
Jianfeng Gao, Xiaodong He, Jian-Yun Nie
CICLING
2005
Springer
13 years 10 months ago
A Machine Learning Approach to Information Extraction
Information extraction is concerned with applying natural language processing to automatically extract the essential details from text documents. A great disadvantage of current ap...
Alberto Téllez-Valero, Manuel Montes-y-G&oa...
IRCDL
2008
13 years 6 months ago
Using MPEG-7 for Automatic Annotation of Audiovisual Content in eLearning Digital Libraries
In this paper we present a prototype system to enrich audiovisual contents with annotations, which exploits existing technologies for automatic extraction of metadata (such as OCR...
Giuseppe Amato, Paolo Bolettieri, Franca Debole, F...
CIKM
2009
Springer
13 years 11 months ago
The impact of document structure on keyphrase extraction
Keyphrases are short phrases that reflect the main topic of a document. Because manually annotating documents with keyphrases is a time-consuming process, several automatic appro...
Katja Hofmann, Manos Tsagkias, Edgar Meij, Maarten...
ICCV
2005
IEEE
13 years 10 months ago
Learning Non-Generative Grammatical Models for Document Analysis
— We present a general approach for the hierarchical segmentation and labeling of document layout structures. This approach models document layout as a grammar and performs a glo...
Michael Shilman, Percy Liang, Paul A. Viola