Sciweavers

583 search results - page 17 / 117
» Automatic extraction of titles from general documents using ...
Sort
View
EMNLP
2007
14 years 11 months ago
Bootstrapping Information Extraction from Field Books
We present two machine learning approaches to information extraction from semi-structured documents that can be used if no annotated training data are available, but there does ex...
Sander Canisius, Caroline Sporleder
CIKM
2007
Springer
15 years 3 months ago
The role of documents vs. queries in extracting class attributes from text
Challenging the implicit reliance on document collections, this paper discusses the pros and cons of using query logs rather than document collections, as self-contained sources o...
Marius Pasca, Benjamin Van Durme, Nikesh Garera
AUSDM
2008
Springer
367views Data Mining» more  AUSDM 2008»
14 years 11 months ago
Categorical Proportional Difference: A Feature Selection Method for Text Categorization
Supervised text categorization is a machine learning task where a predefined category label is automatically assigned to a previously unlabelled document based upon characteristic...
Mondelle Simeon, Robert J. Hilderman
CORR
2002
Springer
142views Education» more  CORR 2002»
14 years 9 months ago
Learning Algorithms for Keyphrase Extraction
Many academic journals ask their authors to provide a list of about five to fifteen keywords, to appear on the first page of each article. Since these key words are often phrases ...
Peter D. Turney
CIKM
2008
Springer
14 years 11 months ago
Identifying table boundaries in digital documents via sparse line detection
Most prior work on information extraction has focused on extracting information from text in digital documents. However, often, the most important information being reported in an...
Ying Liu, Prasenjit Mitra, C. Lee Giles