Sciweavers

CIKM
2007
Springer

"More like these": growing entity classes from seeds

13 years 10 months ago
"More like these": growing entity classes from seeds
We present a corpus-based approach to the class expansion task. For a given set of seed entities we use co-occurrence statistics taken from a text collection to define a membership function that is used to rank candidate entities for inclusion in the set. We describe an evaluation framework that uses data from Wikipedia. The performance of our class extension method improves as the size of the text collection increases. Categories and Subject Descriptors H.3 [Information Storage and Retrieval]: H.3.1 Content Analysis and Indexing; H.3.3 Information Search and Retrieval; H.3.4 Systems and Software; H.4 [Information Systems Applications]: H.4.2 Types of Systems; H.4.m Miscellaneous General Terms Algorithms, Measurement, Performance Keywords Lexical acquisition, List expansion
Luís Sarmento, Valentin Jijkoun, Maarten de
Added 07 Jun 2010
Updated 07 Jun 2010
Type Conference
Year 2007
Where CIKM
Authors Luís Sarmento, Valentin Jijkoun, Maarten de Rijke, Eugenio Oliveira
Comments (0)