Sciweavers

» An Analysis of the XSL Algorithm
CIKM
2007
Springer
15 years 7 months ago
"More like these": growing entity classes from seeds
We present a corpus-based approach to the class expansion task. For a given set of seed entities we use co-occurrence statistics taken from a text collection to define a membersh...
Luís Sarmento, Valentin Jijkoun, Maarten de...
CIKM
2007
Springer
15 years 7 months ago
Sigma encoded inverted files
Compression of term frequency lists and very long document-id lists within an inverted file search engine are examined. Several compression schemes are compared including Elias γ...
Andrew Trotman, Vikram Subramanya
CIKM
2007
Springer
15 years 7 months ago
Structure and semantics for expressive text kernels
Several problems in text categorization are too hard to be solved by standard bag-of-words representations. Work in kernel-based learning has approached this problem by (i) consid...
Stephan Bloehdorn, Alessandro Moschitti
CIKM
2007
Springer
15 years 7 months ago
Wikify!: linking documents to encyclopedic knowledge
This paper introduces the use of Wikipedia as a resource for automatic keyword extraction and word sense disambiguation, and shows how this online encyclopedia can be used to achi...
Rada Mihalcea, Andras Csomai
CIKM
2007
Springer
15 years 7 months ago
Developing learning strategies for topic-based summarization
Most up-to-date well-behaved topic-based summarization systems are built upon the extractive framework. They score the sentences based on the associated features by manually assig...
Ouyang You, Sujian Li, Wenjie Li