Sciweavers

241 search results - page 7 / 49
» Detecting Co-Derivative Documents in Large Text Collections
Sort
View
KDD
2009
ACM
209views Data Mining» more  KDD 2009»
15 years 10 months ago
Collective annotation of Wikipedia entities in web text
To take the first step beyond keyword-based search toward entity-based search, suitable token spans ("spots") on documents must be identified as references to real-world...
Sayali Kulkarni, Amit Singh, Ganesh Ramakrishnan, ...
87
Voted
CIKM
2005
Springer
15 years 3 months ago
Predicting accuracy of extracting information from unstructured text collections
Exploiting lexical and semantic relationships in large unstructured text collections can significantly enhance managing, integrating, and querying information locked in unstructur...
Eugene Agichtein, Silviu Cucerzan
TOIS
2010
128views more  TOIS 2010»
14 years 8 months ago
Learning author-topic models from text corpora
We propose a new unsupervised learning technique for extracting information about authors and topics from large text collections. We model documents as if they were generated by a...
Michal Rosen-Zvi, Chaitanya Chemudugunta, Thomas L...
ISMAR
2009
IEEE
15 years 4 months ago
Augmenting text document by on-line learning of local arrangement of keypoints
We propose a technique for text document tracking over a large range of viewpoints. Since the popular SIFT or SURF descriptors typically fail on such documents, our method conside...
Hideaki Uchiyama, Hideo Saito
CORR
2006
Springer
100views Education» more  CORR 2006»
14 years 9 months ago
Automatic annotation of multilingual text collections with a conceptual thesaurus
Automatic annotation of documents with controlled vocabulary terms (descriptors) from a conceptual thesaurus is not only useful for document indexing and retrieval. The mapping of...
Bruno Pouliquen, Ralf Steinberger, Camelia Ignat