Sciweavers

587 search results - page 58 / 118
» New Algorithms for Text Fingerprinting
Sort
View
IDEAS
2008
IEEE
80views Database» more  IDEAS 2008»
15 years 4 months ago
Improved count suffix trees for natural language data
With more and more natural language text stored in databases, handling respective query predicates becomes very important. Optimizing queries with predicates includes (sub)string ...
Guido Sautter, Cristina Abba, Klemens Böhm
WWW
2008
ACM
15 years 10 months ago
Automatic web image selection with a probabilistic latent topic model
We propose a new method to select relevant images to the given keywords from images gathered from the Web based on the Probabilistic Latent Semantic Analysis (PLSA) model which is...
Keiji Yanai
WWW
2007
ACM
15 years 10 months ago
Image collector III: a web image-gathering system with bag-of-keypoints
We propose a new system to mine visual knowledge on the Web. There are huge image data as well as text data on the Web. However, mining image data from the Web is paid less attent...
Keiji Yanai
ICMLA
2008
14 years 11 months ago
Graph-Based Multilevel Dimensionality Reduction with Applications to Eigenfaces and Latent Semantic Indexing
Dimension reduction techniques have been successfully applied to face recognition and text information retrieval. The process can be time-consuming when the data set is large. Thi...
Sophia Sakellaridi, Haw-ren Fang, Yousef Saad
ICMLA
2007
14 years 11 months ago
Memory-based context-sensitive spelling correction at web scale
We study the problem of correcting spelling mistakes in text using memory-based learning techniques and a very large database of token n-gram occurrences in web text as training d...
Andrew Carlson, Ian Fette