Sciweavers

937 search results - page 108 / 188
» Multi-Dimensional Text Classification
Sort
View
NIPS
2001
15 years 2 months ago
Latent Dirichlet Allocation
We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of discrete data such as text corpora. LDA is a three-level hierarchical Bayesian m...
David M. Blei, Andrew Y. Ng, Michael I. Jordan
SODA
1998
ACM
157views Algorithms» more  SODA 1998»
15 years 2 months ago
Approximate String Matching: A Simpler Faster Algorithm
We give two algorithms for finding all approximate matches of a pattern in a text, where the edit distance between the pattern and the matching text substring is at most k. The fir...
Richard Cole, Ramesh Hariharan
IJON
2006
78views more  IJON 2006»
15 years 1 months ago
Improving self-organization of document collections by semantic mapping
In text management tasks, the dimensionality reduction becomes necessary to computation and interpretability of the results generated by machine learning algorithms. This paper de...
Renato Fernandes Corrêa, Teresa Bernarda Lud...
IPM
2006
130views more  IPM 2006»
15 years 1 months ago
Exploiting structural information for semi-structured document categorization
This paper examines several different approaches to exploiting structural information in semi-structured document categorization. The methods under consideration are designed for ...
Andrej Bratko, Bogdan Filipic
SIGIR
2008
ACM
15 years 1 months ago
On document splitting in passage detection
Passages can be hidden within a text to circumvent their disallowed transfer. Such release of compartmentalized information is of concern to all corporate and governmental organiz...
Nazli Goharian, Saket S. R. Mengle