Sciweavers

142 search results - page 18 / 29
» Contemporaneous text as side-information in statistical lang...
Sort
View
AUSDM
2006
Springer
112views Data Mining» more  AUSDM 2006»
15 years 5 months ago
The Scamseek Project - Text Mining for Financial Scams on the Internet
The Scamseek project, as commissioned by ASIC has the principal objective of building an industrially viable system that retrieves potential scam candidate documents from the Inte...
Jon Patrick
ICML
2006
IEEE
16 years 2 months ago
Topic modeling: beyond bag-of-words
Some models of textual corpora employ text generation methods involving n-gram statistics, while others use latent topic variables inferred using the "bag-of-words" assu...
Hanna M. Wallach
EACL
2003
ACL Anthology
15 years 3 months ago
A Comparison of Event Models for Naive Bayes Anti-Spam E-Mail Filtering
We describe experiments with a Naive Bayes text classifier in the context of anti-spam E-mail filtering, using two different statistical event models: a multi-variate Bernoulli ...
Karl-Michael Schneider
DELOS
2001
15 years 3 months ago
Relevance Feedback for Best Match Term Weighting Algorithms in Information Retrieval
Personalisation in full text retrieval or full text filtering implies reweighting of the query terms based on some explicit or implicit feedback from the user. Relevance feedback i...
Djoerd Hiemstra, Stephen E. Robertson
ACL
1997
15 years 3 months ago
A Model of Lexical Attraction and Repulsion
This paper introduces new methods based on exponential families for modeling the correlations between words in text and speech. While previous work assumed the effects of word co-...
Doug Beeferman, Adam L. Berger, John D. Lafferty