Sciweavers

3371 search results - page 522 / 675
» Using parsimonious language models on web data
KDD
2009
ACM
16 years 5 months ago
Address standardization with latent semantic association
Address standardization is a very challenging task in data cleansing. To provide better customer relationship management and business intelligence for customer-oriented corporations...
Honglei Guo, Huijia Zhu, Zhili Guo, Xiaoxun Zhang,...
ACL
1997
15 years 6 months ago
A Model of Lexical Attraction and Repulsion
This paper introduces new methods based on exponential families for modeling the correlations between words in text and speech. While previous work assumed the effects of word co-...
Doug Beeferman, Adam L. Berger, John D. Lafferty
ACL
2009
15 years 2 months ago
Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty
Stochastic gradient descent (SGD) uses approximate gradients estimated from subsets of the training data and updates the parameters in an online fashion. This learning framework i...
Yoshimasa Tsuruoka, Jun-ichi Tsujii, Sophia Anania...
SIGIR
2009
ACM
15 years 11 months ago
An improved Markov random field model for supporting verbose queries
Recent work in supervised learning of term-based retrieval models has shown significantly improved accuracy can often be achieved via better model estimation [2, 10, 11, 17]. In ...
Matthew Lease
SCP
2008
15 years 4 months ago
Google's MapReduce programming model - Revisited
Google's MapReduce programming model serves for processing large data sets in a massively parallel manner. We deliver the first rigorous description of the model including it...
Ralf Lämmel