Search Sciweavers | Sciweavers

64 search results - page 8 / 13

» Estimation of English and non-English Language Use on the WW...

127

Voted

NIPS
2004

109views Information Technology» more NIPS 2004»

A Probabilistic Model for Online Document Clustering with Application to Novelty Detection

15 years 3 months ago

Download www.gatsby.ucl.ac.uk

In this paper we propose a probabilistic model for online document clustering. We use non-parametric Dirichlet process prior to model the growing number of clusters, and use a pri...

Jian Zhang 0003, Zoubin Ghahramani, Yiming Yang

claim paper

Read More »

129

click to vote

IDEAS
2008
IEEE

80views Database» more IDEAS 2008»

Improved count suffix trees for natural language data

15 years 8 months ago

Download dbis.ipd.uni-karlsruhe.de

With more and more natural language text stored in databases, handling respective query predicates becomes very important. Optimizing queries with predicates includes (sub)string ...

Guido Sautter, Cristina Abba, Klemens Böhm

claim paper

Read More »

Voted

IJDAR
2007

106views more IJDAR 2007»

Investigation and modeling of the structure of texting language

15 years 1 months ago

Download research.ihost.com

Language usage over computer mediated discourses, like chats, emails and SMS texts, significantly differs from the standard form of the language. An urge towards shorter message l...

Monojit Choudhury, Rahul Saraf, Vijit Jain, Animes...

claim paper

Read More »

109

Voted

EMNLP
2009

147views Natural Language Processing» more EMNLP 2009»

Discriminative Corpus Weight Estimation for Machine Translation

14 years 11 months ago

Download aclweb.org

Current statistical machine translation (SMT) systems are trained on sentencealigned and word-aligned parallel text collected from various sources. Translation model parameters ar...

Spyros Matsoukas, Antti-Veikko I. Rosti, Bing Zhan...

claim paper

Read More »

107

click to vote

NLPRS
2001
Springer

125views Natural Language Processing» more NLPRS 2001»

A Simple Closed-Class/Open-Class Factorization for Improved Language Modeling

15 years 6 months ago

Download www.afnlp.org

We describe a simple improvement to ngram language models where we estimate the distribution over closed-class (function) words separately from the conditional distribution of ope...

Fuchun Peng, Dale Schuurmans

claim paper

Read More »

« Prev « First page 8 / 13 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers