Sciweavers

735 search results - page 34 / 147
» Corpora and data preparation
Sort
View
NIPS
2004
14 years 11 months ago
Sharing Clusters among Related Groups: Hierarchical Dirichlet Processes
We propose the hierarchical Dirichlet process (HDP), a nonparametric Bayesian model for clustering problems involving multiple groups of data. Each group of data is modeled with a...
Yee Whye Teh, Michael I. Jordan, Matthew J. Beal, ...
CORR
2010
Springer
116views Education» more  CORR 2010»
14 years 10 months ago
LiquidXML: Adaptive XML Content Redistribution
We propose to demonstrate LiquidXML, a platform for managing large corpora of XML documents in large-scale P2P networks. All LiquidXML peers may publish XML documents to be shared...
Jesús Camacho-Rodríguez, Asterios Ka...
COLING
2002
14 years 9 months ago
The Computation of Word Associations: Comparing Syntagmatic and Paradigmatic Approaches
It is shown that basic language processes such as the production of free word associations and the generation of synonyms can be simulated using statistical models that analyze th...
Reinhard Rapp
ACL
2009
14 years 7 months ago
Co-Training for Cross-Lingual Sentiment Classification
The lack of Chinese sentiment corpora limits the research progress on Chinese sentiment classification. However, there are many freely available English sentiment corpora on the W...
Xiaojun Wan
IMCSIT
2010
14 years 7 months ago
Parallel, Massive Processing in SuperMatrix - a General Tool for Distributional Semantic Analysis of Corpus
The paper presents an extended version of the SuperMatrix system -- a general tool supporting automatic acquisition of lexical semantic relations from corpora. Extensions focus mai...
Bartosz Broda, Damian Jaworski, Maciej Piasecki