Sciweavers

2929 search results - page 127 / 586
» Models of English Text
SIGIR
2011
ACM
14 years 8 months ago
No free lunch: brute force vs. locality-sensitive hashing for cross-lingual pairwise similarity
This work explores the problem of cross-lingual pairwise similarity, where the task is to extract similar pairs of documents across two different languages. Solutions to this pro...
Ferhan Ture, Tamer Elsayed, Jimmy J. Lin
EMNLP
2010
15 years 3 months ago
Enhancing Domain Portability of Chinese Segmentation Model Using Chi-Square Statistics and Bootstrapping
Almost all Chinese language processing tasks involve word segmentation of the language input as their first steps, thus robust and reliable segmentation techniques are always requ...
Baobao Chang, Dongxu Han
NAACL
1994
15 years 6 months ago
On Using Written Language Training Data for Spoken Language Modeling
We attempted to improve recognition accuracy by reducing the inadequacies of the lexicon and language model. Specifically, we address the following three problems: (1) the best size...
Richard M. Schwartz, Long Nguyen, Francis Kubala, ...
PKDD
2007
Springer
15 years 11 months ago
An Effective Approach to Enhance Centroid Classifier for Text Categorization
Centroid Classifier has been shown to be a simple and yet effective method for text categorization. However, it is often plagued with model misfit (or inductive bias) incurred by i...
Songbo Tan, Xueqi Cheng
COLING
2008
15 years 6 months ago
Exact Inference for Multi-label Classification using Sparse Graphical Models
This paper describes a parameter estimation method for multi-label classification that does not rely on approximate inference. It is known that multi-label classification involvin...
Yusuke Miyao, Jun-ichi Tsujii