Sciweavers

3716 search results - page 59 / 744
» On the monotonization of the training set
Sort
View
KDD
2007
ACM
165views Data Mining» more  KDD 2007»
16 years 1 months ago
Finding low-entropy sets and trees from binary data
The discovery of subsets with special properties from binary data has been one of the key themes in pattern discovery. Pattern classes such as frequent itemsets stress the co-occu...
Eino Hinkkanen, Hannes Heikinheimo, Heikki Mannila...
LREC
2008
84views Education» more  LREC 2008»
15 years 2 months ago
Statistical Identification of English Loanwords in Korean Using Automatically Generated Training Data
This paper describes an accurate, extensible method for automatically classifying unknown foreign words that requires minimal monolingual resources and no bilingual training data ...
Kirk Baker, Chris Brew
99
Voted
NAACL
2004
15 years 2 months ago
Name Tagging with Word Clusters and Discriminative Training
We present a technique for augmenting annotated training data with hierarchical word clusters that are automatically derived from a large unannotated corpus. Cluster membership is...
Scott Miller, Jethran Guinness, Alex Zamanian
124
Voted
ICANN
2001
Springer
15 years 5 months ago
Fast Training of Support Vector Machines by Extracting Boundary Data
Support vector machines have gotten wide acceptance for their high generalization ability for real world applications. But the major drawback is slow training for classification p...
Shigeo Abe, Takuya Inoue
112
Voted
ICDM
2007
IEEE
157views Data Mining» more  ICDM 2007»
15 years 2 months ago
Training Conditional Random Fields by Periodic Step Size Adaptation for Large-Scale Text Mining
For applications with consecutive incoming training examples, on-line learning has the potential to achieve a likelihood as high as off-line learning without scanning all availabl...
Han-Shen Huang, Yu-Ming Chang, Chun-Nan Hsu