Sciweavers

84 search results - page 10 / 17
» Negative Training Data Can be Harmful to Text Classification
Sort
View
EMNLP
2009
14 years 9 months ago
Discriminative Corpus Weight Estimation for Machine Translation
Current statistical machine translation (SMT) systems are trained on sentencealigned and word-aligned parallel text collected from various sources. Translation model parameters ar...
Spyros Matsoukas, Antti-Veikko I. Rosti, Bing Zhan...
IJCAI
2003
15 years 1 months ago
Web Page Cleaning for Web Mining through Feature Weighting
Unlike conventional data or text, Web pages typically contain a large amount of information that is not part of the main contents of the pages, e.g., banner ads, navigation bars, ...
Lan Yi, Bing Liu
PAKDD
2000
ACM
128views Data Mining» more  PAKDD 2000»
15 years 3 months ago
A Comparative Study of Classification Based Personal E-mail Filtering
This paper addresses personal E-mail filtering by casting it in the framework of text classification. Modeled as semi-structured documents, Email messages consist of a set of field...
Yanlei Diao, Hongjun Lu, Dekai Wu
94
Voted
ECCV
2008
Springer
16 years 1 months ago
Learning to Localize Objects with Structured Output Regression
Sliding window classifiers are among the most successful and widely applied techniques for object localization. However, training is typically done in a way that is not specific to...
Matthew B. Blaschko, Christoph H. Lampert
ICDM
2010
IEEE
147views Data Mining» more  ICDM 2010»
14 years 9 months ago
Location and Scatter Matching for Dataset Shift in Text Mining
Dataset shift from the training data in a source domain to the data in a target domain poses a great challenge for many statistical learning methods. Most algorithms can be viewed ...
Bo Chen, Wai Lam, Ivor W. Tsang, Tak-Lam Wong