Sciweavers

483 search results - page 20 / 97
» Sampling the Web as Training Data for Text Classification
Sort
View
TCBB
2011
14 years 10 months ago
Ensemble Learning with Active Example Selection for Imbalanced Biomedical Data Classification
—In biomedical data, the imbalanced data problem occurs frequently and causes poor prediction performance for minority classes. It is because the trained classifiers are mostly d...
Sangyoon Oh, Min Su Lee, Byoung-Tak Zhang
CIKM
2009
Springer
15 years 9 months ago
Improving web page classification by label-propagation over click graphs
In this paper, we present a semi-supervised learning method for web page classification, leveraging click logs to augment training data by propagating class labels to unlabeled si...
Soo-Min Kim, Patrick Pantel, Lei Duan, Scott Gaffn...
SIGIR
2008
ACM
15 years 2 months ago
Deep classification in large-scale text hierarchies
Most classification algorithms are best at categorizing the Web documents into a few categories, such as the top two levels in the Open Directory Project. Such a classification me...
Gui-Rong Xue, Dikan Xing, Qiang Yang, Yong Yu
ICIP
2009
IEEE
16 years 4 months ago
Parking Space Detection From Video By Augmenting Training Dataset
Auto parking techniques are attracting more attention these days. In this paper, we develop an image-based method to estimate the depth contour in parking areas. Our algorithm is ...
CORR
2011
Springer
183views Education» more  CORR 2011»
14 years 6 months ago
Learning When Training Data are Costly: The Effect of Class Distribution on Tree Induction
For large, real-world inductive learning problems, the number of training examples often must be limited due to the costs associated with procuring, preparing, and storing the tra...
Foster J. Provost, Gary M. Weiss