Sciweavers

483 search results - page 46 / 97
» Sampling the Web as Training Data for Text Classification
Sort
View
BMCBI
2006
165views more  BMCBI 2006»
15 years 3 months ago
A stable gene selection in microarray data analysis
Background: Microarray data analysis is notorious for involving a huge number of genes compared to a relatively small number of samples. Gene selection is to detect the most signi...
Kun Yang, Zhipeng Cai, Jianzhong Li, Guohui Lin
ICDM
2009
IEEE
151views Data Mining» more  ICDM 2009»
15 years 28 days ago
TagLearner: A P2P Classifier Learning System from Collaboratively Tagged Text Documents
The amount of text data on the Internet is growing at a very fast rate. Online text repositories for news agencies, digital libraries and other organizations currently store gigaan...
Haimonti Dutta, Xianshu Zhu, Tushar Mahule, Hillol...
WWW
2006
ACM
16 years 3 months ago
Large-scale text categorization by batch mode active learning
Large-scale text categorization is an important research topic for Web data mining. One of the challenges in large-scale text categorization is how to reduce the amount of human e...
Steven C. H. Hoi, Rong Jin, Michael R. Lyu
CICLING
2004
Springer
15 years 8 months ago
Automatic Learning Features Using Bootstrapping for Text Categorization
When text categorization is applied to complex tasks, it is tedious and expensive to hand-label the large amounts of training data necessary for good performance. In this paper, we...
Wenliang Chen, Jingbo Zhu, Honglin Wu, Tianshun Ya...
WWW
2007
ACM
16 years 3 months ago
Extraction and classification of dense communities in the web
The World Wide Web (WWW) is rapidly becoming important for society as a medium for sharing data, information and services, and there is a growing interest in tools for understandi...
Yon Dourisboure, Filippo Geraci, Marco Pellegrini