Sciweavers

» Painless Labeling with Application to Text Mining
SIGMOD
2008
ACM
Webpage understanding: beyond page-level search
In this paper, we introduce the webpage understanding problem, which consists of three subtasks: webpage segmentation, webpage structure labeling, and webpage text segmentation and ...
Zaiqing Nie, Ji-Rong Wen, Wei-Ying Ma
KDD
2004
ACM
Mining reference tables for automatic text segmentation
Automatically segmenting unstructured text strings into structured records is necessary for importing the information contained in legacy sources and text collections into a data ...
Eugene Agichtein, Venkatesh Ganti
ICDM
2003
IEEE
Mining Relevant Text from Unlabelled Documents
Automatic classification of documents is an important area of research with many applications in the fields of document searching, forensics and others. Methods to perform class...
Daniel Barbará, Carlotta Domeniconi, Ning K...
SAC
2004
ACM
An optimized approach for KNN text categorization using P-trees
The importance of text mining stems from the availability of huge volumes of text databases holding a wealth of valuable information that needs to be mined. Text categorization is...
Imad Rahal, William Perrizo
DASFAA
2004
IEEE
Semi-supervised Text Classification Using Partitioned EM
Text classification using a small labeled set and a large unlabeled data set is seen as a promising technique to reduce the labor-intensive and time-consuming effort of labeling traini...
Gao Cong, Wee Sun Lee, Haoran Wu, Bing Liu