Search Sciweavers | Sciweavers

36 search results - page 2 / 8

» Painless Labeling with Application to Text Mining

click to vote

SIGMOD
2008
ACM

119views Database» more SIGMOD 2008»

Webpage understanding: beyond page-level search

14 years 5 months ago

Download research.microsoft.com

In this paper we introduce the webpage understanding problem which consists of three subtasks: webpage segmentation, webpage structure labeling, and webpage text segmentation and ...

Zaiqing Nie, Ji-Rong Wen, Wei-Ying Ma

claim paper

Read More »

click to vote

KDD
2004
ACM

114views Data Mining» more KDD 2004»

Mining reference tables for automatic text segmentation

14 years 5 months ago

Download www.mathcs.emory.edu

Automatically segmenting unstructured text strings into structured records is necessary for importing the information contained in legacy sources and text collections into a data ...

Eugene Agichtein, Venkatesh Ganti

claim paper

Read More »

click to vote

ICDM
2003
IEEE

126views Data Mining» more ICDM 2003»

Mining Relevant Text from Unlabelled Documents

13 years 10 months ago

Download cs.gmu.edu

Automatic classiﬁcation of documents is an important area of research with many applications in the ﬁelds of document searching, forensics and others. Methods to perform class...

Daniel Barbará, Carlotta Domeniconi, Ning K...

claim paper

Read More »

click to vote

SAC
2004
ACM

111views Applied Computing» more SAC 2004»

An optimized approach for KNN text categorization using P-trees

13 years 10 months ago

Download www.cis.uab.edu

The importance of text mining stems from the availability of huge volumes of text databases holding a wealth of valuable information that needs to be mined. Text categorization is...

Imad Rahal, William Perrizo

claim paper

Read More »

click to vote

DASFAA
2004
IEEE

135views Database» more DASFAA 2004»

Semi-supervised Text Classification Using Partitioned EM

13 years 9 months ago

Download www.cs.uic.edu

Text classification using a small labeled set and a large unlabeled data is seen as a promising technique to reduce the labor-intensive and time consuming effort of labeling traini...

Gao Cong, Wee Sun Lee, Haoran Wu, Bing Liu

claim paper

Read More »

« Prev « First page 2 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers