Search Sciweavers | Sciweavers

» Metric Learning for Text Documents

FLAIRS
2001

Extracting Partial Structures from HTML Documents

15 years 7 months ago

Download qir.kyushu-u.ac.jp

A new wrapper model for extracting text data from HTML documents is introduced. Kushmerick's wrapper class (Kushmerick 2000) may be unsuccessful in the case that suff...

Hiroshi Sakamoto, Yoshitsugu Murakami, Hiroki Arim...

MLDM
2007
Springer

PE-PUC: A Graph Based PU-Learning Approach for Text Classification

16 years 6 days ago

Download dm.thss.tsinghua.edu.cn

This paper presents a novel solution for the problem of building a text classifier using positive documents (P) and unlabeled documents (U). Here, the unlabeled documents are mixed w...

Shuang Yu, Chunping Li

WSDM
2010
ACM

Learning Similarity Metrics for Event Identification in Social Media

16 years 3 months ago

Download infolab.stanford.edu

Social media sites (e.g., Flickr, YouTube, and Facebook) are a popular distribution outlet for users looking to share their experiences and interests on the Web. These sites host ...

Hila Becker, Mor Naaman, Luis Gravano

ICML
2001
IEEE

Learning to Select Good Title Words: An New Approach based on Reverse Information Retrieval

16 years 6 months ago

Download www.informedia.cs.cmu.edu

In this paper, we show how we can learn to select good words for a document title. We view the problem of selecting good title words for a document as a variant of an Information ...

Rong Jin, Alexander G. Hauptmann

IPM
2006

Text mining without document context

15 years 6 months ago

Download fidelia1.free.fr

We consider a challenging clustering task: the clustering of multi-word terms without document co-occurrence information in order to form coherent groups of topics. For this task, ...

Eric SanJuan, Fidelia Ibekwe-Sanjuan
