Search Sciweavers | Sciweavers

507 search results - page 59 / 102

» Using Text Mining and Link Analysis for Software Mining


KDD
2004
ACM

158 views, Data Mining

A generalized maximum entropy approach to Bregman co-clustering and matrix approximation

16 years 3 months ago

Download www.ideal.ece.utexas.edu

Co-clustering is a powerful data mining technique with varied applications such as text clustering, microarray analysis and recommender systems. Recently, an information-theoretic ...

Arindam Banerjee, Inderjit S. Dhillon, Joydeep Gho...


MSV
2004

162 views, Modeling And Simulation

MABAC - Matrix Based Clustering Algorithm

15 years 4 months ago

Download sosa.ucsd.edu

Clustering is a prominent method in the data mining field. It is a discovery process that groups data such that intra-cluster similarity is maximized and the inter-cluster similar...

Yonghui Chen, Alan P. Sprague, Kevin D. Reilly


SIGIR
2009
ACM

146 views, Information Technology

Identifying the original contribution of a document via language modeling

15 years 10 months ago

Download www.cs.cornell.edu

One major goal of text mining is to provide automatic methods to help humans grasp the key ideas in ever-increasing text corpora. To this effect, we propose a statistica...

Benyah Shaparenko, Thorsten Joachims


WWW
2006
ACM

139 views, Internet Technology

Do not crawl in the DUST: different URLs with similar text

15 years 9 months ago

Download www2007.org

We consider the problem of DUST: Different URLs with Similar Text. Such duplicate URLs are prevalent in web sites, as web server software often uses aliases and redirections, and...

Uri Schonfeld, Ziv Bar-Yossef, Idit Keidar


ADVIS
2004
Springer

145 views, Information Technology

Multiple Sets of Rules for Text Categorization

15 years 8 months ago

Download www.infm.ulst.ac.uk

An important issue in text mining is how to make use of multiple pieces of knowledge discovered to improve future decisions. In this paper, we propose a new approach to combining mult...

Yaxin Bi, Terry J. Anderson, Sally I. McClean
