Search Sciweavers | Sciweavers

385 search results - page 27 / 77

» Improving data mining utility with projective sampling

148

Voted

KDD
2008
ACM

176views Data Mining» more KDD 2008»

Febrl -: an open source data cleaning, deduplication and record linkage system with a graphical user interface

16 years 4 months ago

Download cs.anu.edu.au

Matching records that refer to the same entity across databases is becoming an increasingly important part of many data mining projects, as often data from multiple sources needs ...

Peter Christen

claim paper

Read More »

192

click to vote

BMCBI
2006

144views more BMCBI 2006»

Association algorithm to mine the rules that govern enzyme definition and to classify protein sequences

15 years 4 months ago

Download www.biomedcentral.com

Background: The number of sequences compiled in many genome projects is growing exponentially, but most of them have not been characterized experimentally. An automatic annotation...

Shih-Hau Chiu, Chien-Chi Chen, Gwo-Fang Yuan, Thy-...

claim paper

Read More »

125

click to vote

ICMLA
2008

102views Machine Learning» more ICMLA 2008»

Highly Scalable SVM Modeling with Random Granulation for Spam Sender Detection

15 years 5 months ago

Download www.trustedsource.org

Spam sender detection based on email subject data is a complex large-scale text mining task. The dataset consists of email subject lines and the corresponding IP address of the em...

Yuchun Tang, Yuanchen He, Sven Krasser

claim paper

Read More »

147

click to vote

DKE
2007

95views more DKE 2007»

Warping the time on data streams

15 years 4 months ago

Download www-db.deis.unibo.it

Continuously monitoring through time the correlation/distance of multiple data streams is of interest in a variety of applications, including ﬁnancial analysis, video surveillanc...

Paolo Capitani, Paolo Ciaccia

claim paper

Read More »

145

click to vote

CIKM
2004
Springer

143views Information Technology» more CIKM 2004»

Optimizing web search using web click-through data

15 years 9 months ago

Download apex.sjtu.edu.cn

The performance of web search engines may often deteriorate due to the diversity and noisy information contained within web pages. User click-through data can be used to introduce...

Gui-Rong Xue, Hua-Jun Zeng, Zheng Chen, Yong Yu, W...

claim paper

Read More »

« Prev « First page 27 / 77 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers