Sciweavers

1768 search results - page 246 / 354
» Mining Very Large Databases
Sort
View
122
Voted
KDD
2008
ACM
135views Data Mining» more  KDD 2008»
16 years 4 months ago
DiMaC: a disguised missing data cleaning tool
In some applications such as filling in a customer information form on the web, some missing values may not be explicitly represented as such, but instead appear as potentially va...
Ming Hua, Jian Pei
141
Voted
ICDM
2009
IEEE
109views Data Mining» more  ICDM 2009»
15 years 10 months ago
Knowledge Discovery from Citation Networks
—Knowledge discovery from scientific articles has received increasing attentions recently since huge repositories are made available by the development of the Internet and digit...
Zhen Guo, Zhongfei Zhang, Shenghuo Zhu, Yun Chi, Y...
141
Voted
KDD
2010
ACM
199views Data Mining» more  KDD 2010»
15 years 7 months ago
Online discovery and maintenance of time series motifs
The detection of repeated subsequences, time series motifs, is a problem which has been shown to have great utility for several higher-level data mining algorithms, including clas...
Abdullah Mueen, Eamonn J. Keogh
140
Voted
SSDBM
2003
IEEE
164views Database» more  SSDBM 2003»
15 years 9 months ago
Approximate String Joins
String data is ubiquitous, and its management has taken on particular importance in the past few years. Approximate queries are very important on string data especially for more c...
Divesh Srivastava
150
Voted
BMCBI
2006
120views more  BMCBI 2006»
15 years 3 months ago
Projections for fast protein structure retrieval
Background: In recent times, there has been an exponential rise in the number of protein structures in databases e.g. PDB. So, design of fast algorithms capable of querying such d...
Sourangshu Bhattacharya, Chiranjib Bhattacharyya, ...