Sciweavers

568 search results - page 46 / 114
» Efficient Distribution Mining and Classification
Sort
View
121
Voted
SDM
2008
SIAM
197views Data Mining» more  SDM 2008»
15 years 5 months ago
A general framework for estimating similarity of datasets and decision trees: exploring semantic similarity of decision trees
Decision trees are among the most popular pattern types in data mining due to their intuitive representation. However, little attention has been given on the definition of measure...
Irene Ntoutsi, Alexandros Kalousis, Yannis Theodor...
257
Voted
ICDE
2009
IEEE
157views Database» more  ICDE 2009»
16 years 5 months ago
A Rule-Based Classification Algorithm for Uncertain Data
Abstract-- Data uncertainty is common in real-world applications due to various causes, including imprecise measurement, network latency, outdated sources and sampling errors. Thes...
Biao Qin, Yuni Xia, Sunil Prabhakar, Yi-Cheng Tu
155
Voted
CSCW
2011
ACM
14 years 10 months ago
Your time zone or mine?: a study of globally time zone-shifted collaboration
We conducted interviews with sixteen members of teams that worked across global time zone differences. Despite time zone differences of about eight hours, collaborators still foun...
John C. Tang, Chen Zhao, Xiang Cao, Kori Inkpen
144
Voted
GECCO
2008
Springer
232views Optimization» more  GECCO 2008»
15 years 4 months ago
An efficient SVM-GA feature selection model for large healthcare databases
This paper presents an efficient hybrid feature selection model based on Support Vector Machine (SVM) and Genetic Algorithm (GA) for large healthcare databases. Even though SVM an...
Rick Chow, Wei Zhong, Michael Blackmon, Richard St...
ACL
2008
15 years 5 months ago
Distributed Word Clustering for Large Scale Class-Based Language Modeling in Machine Translation
In statistical language modeling, one technique to reduce the problematic effects of data sparsity is to partition the vocabulary into equivalence classes. In this paper we invest...
Jakob Uszkoreit, Thorsten Brants