Sciweavers

JSS
2007
118views more  JSS 2007»
13 years 4 months ago
A new imputation method for small software project data sets
Effort prediction is a very important issue for software project management. Historical project data sets are frequently used to support such prediction. But missing data are oft...
Qinbao Song, Martin J. Shepperd
JODS
2007
102views Data Mining» more  JODS 2007»
13 years 4 months ago
Default Clustering with Conceptual Structures
This paper describes a theoretical framework for inducing knowledge from incomplete data sets. The general framework can be used with any formalism based on a lattice structure. It...
Julien Velcin, Jean-Gabriel Ganascia
JMLR
2007
58views more  JMLR 2007»
13 years 4 months ago
Distances between Data Sets Based on Summary Statistics
The concepts of similarity and distance are crucial in data mining. We consider the problem of defining the distance between two data sets by comparing summary statistics compute...
Nikolaj Tatti
JBI
2007
127views Bioinformatics» more  JBI 2007»
13 years 4 months ago
Data integration and genomic medicine
Genomic medicine aims to revolutionize health care by applying our growing understanding of the molecular basis of disease. Research in this arena is data intensive, which means d...
Brenton Louie, Peter Mork, Fernando Martín-...
BMCBI
2005
122views more  BMCBI 2005»
13 years 4 months ago
FACT - a framework for the functional interpretation of high-throughput experiments
Background: Interpreting the results of high-throughput experiments, such as those obtained from DNA-microarrays, is an often time-consuming task due to the high number of data-po...
Felix Kokocinski, Nicolas Delhomme, Gunnar Wrobel,...
SMA
2008
ACM
155views Solid Modeling» more  SMA 2008»
13 years 4 months ago
Filament tracking and encoding for complex biological networks
We present a framework for segmenting and storing filament networks from scalar volume data. Filament structures are commonly found in data generated using high-throughput microsc...
David Mayerich, John Keyser
PAMI
2006
128views more  PAMI 2006»
13 years 4 months ago
Multisurface Proximal Support Vector Machine Classification via Generalized Eigenvalues
A new approach to support vector machine (SVM) classification is proposed wherein each of two data sets are proximal to one of two distinct planes that are not parallel to each oth...
Olvi L. Mangasarian, Edward W. Wild
PAAPP
2006
85views more  PAAPP 2006»
13 years 4 months ago
A new metric splitting criterion for decision trees
Abstract: We examine a new approach to building decision tree by introducing a geometric splitting criterion, based on the properties of a family of metrics on the space of partiti...
Dan A. Simovici, Szymon Jaroszewicz
NN
2006
Springer
104views Neural Networks» more  NN 2006»
13 years 4 months ago
Local multidimensional scaling
Several bioinformatics data sets are naturally represented as graphs, for instance gene regulation, metabolic pathways, and proteinprotein interactions. The graphs are often large ...
Jarkko Venna, Samuel Kaski
CSDA
2007
100views more  CSDA 2007»
13 years 4 months ago
Convergence of random k-nearest-neighbour imputation
Random k-nearest-neighbour (RKNN) imputation is an established algorithm for filling in missing values in data sets. Assume that data are missing in a random way, so that missing...
Fredrik A. Dahl