Sciweavers

948 search results - page 186 / 190
» The similarity metric
Sort
View
NIPS
2007
15 years 1 months ago
Mining Internet-Scale Software Repositories
Large repositories of source code create new challenges and opportunities for statistical machine learning. Here we first develop Sourcerer, an infrastructure for the automated c...
Erik Linstead, Paul Rigor, Sushil Krishna Bajracha...
CONEXT
2009
ACM
15 years 25 days ago
Exploiting dynamicity in graph-based traffic analysis: techniques and applications
Network traffic can be represented by a Traffic Dispersion Graph (TDG) that contains an edge between two nodes that send a particular type of traffic (e.g., DNS) to one another. T...
Marios Iliofotou, Michalis Faloutsos, Michael Mitz...
GECCO
2010
Springer
186views Optimization» more  GECCO 2010»
15 years 25 days ago
Genetic rule extraction optimizing brier score
Most highly accurate predictive modeling techniques produce opaque models. When comprehensible models are required, rule extraction is sometimes used to generate a transparent mod...
Ulf Johansson, Rikard König, Lars Niklasson
BIOINFORMATICS
2007
137views more  BIOINFORMATICS 2007»
14 years 12 months ago
Annotation-based distance measures for patient subgroup discovery in clinical microarray studies
: Background Clustering algorithms are widely used in the analysis of microarray data. In clinical studies, they are often applied to find groups of co-regulated genes. Clustering...
Claudio Lottaz, Joern Toedling, Rainer Spang
BMCBI
2010
130views more  BMCBI 2010»
14 years 12 months ago
Amino acid "little Big Bang": Representing amino acid substitution matrices as dot products of Euclidian vectors
Background: Sequence comparisons make use of a one-letter representation for amino acids, the necessary quantitative information being supplied by the substitution matrices. This ...
Karel Zimmermann, Jean-François Gibrat