Sciweavers

1950 search results - page 132 / 390
» Informative sampling for large unbalanced data sets
Sort
View
IPAW
2010
15 years 1 months ago
Publishing and Consuming Provenance Metadata on the Web of Linked Data
The World Wide Web evolves into a Web of Data, a huge, globally distributed dataspace that contains a rich body of machineprocessable information from a virtually unbound set of pr...
Olaf Hartig, Jun Zhao
117
Voted
WSCG
2004
170views more  WSCG 2004»
15 years 4 months ago
Geo-Spatial Data Viewer: From Familiar Land-covering to Arbitrary Distorted Geo-Spatial Quadtree Maps
In many application domains, data is collected and referenced by its geo-spatial location. Spatial data mining, or the discovery of interesting patterns in such databases, is an i...
Daniel A. Keim, Christian Panse, Jörn Schneid...
166
Voted
BMCBI
2010
171views more  BMCBI 2010»
15 years 3 months ago
PyMix - The Python mixture package - a tool for clustering of heterogeneous biological data
Background: Cluster analysis is an important technique for the exploratory analysis of biological data. Such data is often high-dimensional, inherently noisy and contains outliers...
Benjamin Georgi, Ivan Gesteira Costa, Alexander Sc...
107
Voted
BIBE
2007
IEEE
136views Bioinformatics» more  BIBE 2007»
15 years 5 months ago
A Two-Stage Gene Selection Algorithm by Combining ReliefF and mRMR
Abstract—Gene expression data usually contains a large number of genes, but a small number of samples. Feature selection for gene expression data aims at finding a set of genes ...
Yi Zhang, Chris H. Q. Ding, Tao Li
230
Voted
SIGMOD
2008
ACM
164views Database» more  SIGMOD 2008»
16 years 3 months ago
Finding frequent items in probabilistic data
Computing statistical information on probabilistic data has attracted a lot of attention recently, as the data generated from a wide range of data sources are inherently fuzzy or ...
Qin Zhang, Feifei Li, Ke Yi