Sciweavers

134 search results - page 20 / 27
» Reconciling Attribute Values from Multiple Data Sources
Sort
View
WWW
2010
ACM
15 years 3 months ago
Enabling entity-based aggregators for web 2.0 data
Selecting and presenting content culled from multiple heterogeneous and physically distributed sources is a challenging task. The exponential growth of the web data in modern time...
Ekaterini Ioannou, Claudia Niederée, Yannis...
BMCBI
2010
171views more  BMCBI 2010»
14 years 11 months ago
PyMix - The Python mixture package - a tool for clustering of heterogeneous biological data
Background: Cluster analysis is an important technique for the exploratory analysis of biological data. Such data is often high-dimensional, inherently noisy and contains outliers...
Benjamin Georgi, Ivan Gesteira Costa, Alexander Sc...
EDM
2009
147views Data Mining» more  EDM 2009»
14 years 9 months ago
Using Dirichlet priors to improve model parameter plausibility
Student modeling is a widely used approach to make inference about a student's attributes like knowledge, learning, etc. If we wish to use these models to analyze and better u...
Dovan Rai, Yue Gong, Joseph Beck
SIGMOD
2008
ACM
158views Database» more  SIGMOD 2008»
15 years 11 months ago
Sampling cube: a framework for statistical olap over sampling data
Sampling is a popular method of data collection when it is impossible or too costly to reach the entire population. For example, television show ratings in the United States are g...
Xiaolei Li, Jiawei Han, Zhijun Yin, Jae-Gil Lee, Y...
SIGMOD
2008
ACM
95views Database» more  SIGMOD 2008»
15 years 11 months ago
Interactive generation of integrated schemas
Schema integration is the problem of creating a unified target schema based on a set of existing source schemas that relate to each other via specified correspondences. The unifie...
Laura Chiticariu, Phokion G. Kolaitis, Lucian Popa