Sciweavers

338 search results - page 3 / 68
» Probabilistic Models to Reconcile Complex Data from Inaccura...
Sort
View
PVLDB
2010
118views more  PVLDB 2010»
13 years 3 months ago
Global Detection of Complex Copying Relationships Between Sources
Web technologies have enabled data sharing between sources but also simplified copying (and often publishing without proper attribution). The copying relationships can be complex...
Xin Dong, Laure Berti-Equille, Yifan Hu, Divesh Sr...
SGAI
2009
Springer
13 years 12 months ago
From Source Code to Runtime Behaviour: Software Metrics Help to Select the Computer Architecture
The decision which hardware platform to use for a certain application is an important problem in computer architecture. This paper reports on a study where a data-mining approach i...
Frank Eichinger, David Kramer, Klemens Böhm, ...
KDD
2005
ACM
149views Data Mining» more  KDD 2005»
13 years 10 months ago
A distributed learning framework for heterogeneous data sources
We present a probabilistic model-based framework for distributed learning that takes into account privacy restrictions and is applicable to scenarios where the different sites ha...
Srujana Merugu, Joydeep Ghosh
SIGMOD
2008
ACM
159views Database» more  SIGMOD 2008»
14 years 5 months ago
Bootstrapping pay-as-you-go data integration systems
Data integration systems offer a uniform interface to a set of data sources. Despite recent progress, setting up and maintaining a data integration application still requires sign...
Anish Das Sarma, Xin Dong, Alon Y. Halevy
ACL
2007
13 years 6 months ago
Generating Complex Morphology for Machine Translation
We present a novel method for predicting inflected word forms for generating morphologically rich languages in machine translation. We utilize a rich set of syntactic and morphol...
Einat Minkov, Kristina Toutanova, Hisami Suzuki