Sciweavers

1332 search results - page 185 / 267
» Database Challenges in the Integration of Biomedical Data Se...
IPPS
2003
IEEE
15 years 3 months ago
Parallel ROLAP Data Cube Construction On Shared-Nothing Multiprocessors
The pre-computation of data cubes is critical to improving the response time of On-Line Analytical Processing (OLAP) systems and can be instrumental in accelerating data mining tas...
Ying Chen, Frank K. H. A. Dehne, Todd Eavis, Andre...
PODS
2010
ACM
15 years 3 months ago
Optimal sampling from distributed streams
A fundamental problem in data management is to draw a sample of a large data set, for approximate query answering, selectivity estimation, and query planning. With large, streamin...
Graham Cormode, S. Muthukrishnan, Ke Yi, Qin Zhang
SIGMOD
2000
ACM
15 years 2 months ago
Finding Replicated Web Collections
Many web documents (such as JAVA FAQs) are being replicated on the Internet. Often entire document collections (such as hyperlinked Linux manuals) are being replicated many times....
Junghoo Cho, Narayanan Shivakumar, Hector Garcia-M...
COLING
2010
14 years 4 months ago
Exploiting Structured Ontology to Organize Scattered Online Opinions
We study the problem of integrating scattered online opinions. For this purpose, we propose to exploit structured ontology to obtain well-formed relevant aspects to a topic and us...
Yue Lu, Huizhong Duan, Hongning Wang, ChengXiang Z...
WWW
2001
ACM
15 years 10 months ago
Effective Web data extraction with standard XML technologies
We discuss the problem of Web data extraction and describe an XML-based methodology whose goal extends far beyond simple "screen scraping." An ideal data extraction proc...
Jussi Myllymaki