Sciweavers

10293 search results - page 195 / 2059
» Describing Semistructured Data
Sort
View
WWW
2005
ACM
16 years 5 months ago
Using visual cues for extraction of tabular data from arbitrary HTML documents
We describe a method to extract tabular data from web pages. Rather than just analyzing the DOM tree, we also exploit visual cues in the rendered version of the document to extrac...
Bernhard Krüpl, Marcus Herzog, Wolfgang Gatte...
142
Voted
PODS
2007
ACM
139views Database» more  PODS 2007»
16 years 5 months ago
Management of probabilistic data: foundations and challenges
Many applications today need to manage large data sets with uncertainties. In this paper we describe the foundations of managing data where the uncertainties are quantified as pro...
Nilesh N. Dalvi, Dan Suciu
ASUNAM
2009
IEEE
15 years 11 months ago
Prying Data out of a Social Network
—Preventing adversaries from compiling significant amounts of user data is a major challenge for social network operators. We examine the difficulty of collecting profile and ...
Joseph Bonneau, Jonathan Anderson, George Danezis
ISBRA
2009
Springer
15 years 11 months ago
Practical Quality Assessment of Microarray Data by Simulation of Differential Gene Expression
There are many methods for assessing the quality of microarray data, but little guidance regarding what to do when defective data is identified. Depending on the scientific questio...
Brian E. Howard, Beate Sick, Steffen Heber
ADMI
2009
Springer
15 years 11 months ago
Agent-Enriched Data Mining Using an Extendable Framework
An extendable and generic Agent Enriched Data Mining (AEDM) framework, EMADS (the Extendable Multi-Agent Data mining System) is described. The central feature of the framework is ...
Kamal Ali Albashiri, Frans Coenen