Sciweavers

4660 search results - page 654 / 932
» Learning from imperfect data
Sort
View
AAAI
1998
15 years 2 months ago
Modeling Web Sources for Information Integration
The Web is based on a browsing paradigm that makes it di cult to retrieve and integrate data from multiple sites. Today, the only way to do this is to build specialized applicatio...
Craig A. Knoblock, Steven Minton, José Luis...
113
Voted
WWW
2006
ACM
16 years 1 months ago
Interactive wrapper generation with minimal user effort
While much of the data on the web is unstructured in nature, there is also a significant amount of embedded structured data, such as product information on e-commerce sites or sto...
Utku Irmak, Torsten Suel
SEMWEB
2005
Springer
15 years 6 months ago
Rapid Benchmarking for Semantic Web Knowledge Base Systems
Abstract. We present a method for rapid development of benchmarks for Semantic Web knowledge base systems. At the core, we have a synthetic data generation approach for OWL that is...
Sui-Yu Wang, Yuanbo Guo, Abir Qasem, Jeff Heflin
KDD
2009
ACM
193views Data Mining» more  KDD 2009»
15 years 7 months ago
Category detection using hierarchical mean shift
Many applications in surveillance, monitoring, scientific discovery, and data cleaning require the identification of anomalies. Although many methods have been developed to iden...
Pavan Vatturi, Weng-Keen Wong
134
Voted
KDD
2004
ACM
106views Data Mining» more  KDD 2004»
16 years 1 months ago
Early detection of insider trading in option markets
"Inside information" comes in many forms: knowledge of a corporate takeover, a terrorist attack, unexpectedly poor earnings, the FDA's acceptance of a new drug, etc...
Steve Donoho