Sciweavers

» A Conceptual-Modeling Approach to Extracting Data from the W...
DOCENG
2010
ACM
14 years 8 months ago
From templates to schemas: bridging the gap between free editing and safe data processing
In this paper we present tools that provide an easy way to edit XML content directly on the web, with the usual benefit of valid XML content. These tools make it possible to crea...
Vincent Quint, Cécile Roisin, Stépha...
MICAI
2007
Springer
15 years 3 months ago
Taking Advantage of the Web for Text Classification with Imbalanced Classes
A problem of supervised approaches for text classification is that they commonly require high-quality training data to construct an accurate classifier. Unfortunately, in many real...
Rafael Guzmán-Cabrera, Manuel Montes-y-Gó...
WWW
2009
ACM
15 years 4 months ago
Bootstrapped extraction of class attributes
As an alternative to previous studies on extracting class attributes from unstructured text, which consider either Web documents or query logs as the source of textual data, a boo...
Joseph Reisinger, Marius Pasca
CIKM
2010
ACM
14 years 8 months ago
Clickthrough-based translation models for web search: from word models to phrase models
Web search is challenging partly due to the fact that search queries and Web documents use different language styles and vocabularies. This paper provides a quantitative analysis ...
Jianfeng Gao, Xiaodong He, Jian-Yun Nie
DIS
2001
Springer
15 years 2 months ago
Dynamic Aggregation to Support Pattern Discovery: A Case Study with Web Logs
Rapid growth of digital data collections is overwhelming the capabilities of humans to comprehend them without aid. The extraction of useful data from large raw data sets is someth...
Lida Tang, Ben Shneiderman