Sciweavers

563 search results - page 73 / 113
» Assessing the Quality of Natural Language Text Data
Sort
View
WWW
2009
ACM
15 years 4 months ago
Bootstrapped extraction of class attributes
As an alternative to previous studies on extracting class attributes from unstructured text, which consider either Web documents or query logs as the source of textual data, A boo...
Joseph Reisinger, Marius Pasca
91
Voted
CICLING
2009
Springer
15 years 10 months ago
Business Specific Online Information Extraction from German Websites
This paper presents a system that uses the domain name of a German business website to locate its information pages (e.g. company profile, contact page, imprint) and then identifi...
Yeong Su Lee, Michaela Geierhos
PKDD
2010
Springer
178views Data Mining» more  PKDD 2010»
14 years 8 months ago
Large-Scale Support Vector Learning with Structural Kernels
Abstract. In this paper, we present an extensive study of the cuttingplane algorithm (CPA) applied to structural kernels for advanced text classification on large datasets. In par...
Aliaksei Severyn, Alessandro Moschitti
ICIP
2006
IEEE
15 years 11 months ago
Topic Tracking Across Broadcast News Videos with Visual Duplicates and Semantic Concepts
Videos from distributed sources (e.g., broadcasts, podcasts, blogs, etc.) have grown exponentially. Topic threading is very useful for organizing such large-volume information sou...
Winston H. Hsu, Shih-Fu Chang
WWW
2010
ACM
15 years 3 months ago
Linking content in unstructured sources
This tutorial focuses on the task of automated information linking in text and multimedia sources. In any task where information is fused from different sources, this linking is ...
Marie-Francine Moens