Sciweavers

2631 search results - page 333 / 527
» SVM Based Learning System for Information Extraction
Sort
View
194
Voted
NLDB
2004
Springer
15 years 10 months ago
Acquiring Selectional Preferences from Untagged Text for Prepositional Phrase Attachment Disambiguation
Abstract. Extracting information automatically from texts for database representation requires previously well-grouped phrases so that entities can be separated adequately. This pr...
Hiram Calvo, Alexander F. Gelbukh
WWW
2008
ACM
16 years 5 months ago
As we may perceive: finding the boundaries of compound documents on the web
This paper considers the problem of identifying on the Web compound documents (cDocs) ? groups of web pages that in aggregate constitute semantically coherent information entities...
Pavel Dmitriev
ACL
2012
13 years 7 months ago
Fast Online Training with Frequency-Adaptive Learning Rates for Chinese Word Segmentation and New Word Detection
We present a joint model for Chinese word segmentation and new word detection. We present high dimensional new features, including word-based features and enriched edge (label-tra...
Xu Sun, Houfeng Wang, Wenjie Li
IADIS
2008
15 years 6 months ago
Design of an Automated System for Clustering Heterogeneous Data
The goal of this work is to study the feasibility of a Heterogeneous Data Classification and Search (HDCS) system and to provide a possible design for its implementing. In order t...
Dorin Carstoiu, Alexandra Cernian, Adriana Olteanu...
120
Voted
SIGMOD
2010
ACM
224views Database» more  SIGMOD 2010»
15 years 5 months ago
GDR: a system for guided data repair
Improving data quality is a time-consuming, labor-intensive and often domain specific operation. Existing data repair approaches are either fully automated or not efficient in int...
Mohamed Yakout, Ahmed K. Elmagarmid, Jennifer Nevi...