Sciweavers

1755 search results - page 175 / 351
» Symposium on document engineering
Sort
View
LREC
2008
169views Education» more  LREC 2008»
15 years 22 days ago
A Large-Scale Web Data Collection as a Natural Language Processing Infrastructure
In recent years, language resources acquired from the Web are released, and these data improve the performance of applications in several NLP tasks. Although the language resource...
Keiji Shinzato, Daisuke Kawahara, Chikara Hashimot...
ACL
2003
15 years 20 days ago
Automatic Acquisition of Named Entity Tagged Corpus from World Wide Web
In this paper, we present a method that automatically constructs a Named Entity (NE) tagged corpus from the web to be used for learning of Named Entity Recognition systems. We use...
Joohui An, Seungwoo Lee, Gary Geunbae Lee
IIS
2004
15 years 20 days ago
Conceptual Clustering Using Lingo Algorithm: Evaluation on Open Directory Project Data
Search results clustering problem is defined as an automatic, on-line grouping of similar documents in a search hits list, returned from a search engine. In this paper we present t...
Stanislaw Osinski, Dawid Weiss
DBSEC
2003
140views Database» more  DBSEC 2003»
15 years 20 days ago
Correlated Data Inference
In this paper we examine undesired inference attacks from distributed public XML documents. An undesired inference is a chain of reasoning that leads to protected data of an organ...
Csilla Farkas, Andrei Stoica
RANLP
2003
15 years 20 days ago
A framework for named entity recognition in the open domain
In this paper, a system for Named Entity Recognition in the Open domain (NERO) is described. It is concerned with recognition of various types of entity, types that will be approp...
Richard J. Evans