Sciweavers

1755 search results - page 175 / 351
» Symposium on document engineering
Sort
View
139
Voted
LREC
2008
169views Education» more  LREC 2008»
15 years 5 months ago
A Large-Scale Web Data Collection as a Natural Language Processing Infrastructure
In recent years, language resources acquired from the Web are released, and these data improve the performance of applications in several NLP tasks. Although the language resource...
Keiji Shinzato, Daisuke Kawahara, Chikara Hashimot...
157
Voted
ACL
2003
15 years 5 months ago
Automatic Acquisition of Named Entity Tagged Corpus from World Wide Web
In this paper, we present a method that automatically constructs a Named Entity (NE) tagged corpus from the web to be used for learning of Named Entity Recognition systems. We use...
Joohui An, Seungwoo Lee, Gary Geunbae Lee
144
Voted
IIS
2004
15 years 5 months ago
Conceptual Clustering Using Lingo Algorithm: Evaluation on Open Directory Project Data
Search results clustering problem is defined as an automatic, on-line grouping of similar documents in a search hits list, returned from a search engine. In this paper we present t...
Stanislaw Osinski, Dawid Weiss
109
Voted
DBSEC
2003
140views Database» more  DBSEC 2003»
15 years 5 months ago
Correlated Data Inference
In this paper we examine undesired inference attacks from distributed public XML documents. An undesired inference is a chain of reasoning that leads to protected data of an organ...
Csilla Farkas, Andrei Stoica
184
Voted
RANLP
2003
15 years 5 months ago
A framework for named entity recognition in the open domain
In this paper, a system for Named Entity Recognition in the Open domain (NERO) is described. It is concerned with recognition of various types of entity, types that will be approp...
Richard J. Evans