Sciweavers

129 search results - page 24 / 26
» A Corpus of Scope-disambiguated English Text
Sort
View
LREC
2010
176views Education» more  LREC 2010»
13 years 7 months ago
The DAD Parallel Corpora and their Uses
This paper deals with the uses of the annotations of third person singular neuter pronouns in the DAD parallel and comparable corpora of Danish and Italian texts and spoken data. ...
Costanza Navarretta
LREC
2010
170views Education» more  LREC 2010»
13 years 7 months ago
Arabic Word Segmentation for Better Unit of Analysis
The Arabic language has a very rich morphology where a word is composed of zero or more prefixes, a stem and zero or more suffixes. This makes Arabic data sparse compared to other...
Yassine Benajiba, Imed Zitouni
WSDM
2012
ACM
325views Data Mining» more  WSDM 2012»
12 years 1 months ago
Coupled temporal scoping of relational facts
Recent research has made significant advances in automatically constructing knowledge bases by extracting relational facts (e.g., Bill Clinton-presidentOf-US) from large text cor...
Partha Pratim Talukdar, Derry Tanti Wijaya, Tom Mi...
ANLP
1997
137views more  ANLP 1997»
13 years 7 months ago
Probabilistic and Rule-Based Tagger of an Inflective Language- a Comparison
We present results of probabilistic tagging of Czech texts in order to show how these techniques work for one of the highly morphologically ambiguous inflective languages. After d...
Jan Hajic, Barbora Hladká
DAS
2010
Springer
13 years 10 months ago
A post-processing scheme for malayalam using statistical sub-character language models
Most of the Indian scripts do not have any robust commercial OCRs. Many of the laboratory prototypes report reasonable results at recognition/classification stage. However, word ...
Karthika Mohan, C. V. Jawahar