Sciweavers

149 search results - page 3 / 30
» A Corpus Factory for Many Languages
Sort
View
IJCNLP
2005
Springer
13 years 11 months ago
Automatic Acquisition of Basic Katakana Lexicon from a Given Corpus
Abstract. Katakana, Japanese phonogram mainly used for loan words, is a troublemaker in Japanese word segmentation. Since Katakana words are heavily domaindependent and there are m...
Toshiaki Nakazawa, Daisuke Kawahara, Sadao Kurohas...
ICDAR
2005
IEEE
13 years 11 months ago
A Corpus for Comparative Evaluation of OCR Software and Postcorrection Techniques
We describe a new corpus collected for comparative evaluation of OCR-software and postcorrection techniques. The corpus is freely available for academic groups and use. The major ...
Stoyan Mihov, Klaus U. Schulz, Christoph Ringlstet...
EMNLP
2009
13 years 3 months ago
Construction of a Blog Emotion Corpus for Chinese Emotional Expression Analysis
There is plenty of evidence that emotion analysis has many valuable applications. In this study a blog emotion corpus is constructed for Chinese emotional expression analysis. Thi...
Changqin Quan, Fuji Ren
IPCV
2008
13 years 7 months ago
Speech Recognition System of Arabic Digits based on A Telephony Arabic Corpus
- Automatic recognition of spoken digits is one of the difficult tasks in the field of computer speech recognition. Spoken digits recognition process is required in many applicatio...
Yousef Alotaibi, Mansour Al-Ghamdi, Fahad Alotaiby
ACL
2012
11 years 8 months ago
Temporally Anchored Relation Extraction
Although much work on relation extraction has aimed at obtaining static facts, many of the target relations are actually fluents, as their validity is naturally anchored to a cer...
Guillermo Garrido, Anselmo Peñas, Bernardo ...