Sciweavers

» Corpora and data preparation
CORR
1999
Springer
14 years 9 months ago
An Example-Based Approach to Japanese-to-English Translation of Tense, Aspect, and Modality
We have developed a new method for Japanese-to-English translation of tense, aspect, and modality that uses an example-based method. In this method the similarity between input an...
Masaki Murata, Qing Ma, Kiyotaka Uchimoto, Hitoshi...
ICASSP
2010
IEEE
14 years 8 months ago
Random attributed graphs for statistical inference from content and context
Coping with Information Overload is a major challenge of the 21st century. Huge volumes and varieties of multilingual data must be processed to extract salient information. Previo...
Allen L. Gorin, Carey E. Priebe, John Grothendieck
EMNLP
2010
14 years 8 months ago
An Efficient Algorithm for Unsupervised Word Segmentation with Branching Entropy and MDL
This paper proposes a fast and simple unsupervised word segmentation algorithm that utilizes the local predictability of adjacent character sequences, while searching for a leaste...
Valentin Zhikov, Hiroya Takamura, Manabu Okumura
NAACL
2010
14 years 8 months ago
Extracting Glosses to Disambiguate Word Senses
Like most natural language disambiguation tasks, word sense disambiguation (WSD) requires world knowledge for accurate predictions. Several proxies for this knowledge have been in...
Weisi Duan, Alexander Yates
NAACL
2010
14 years 8 months ago
Streaming First Story Detection with application to Twitter
With the recent rise in popularity and size of social media, there is a growing need for systems that can extract useful information from this amount of data. We address the probl...
Sasa Petrovic, Miles Osborne, Victor Lavrenko