Sciweavers

» Corpora and data preparation
LREC
2010
Construction of Chunk-Aligned Bilingual Lecture Corpus for Simultaneous Machine Translation
With the development of speech and language processing, speech translation systems have been developed. These studies target spoken dialogues, and employ consecutive interpretatio...
Masaki Murata, Tomohiro Ohno, Shigeki Matsubara, Y...
LREC
2008
Selection of Japanese-English Equivalents by Integrating High-quality Corpora and Huge Amounts of Web Data
As a first step to developing systems that enable non-native speakers to output near-perfect English sentences for given mixed English-Japanese sentences, we propose new approaches...
Qing Ma, Koichi Nakao, Masaki Murata, Hitoshi Isah...
EDBT
2008
ACM
Protecting privacy in recorded conversations
Professionals in the field of speech technology are often constrained by a lack of speech corpora that are important to their research and development activities. These corpora ex...
Scot Cunningham, Traian Marius Truta
ECIR
2006
Springer
Automatic Acquisition of Chinese-English Parallel Corpus from the Web
Parallel corpora are a valuable resource for tasks such as cross-language information retrieval and data-driven natural language processing systems. Previously only small scale cor...
Ying Zhang, Ke Wu, Jianfeng Gao, Phil Vines
ACL
2006
Towards A Modular Data Model For Multi-Layer Annotated Corpora
In this paper we discuss the current methods in the representation of corpora annotated at multiple levels of linguistic organization (so-called multi-level or multi-layer corpora...
Richard Eckart