Sciweavers

21 search results - page 2 / 5
» Constructing Parallel Corpus from Movie Subtitles
LREC
2010
13 years 6 months ago
Constructing the CODA Corpus: A Parallel Corpus of Monologues and Expository Dialogues
We describe the construction of the CODA corpus, a parallel corpus of monologues and expository dialogues. The dialogue part of the corpus consists of expository, i.e., informatio...
Svetlana Stoyanchev, Paul Piwek
IMCSIT
2010
13 years 2 months ago
Parallel, Massive Processing in SuperMatrix - a General Tool for Distributional Semantic Analysis of Corpus
The paper presents an extended version of the SuperMatrix system -- a general tool supporting automatic acquisition of lexical semantic relations from corpora. Extensions focus mai...
Bartosz Broda, Damian Jaworski, Maciej Piasecki
NAACL
2010
13 years 3 months ago
Generating Expository Dialogue from Monologue: Motivation, Corpus and Preliminary Rules
Generating expository dialogue from monologue is a task that poses an interesting and rewarding challenge for Natural Language Processing. This short paper has three aims: firstly...
Paul Piwek, Svetlana Stoyanchev
ACSW
2004
13 years 6 months ago
Discovering Parallel Text from the World Wide Web
Parallel corpus is a rich linguistic resource for various multilingual text management tasks, including crosslingual text retrieval, multilingual computational linguistics and mul...
Jisong Chen, Rowena Chau, Chung-Hsing Yeh
EDBT
2009
ACM
14 years 4 days ago
Xoom: a tool for zooming in and out of XML documents
Suppose there is a large corpus of XML documents, each of which describes a movie released in the last 30 years (for example, extracted from IMDB). A movie enthusiast wants to mak...
Maya Ramanath, Kondreddi Sarath Kumar