Sciweavers

EMNLP
2009
13 years 2 months ago
Stream-based Randomised Language Models for SMT
Randomised techniques allow very big language models to be represented succinctly. However, being batch-based they are unsuitable for modelling an unbounded stream of language whi...
Abby Levenberg, Miles Osborne
ACL
1990
13 years 5 months ago
A Hardware Algorithm for High Speed Morpheme Extraction and its Implementation
This paper describes a new hardware algorithm for morpheme extraction and its implementation on a specific machine (MEX-I), as the first step toward achieving natural language par...
Toshikazu Fukushima, Yutaka Ohyama, Hitoshi Miyai
RIDE
1998
IEEE
13 years 8 months ago
Generating Hypermedia Documents from Transcriptions of Television Programs Using Parallel Text Alignment
This paper presents a method of automatically creating hypermedia documents from conventional transcriptions of television programs. Using parallel text alignment techniques, the ...
David C. Gibbon
SIGIR
2003
ACM
13 years 9 months ago
Domain-independent text segmentation using anisotropic diffusion and dynamic programming
This paper presents a novel domain-independent text segmentation method, which identifies the boundaries of topic changes in long text documents and/or text streams. The method c...
Xiang Ji, Hongyuan Zha
MMM
2010
Springer
14 years 1 months ago
TV News Story Segmentation Based on Semantic Coherence and Content Similarity
In this paper, we introduce and evaluate two novel approaches, one using the video stream and the other using the closed-caption text stream, for segmenting TV news into stories. The segmen...
Hemant Misra, Frank Hopfgartner, Anuj Goyal, P. Pu...