Sciweavers

34 search results - page 3 / 7
» Lexicalized Phonotactic Word Segmentation
Sort
View
EMNLP
2008
13 years 7 months ago
Bayesian Unsupervised Topic Segmentation
This paper describes a novel Bayesian approach to unsupervised topic segmentation. Unsupervised systems for this task are driven by lexical cohesion: the tendency of wellformed se...
Jacob Eisenstein, Regina Barzilay
AAAI
2007
13 years 8 months ago
Topic Segmentation Algorithms for Text Summarization and Passage Retrieval: An Exhaustive Evaluation
In order to solve problems of reliability of systems based on lexical repetition and problems of adaptability of languagedependent systems, we present a context-based topic segmen...
Gaël Dias, Elsa Alves, José Gabriel Pe...
EMNLP
2010
13 years 3 months ago
Predicting the Semantic Compositionality of Prefix Verbs
In many applications, replacing a complex word form by its stem can reduce sparsity, revealing connections in the data that would not otherwise be apparent. In this paper, we focu...
Shane Bergsma, Aditya Bhargava, Hua He, Grzegorz K...
JAIR
2010
162views more  JAIR 2010»
13 years 4 months ago
Text Relatedness Based on a Word Thesaurus
The computation of relatedness between two fragments of text in an automated manner requires taking into account a wide range of factors pertaining to the meaning the two fragment...
George Tsatsaronis, Iraklis Varlamis, Michalis Vaz...
CICLING
2004
Springer
13 years 9 months ago
Language-Independent Methods for Compiling Monolingual Lexical Data
Abstract: In this paper we describe a flexible, portable and languageindependent infrastructure for setting up large monolingual language corpora. The approach is based on collecti...
Christian Biemann, Stefan Bordag, Gerhard Heyer, U...