Sciweavers

36 search results - page 2 / 8
» A Formalism for Universal Segmentation of Text
ACL
2009
13 years 2 months ago
A Syntactic and Lexical-Based Discourse Segmenter
We present a syntactic and lexically based discourse segmenter (SLSeg) that is designed to avoid the common problem of over-segmenting text. Segmentation is the first step in a di...
Milan Tofiloski, Julian Brooke, Maite Taboada
ECIR
2007
Springer
13 years 6 months ago
Similarity Measures for Short Segments of Text
Measuring the similarity between documents and queries has been extensively studied in information retrieval. However, there are a growing number of tasks that require computing th...
Donald Metzler, Susan T. Dumais, Christopher Meek
IDA
2007
Springer
13 years 4 months ago
Voting experts: An unsupervised algorithm for segmenting sequences
We describe a statistical signature of chunks and an algorithm for finding chunks. While there is no formal definition of chunks, they may be reliably identified as configurat...
Paul R. Cohen, Niall M. Adams, Brent Heeringa
HICSS
1996
IEEE
13 years 9 months ago
Applications of Multilingual Text Retrieval
The recent enormous increase in the use of networked information access and on-line databases has led to more databases being available in languages other than English. The Center...
W. Bruce Croft, John Broglio, Hideo Fujii
WWW
2008
ACM
14 years 5 months ago
Learning to classify short and sparse text & web with hidden topics from large-scale data collections
This paper presents a general framework for building classifiers that deal with short and sparse text & Web segments by making the most of hidden topics discovered from larges...
Xuan Hieu Phan, Minh Le Nguyen, Susumu Horiguchi