Sciweavers

ANLP
2000
Tagging Sentence Boundaries
In this paper we tackle sentence boundary disambiguation through a part-of-speech (POS) tagging framework. We describe necessary changes in text tokenization and the implementatio...
Andrei Mikheev
SIGIR
2000
ACM
Document centered approach to text normalization
In this paper we present an approach to tackle three important problems of text normalization: sentence boundary disambiguation, disambiguation of capitalized words when they are ...
Andrei Mikheev