Sciweavers

» Applied Text Generation
WWW
2007
ACM
15 years 10 months ago
EPCI: extracting potentially copyright infringement texts from the web
In this paper, we propose a new system extracting potentially copyright infringement texts from the Web, called EPCI. EPCI extracts them in the following way: (1) generating a set...
Takashi Tashiro, Takanori Ueda, Taisuke Hori, Yu H...
KDD
2005
ACM
15 years 10 months ago
On the use of linear programming for unsupervised text classification
We propose a new algorithm for dimensionality reduction and unsupervised text classification. We use mixture models as underlying process of generating corpus and utilize a novel,...
Mark Sandler
SETN
2004
Springer
15 years 3 months ago
Text Normalization for the Pronunciation of Non-standard Words in an Inflected Language
In this paper we present a novel approach, called “Text to Pronunciation (TtP)”, for the proper normalization of Non-Standard Words (NSWs) in unrestricted texts. The methodolog...
Gerasimos Xydas, Georgios Karberis, Georgios Kouro...
TSD
2001
Springer
15 years 2 months ago
Augmented Auditory Representation of e-Texts for Text-to-Speech Systems
Emerging electronic text formats include hierarchical structure and visualization related information that current Text-to-Speech (TtS) systems ignore. In this paper we p...
Gerasimos Xydas, Georgios Kouroupetroglou
SIGIR
1999
ACM
15 years 2 months ago
Summarizing Text Documents: Sentence Selection and Evaluation Metrics
Human-quality text summarization systems are difficult to design, and even more difficult to evaluate, in part because documents can differ along several dimensions, such as length, wri...
Jade Goldstein, Mark Kantrowitz, Vibhu O. Mittal, ...