Sciweavers

576 search results - page 80 / 116
» Combination of Feature Selection Methods for Text Categorisa...
Sort
View
IMCSIT
2010
15 years 17 days ago
Evaluation of Clustering Algorithms for Polish Word Sense Disambiguation
Word Sense Disambiguation in text is still a difficult problem as the best supervised methods require laborious and costly manual preparation of training data. Thus, this work focu...
Bartosz Broda, Wojciech Mazur
134
Voted
KDD
2002
ACM
186views Data Mining» more  KDD 2002»
16 years 3 months ago
Topic-conditioned novelty detection
Automated detection of the first document reporting each new event in temporally-sequenced streams of documents is an open challenge. In this paper we propose a new approach which...
Yiming Yang, Jian Zhang, Jaime G. Carbonell, Chun ...
ACL
2003
15 years 4 months ago
Learning to Predict Pitch Accents and Prosodic Boundaries in Dutch
We train a decision tree inducer (CART) and a memory-based classifier (MBL) on predicting prosodic pitch accents and breaks in Dutch text, on the basis of shallow, easy-to-comput...
Erwin Marsi, Martin Reynaert, Antal van den Bosch,...
120
Voted
ICASSP
2011
IEEE
14 years 6 months ago
Improved pos tagging for text-to-speech synthesis
One of the fundamental building blocks of text processing for textto-speech (TTS) synthesis is the assignment of a part-of-speech (POS) tag to each input word. POS tags are heavil...
Ming Sun, Jerome R. Bellegarda
CORR
2010
Springer
286views Education» more  CORR 2010»
15 years 16 hour ago
PhishDef: URL Names Say It All
Phishing is an increasingly sophisticated method to steal personal user information using sites that pretend to be legitimate. In this paper, we take the following steps to identif...
Anh Le, Athina Markopoulou, Michalis Faloutsos