Sciweavers

148 search results - page 23 / 30
» Evaluation Metrics for Automatic Temporal Annotation of Text...
Sort
View
CORR
2002
Springer
90views Education» more  CORR 2002»
14 years 11 months ago
Mostly-Unsupervised Statistical Segmentation of Japanese Kanji Sequences
Given the lack of word delimiters in written Japanese, word segmentation is generally considered a crucial first step in processing Japanese texts. Typical Japanese segmentation a...
Rie Kubota Ando, Lillian Lee
LREC
2010
187views Education» more  LREC 2010»
15 years 1 months ago
Ontology-Based Categorization of Web Services with Machine Learning
We present the problem of categorizing web services according to a shallow ontology for presentation on a specialist portal, using their WSDL and associated textual documents foun...
Adam Funk, Kalina Bontcheva
102
Voted
DEXAW
2010
IEEE
202views Database» more  DEXAW 2010»
15 years 21 days ago
Identifying Sentence-Level Semantic Content Units with Topic Models
Abstract--Statistical approaches to document content modeling typically focus either on broad topics or on discourselevel subtopics of a text. We present an analysis of the perform...
Leonhard Hennig, Thomas Strecker, Sascha Narr, Ern...
LREC
2010
115views Education» more  LREC 2010»
14 years 9 months ago
Mining Naturally-occurring Corrections and Paraphrases from Wikipedia's Revision History
Naturally-occurring instances of linguistic phenomena are important both for training and for evaluating automatic text processing. When available in large quantities, they also p...
Aurélien Max, Guillaume Wisniewski
88
Voted
ACL
2003
15 years 1 months ago
Minimum Error Rate Training in Statistical Machine Translation
Often, the training procedure for statistical machine translation models is based on maximum likelihood or related criteria. A general problem of this approach is that there is on...
Franz Josef Och