Sciweavers

577 search results - page 35 / 116
» Improved Text Generation Using N-gram Statistics
PLDI
2010
ACM
15 years 7 months ago
A Context-free Markup Language for Semi-structured Text
An ad hoc data format is any non-standard, semi-structured data format for which robust data processing tools are not available. In this paper, we present ANNE, a new kind of mark...
Qian Xi, David Walker
ICASSP
2011
IEEE
14 years 1 month ago
Training of error-corrective model for ASR without using audio data
This paper introduces a method to train an error-corrective model for Automatic Speech Recognition (ASR) without using audio data. In existing techniques, it is assumed that suf...
Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura
ICDE
2007
IEEE
15 years 11 months ago
Faceted Browsing over Large Databases of Text-Annotated Objects
We demonstrate a fully working system for multifaceted browsing over large collections of text-annotated data, such as annotated images, that are stored in relational databases. T...
Wisam Dakka, Panagiotis G. Ipeirotis, Kenneth R. W...
WWW
2008
ACM
15 years 10 months ago
Detecting image spam using visual features and near duplicate detection
Email spam is a much-studied topic, but even though current email spam-detecting software has been gaining a competitive edge against text-based email spam, new advances in spam g...
Bhaskar Mehta, Saurabh Nangia, Manish Gupta 0002, ...
NAACL
2007
14 years 11 months ago
Are Very Large N-Best Lists Useful for SMT?
This paper describes an efficient method to extract large n-best lists from a word graph produced by a statistical machine translation system. The extraction is based on the k sh...
Sasa Hasan, Richard Zens, Hermann Ney