Sciweavers

340 search results - page 28 / 68
» New adaptive compressors for natural language text
Sort
View
EMNLP
2009
14 years 7 months ago
Using the Web for Language Independent Spellchecking and Autocorrection
We have designed, implemented and evaluated an end-to-end system spellchecking and autocorrection system that does not require any manually annotated training data. The World Wide...
Casey Whitelaw, Ben Hutchinson, Grace Chung, Ged E...
67
Voted
ICML
1999
IEEE
15 years 2 months ago
Feature Engineering for Text Classification
Most research in text classification to date has used a “bag of words” representation in which each feature corresponds to a single word. This paper examines some alternative ...
Sam Scott, Stan Matwin
72
Voted
EMNLP
2009
14 years 7 months ago
Weighted Alignment Matrices for Statistical Machine Translation
Current statistical machine translation systems usually extract rules from bilingual corpora annotated with 1-best alignments. They are prone to learn noisy rules due to alignment...
Yang Liu, Tian Xia, Xinyan Xiao, Qun Liu
SIGMOD
2003
ACM
115views Database» more  SIGMOD 2003»
15 years 9 months ago
Querying Structured Text in an XML Database
XML databases often contain documents comprising structured text. Therefore, it is important to integrate "information retrieval style" query evaluation, which is well-s...
Shurug Al-Khalifa, Cong Yu, H. V. Jagadish
73
Voted
CIKM
2006
Springer
15 years 1 months ago
Concept frequency distribution in biomedical text summarization
Text summarization is a data reduction process. The use of text summarization enables users to reduce the amount of text that must be read while still assimilating the core inform...
Lawrence H. Reeve, Hyoil Han, Saya V. Nagori, Jona...