Sciweavers

» Web Text Corpus for Natural Language Processing
INLG
2004
Springer
13 years 10 months ago
A Corpus-Based Methodology for Evaluating Metrics of Coherence for Text Structuring
This paper presents a novel corpus-based methodology for comparing metrics of coherence with respect to their potential usefulness for text structuring. Different definitions of ...
Nikiforos Karamanis, Chris Mellish, Jon Oberlander...
FLAIRS
2007
13 years 7 months ago
Lexicon Development and POS Tagging Using a Tagged Bengali News Corpus
Lexicon development and Part of Speech (POS) tagging are very important for almost all Natural Language Processing (NLP) application areas. The rapid development of these resources...
Asif Ekbal, Sivaji Bandyopadhyay
EACL
2006
ACL Anthology
13 years 6 months ago
Large Linguistically-Processed Web Corpora for Multiple Languages
The Web contains vast amounts of linguistic data. One key issue for linguists and language technologists is how to access it. Commercial search engines give highly compromised acc...
Marco Baroni, Adam Kilgarriff
EMNLP
2009
13 years 2 months ago
Using the Web for Language Independent Spellchecking and Autocorrection
We have designed, implemented and evaluated an end-to-end spellchecking and autocorrection system that does not require any manually annotated training data. The World Wide...
Casey Whitelaw, Ben Hutchinson, Grace Chung, Ged E...
DOCENG
2008
ACM
13 years 6 months ago
Towards Brazilian Portuguese automatic text simplification systems
In this paper we investigate the main linguistic phenomena that can make texts complex and how they could be simplified. We focus on a corpus analysis of simple account texts avai...
Sandra M. Aluísio, Lucia Specia, Thiago Ale...