Sciweavers

331 search results - page 38 / 67
» Corpus studies in word prediction
Sort
View
WEBI
2009
Springer
15 years 6 months ago
Revealing Hidden Community Structures and Identifying Bridges in Complex Networks: An Application to Analyzing Contents of Web P
The emergence of scale free and small world properties in real world complex networks has stimulated lots of activity in the field of network analysis. An example of such a netwo...
Faraz Zaidi, Arnaud Sallaberry, Guy Melanço...
LREC
2010
233views Education» more  LREC 2010»
15 years 1 months ago
The Development of a Morphosyntactic Tagset for Afrikaans and its Use with Statistical Tagging
In this paper, we present a morphosyntactic tagset for Afrikaans based on the guidelines developed by the Expert Advisory Group on Language Engineering Standards (EAGLES). We comp...
Boris Haselbach, Ulrich Heid
INFORMATICALT
2006
116views more  INFORMATICALT 2006»
14 years 11 months ago
Cache-based Statistical Language Models of English and Highly Inflected Lithuanian
This paper investigates a variety of statistical cache-based language models built upon three corpora: English, Lithuanian, and Lithuanian base forms. The impact of the cache size,...
Airenas Vaiciunas, Gailius Raskinis
FLAIRS
2008
15 years 2 months ago
Learning a Probabilistic Model of Event Sequences from Internet Weblog Stories
One of the central problems in building broad-coverage story understanding systems is generating expectations about event sequences, i.e. predicting what happens next given some a...
Mehdi Manshadi, Reid Swanson, Andrew S. Gordon
ICDAR
2003
IEEE
15 years 5 months ago
Automatic Feature Selection with Applications to Script Identification of Degraded Documents
Current approaches to script identification rely on hand-selected features and often require processing a significant part of the document to achieve reliable identification. We p...
Vitaly Ablavsky, Mark R. Stevens