Sciweavers

735 search results - page 24 / 147
» Corpora and data preparation
Sort
View
NAACL
2010
14 years 7 months ago
Everybody loves a rich cousin: An empirical study of transliteration through bridge languages
Most state of the art approaches for machine transliteration are data driven and require significant parallel names corpora between languages. As a result, developing transliterat...
Mitesh M. Khapra, A. Kumaran, Pushpak Bhattacharyy...
SIGMOD
2008
ACM
123views Database» more  SIGMOD 2008»
15 years 10 months ago
SchemaScope: a system for inferring and cleaning XML schemas
We present SchemaScope, a system to derive Document Type Definitions and XML Schemas from corpora of sample XML documents. Tools are provided to visualize, clean, and refine exist...
Geert Jan Bex, Frank Neven, Stijn Vansummeren
COLING
1994
14 years 11 months ago
Encoding standards for large text resources: The Text Encoding Initiative
The Text Encoding Initiative (TEl) is an international project established in 1988 to develop guidelines for the preparation and interchange of electronic texts for research, and t...
Nancy Ide
WWW
2011
ACM
14 years 4 months ago
Analysis and tracking of emotions in english and bengali texts: a computational approach
The present discussion highlights the aspects of an ongoing doctoral thesis grounded on the analysis and tracking of emotions from English and Bengali texts. Development of lexica...
Dipankar Das
CHI
2006
ACM
15 years 10 months ago
Comparisons of keystroke-level model predictions to observed data
Comparison of model prediction against observed data is an investigative step used in cognitive modeling research for human-computer interaction. In this paper we describe compari...
Leonghwee Teo, Bonnie E. John