Sciweavers

» A Chinese Corpus for Linguistic Research
TSD
2009
Springer
15 years 4 months ago
The Czech Broadcast Conversation Corpus
This paper presents the final version of the Czech Broadcast Conversation Corpus released at the Linguistic Data Consortium (LDC). The corpus contains 72 recordings of a...
Jáchym Kolář, Jan Švec

Dataset
15 years 5 months ago
SCUT-COUCH2009 - A Comprehensive Online Unconstrained Handwriting Database
The SCUT-COUCH 2009 database is a comprehensive database that consists of 12 datasets, namely GB1, GB2, TradGB1, Big5, Pinyin, Letters, Digit, Symbol, Word8888, Word17366, Word44208 an...
Lianwen Jin
CLEAR
2007
Springer
Biometrics
15 years 3 months ago
Shared Linguistic Resources for the Meeting Domain
This paper describes efforts by the University of Pennsylvania's Linguistic Data Consortium to create and distribute shared linguistic resources – including data, annotation...
Meghan Lammie Glenn, Stephanie Strassel
AAAI
1994
14 years 10 months ago
Talking About AI: Socially Defined Linguistic Subcontexts in AI
This paper describes experiments documenting significant variations in word usage patterns within social subgroups of AI researchers. As some phrases have very different collocati...
Amy M. Steier, Richard K. Belew
LREC
2010
Education
14 years 11 months ago
How Specialized are Specialized Corpora? Behavioral Evaluation of Corpus Representativeness for Maltese
In this paper we bring to light a novel intersection between corpus linguistics and behavioral data that can be employed as an evaluation metric for resources for low-density lang...
Jerid Francom, Amy LaCross, Adam Ussishkin