Sciweavers

Corpora and data preparation
LREC
2008
New Functions of FrameSQL for Multilingual FrameNets
The Berkeley FrameNet Project (BFN) is making an English lexical database called FrameNet, which describes syntactic and semantic properties of an English lexicon extracted from l...
Hiroaki Sato
ANLP
1992
Computational Lexicons: the Neat Examples and the Odd Exemplars
When implementing computational lexicons it is important to keep in mind the texts that an NLP system must deal with. Words relate to each other in many different, often queer, way...
Roberto Basili, Maria Teresa Pazienza, Paola Velar...
SIGDIAL
2010
Towards Semi-Supervised Classification of Discourse Relations using Feature Correlations
Two of the main corpora available for training discourse relation classifiers are the RST Discourse Treebank (RST-DT) and the Penn Discourse Treebank (PDTB), which are both based ...
Hugo Hernault, Danushka Bollegala, Mitsuru Ishizuk...
CIDR
2003
Crossing the Structure Chasm
It has frequently been observed that most of the world’s data lies outside database systems. The reason is that database systems focus on structured data, leaving the unstructur...
Alon Y. Halevy, Oren Etzioni, AnHai Doan, Zachary ...
SPEECH
2010
The Nijmegen Corpus of Casual French
This article describes the preparation, recording and orthographic transcription of a new speech corpus, the Nijmegen Corpus of Casual French (NCCFr). The corpus contains a total ...
Francisco Torreira, Martine Adda-Decker, Mirjam Er...