Sciweavers

735 search results - page 12 / 147
» Corpora and data preparation
Sort
View
LREC
2010
176views Education» more  LREC 2010»
14 years 11 months ago
The DAD Parallel Corpora and their Uses
This paper deals with the uses of the annotations of third person singular neuter pronouns in the DAD parallel and comparable corpora of Danish and Italian texts and spoken data. ...
Costanza Navarretta
ICDM
2008
IEEE
80views Data Mining» more  ICDM 2008»
15 years 4 months ago
Collective Latent Dirichlet Allocation
In this paper, we propose a new variant of Latent Dirichlet Allocation(LDA): Collective LDA (C-LDA), for multiple corpora modeling. C-LDA combines multiple corpora during learning...
Zhiyong Shen, Jun Sun, Yi-Dong Shen
ACL
2007
14 years 11 months ago
A System for Large-Scale Acquisition of Verbal, Nominal and Adjectival Subcategorization Frames from Corpora
This paper describes the first system for large-scale acquisition of subcategorization frames (SCFs) from English corpus data which can be used to acquire comprehensive lexicons ...
Judita Preiss, Ted Briscoe, Anna Korhonen
ACL
2009
14 years 7 months ago
System for Querying Syntactically Annotated Corpora
This paper presents a system for querying treebanks. The system consists of a powerful query language with natural support for cross-layer queries, a client interface with a graph...
Petr Pajas, Jan Stepánek
LREC
2010
136views Education» more  LREC 2010»
14 years 11 months ago
Sign Language Corpora for Analysis, Processing and Evaluation
Sign Languages (SLs) are the visuo-gestural languages practised by the deaf communities. Research on SLs requires to build, to analyse and to use corpora. The aim of this paper is...
Annelies Braffort, Laurence Bolot, Emilie Ch&eacut...