This paper presents a new corpus project, aiming at building a national corpus of Polish. What makes it different from a typical YACP (Yet Another Corpus Project) is 1) the fact t...
In this paper we describe the methodology and the first steps for the creation of WNTERM (from WordNet and Terminology), a specialized lexicon produced from the merger of the Euro...
Eli Pociello, Antton Gurrutxaga, Eneko Agirre, Iza...
Fixed, limited budgets often constrain the amount of expert annotation that can go into the construction of annotated corpora. Estimating the cost of annotation is the first step ...
Eric K. Ringger, Marc Carmen, Robbie Haertel, Kevi...
This paper describes the collect and transcription of a large set of Arabic broadcast news speech data. A total of more than 2000 hours of data was transcribed. The transcription ...
Krahmer et al.'s (2003) graph-based framework provides an elegant and flexible approach to the generation of referring expressions. In this paper, we present the first report...
Jette Viethen, Robert Dale, Emiel Krahmer, Mari&eu...