Sciweavers

735 search results - page 99 / 147
» Corpora and data preparation
Sort
View
DOCENG
2005
ACM
14 years 12 months ago
Encapsulating and manipulating component object graphics (COGs) using SVG
Scalable Vector Graphics (SVG) has an imaging model similar to that of PostScript and PDF but the XML basis of SVG allows it to participate fully, via namespaces, in generalised X...
Alexander J. Macdonald, David F. Brailsford, Steve...
FORTE
2004
14 years 11 months ago
PEPA Nets in Practice: Modelling a Decentralised Peer-to-Peer Emergency Medical Application
Abstract. We apply the PEPA nets modelling language to modelling a peer-topeer medical informatics application, the FieldCare PDA-based medical records system developed by SINTEF T...
Stephen Gilmore, Valentin Haenel, Jane Hillston, L...
NAACL
1994
14 years 11 months ago
MACROPHONE: An American English Telephone Speech Corpus
Macrophone is a corpus of approximately 200,000 utterances, recorded over the telephone from a broad sample of about 5,000 American speakers. Sponsored by the Linguistic Data Cons...
Kelsey Taussig, Jared Bernstein
TREC
2007
14 years 11 months ago
Lymba's PowerAnswer 4 in TREC 2007
This paper reports on Lymba Corporation’s (a spinoff of Language Computer Corporation) participation in the TREC 2007 Question Answering track. An overview of the PowerAnswer 4 ...
Dan I. Moldovan, Christine Clark, Moldovan Bowden
ICASSP
2010
IEEE
14 years 10 months ago
Using the Amazon Mechanical Turk for transcription of spoken language
We investigate whether Amazon’s Mechanical Turk (MTurk) service can be used as a reliable method for transcription of spoken language data. Utterances with varying speaker demog...
Matthew Marge, Satanjeev Banerjee, Alexander I. Ru...