The AMI Meeting Corpus is now publicly available, including manual annotation files generated in the NXT XML format, but lacking explicit metadata for the 171 meetings of the cor...
- We describe the algorithms we have developed to automatically generate street networks and building plots in the automatic procedural creation of a realistic city. Our system fir...
The need for syntactically annotated data for use in natural language processing has increased dramatically in recent years. This is true especially for parallel treebanks, of whi...
We examine 89 websites from federal regulatory agencies in order to evaluate their ease of use for those interested in commenting on or learning about their proposed regulations. ...
— One way to handle data mining problems where class prior probabilities and/or misclassification costs between classes are highly unequal is to resample the data until a new, d...