Sciweavers

13 search results - page 1 / 3
» Constructing Reference Sets from Unstructured, Ungrammatical...
Sort
View
JAIR
2010
160views more  JAIR 2010»
13 years 3 months ago
Constructing Reference Sets from Unstructured, Ungrammatical Text
Vast amounts of text on the Web are unstructured and ungrammatical, such as classified ads, auction listings, forum postings, etc. We call such text “posts.” Despite their in...
Matthew Michelson, Craig A. Knoblock
JAIR
2008
173views more  JAIR 2008»
13 years 5 months ago
Creating Relational Data from Unstructured and Ungrammatical Data Sources
In order for agents to act on behalf of users, they will have to retrieve and integrate vast amounts of textual data on the World Wide Web. However, much of the useful data on the...
Matthew Michelson, Craig A. Knoblock
ACL
2008
13 years 6 months ago
Ad Hoc Treebank Structures
We outline the problem of ad hoc rules in treebanks, rules used for specific constructions in one data set and unlikely to be used again. These include ungeneralizable rules, erro...
Markus Dickinson
EMNLP
2006
13 years 6 months ago
Learning Field Compatibilities to Extract Database Records from Unstructured Text
Named-entity recognition systems extract entities such as people, organizations, and locations from unstructured text. Rather than extract these mentions in isolation, this paper ...
Michael L. Wick, Aron Culotta, Andrew McCallum
ICDAR
2003
IEEE
13 years 10 months ago
Writer Identification based on the fractal construction of a reference base
Our aim is to achieve writer identification process thanks to a fractal analysis of handwriting style. For each writer, a set of characteristics is extracted. They are specific to...
Audrey Seropian, M. Grimaldi, Nicole Vincent