Large-scale, parallel automatic patent annotation

When researching new product ideas or filing new patents, inventors need to retrieve all relevant pre-existing know-how and/or to exploit and enforce patents in their technological domain. However, this process is hindered by a lack of rich metadata, which, if present, would allow more powerful concept-based search to complement the current keyword-based approach. This paper presents our approach to automatic patent enrichment, tested in large-scale, parallel experiments on USPTO and EPO documents. It starts by defining the metadata annotation task and examining its challenges. The text analysis tools are presented next, including details on the automatic annotation of sections, references and measurements. The key challenges encountered were dealing with ambiguities and errors in the data; creating and maintaining large, domain-independent dictionaries; and building an efficient, robust patent analysis pipeline capable of dealing with terabytes of data. The accuracy of automatically cre...

CIKM 2008 | Information Management | Metadata Annotation Task | Patent Enrichment | Pre-existing Know-how And/or

Added	12 Oct 2010
Updated	12 Oct 2010
Type	Conference
Year	2008
Where	CIKM
Authors	Milan Agatonovic, Niraj Aswani, Kalina Bontcheva, Hamish Cunningham, Thomas Heitz, Yaoyong Li, Ian Roberts, Valentin Tablan
