Sciweavers

DAS
2010
Springer

Associating figures with descriptions for patent documents

Patent document images maintained by the U.S. patent database have a specific format, in which figures and text descriptions are separated into different sections. This makes it difficult for users to refer to a figure while reading the description, or vice versa. This paper introduces a method to associate figures with their corresponding description paragraphs, making patent documents easier to browse. In this method, individual figures are first extracted from the drawing section; figures and relevant descriptions are then associated by evaluating the similarity between the text content of the figures and the description paragraphs using a vector space model.
Categories and Subject Descriptors I.7 DOCUMENT AND TEXT PROCESSING [Document Capture]: Document analysis; I.7 DOCUMENT AND TEXT PROCESSING [Document Capture]: Graphics recognition and interpretation
Keywords Patent Document Processing, Graphics Segmentation
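The association step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the text content of each figure (labels, reference numerals) has already been extracted as a string, and it uses TF-IDF weighting with cosine similarity as one common form of the vector space model. The function names `tokenize`, `tfidf_vectors`, `cosine`, and `associate` are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase and keep alphanumeric runs, so reference numerals like "12" survive.
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(docs):
    # Build one sparse TF-IDF vector (a dict) per document.
    # The smoothed IDF formula is an assumption, not taken from the paper.
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    df = Counter()
    for toks in tokenized:
        df.update(set(toks))
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    return [{t: tf * idf[t] for t, tf in Counter(toks).items()} for toks in tokenized]


def cosine(a, b):
    # Cosine similarity between two sparse vectors; 0.0 if either is empty.
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def associate(figure_texts, paragraphs):
    # For each figure, return the index of the most similar description paragraph.
    vecs = tfidf_vectors(list(figure_texts) + list(paragraphs))
    fig_vecs = vecs[:len(figure_texts)]
    par_vecs = vecs[len(figure_texts):]
    result = []
    for fv in fig_vecs:
        scores = [cosine(fv, pv) for pv in par_vecs]
        result.append(max(range(len(scores)), key=scores.__getitem__))
    return result


# Hypothetical example data: text recovered from two figures and two paragraphs.
figures = ["FIG 1 12 14 16", "FIG 2 20 22"]
paragraphs = [
    "As shown in FIG 2, the motor 20 drives the shaft 22.",
    "FIG 1 shows a housing 12 with a lid 14 and a hinge 16.",
]
matches = associate(figures, paragraphs)
```

Here `matches[i]` is the index of the paragraph judged most similar to figure `i`. A fuller system would likely return several ranked paragraphs per figure rather than a single best match.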
Linlin Li, Chew Lim Tan
Added 19 Jul 2010
Updated 19 Jul 2010
Type Conference
Year 2010
Where DAS
Authors Linlin Li, Chew Lim Tan