Towards Total Scene Understanding: Classification, Annotation and Segmentation in an Automatic Framework

16 years 10 months ago

Download vision.cs.princeton.edu

Given an image, we propose a hierarchical generative model that classifies the overall scene, recognizes and segments each object component, as well as annotates the image with a list of tags. To our knowledge, this is the first model that performs all three tasks in one coherent framework. For instance, a scene of a ‘polo game’ consists of several visual objects such as ‘human’, ‘horse’, ‘grass’, etc. In addition, it can be further annotated with a list of more abstract (e.g. ‘dusk’) or visually less salient (e.g. ‘saddle’) tags. Our generative model jointly explains images through a visual model and a textual model. Visually relevant objects are represented by regions and patches, while visually irrelevant textual annotations are influenced directly by the overall scene class. We propose a fully automatic learning framework that is able to learn robust scene models from noisy web data such as images and user tags from Flickr.com. We demonstrate...

Fei-Fei Li 0002, Li-Jia Li, Richard Socher

Real-time Traffic

Annotation | Classification | Computer Vision | CVPR 2009 | Generative Models | Scene Understanding | Segmentation |

claim paper

Post Info
More Details (n/a)

Added	05 May 2009
Updated	10 Dec 2009
Type	Conference
Year	2009
Where	CVPR
Authors	Fei-Fei Li 0002, Li-Jia Li, Richard Socher

Comments (0)

Sciweavers

Towards Total Scene Understanding: Classification, Annotation and Segmentation in an Automatic Framework

Annotation | Classification | Computer Vision | CVPR 2009 | Generative Models | Scene Understanding | Segmentation |

Explore & Download

Productivity Tools

Sciweavers