Sciweavers


WAPCV
2007
Springer


Language Label Learning for Visual Concepts Discovered from Video Sequences


Download www.cse.iitk.ac.in

Computational models of grounded language learning have been based on the premise that words and concepts are learned simultaneously. Given the mounting cognitive evidence for concept formation in infants, we argue that the availability of pre-lexical concepts (learned from image sequences) leads to considerable computational efficiency in word acquisition. Key to the process is a model of bottom-up visual attention in dynamic scenes. Background learning and foreground segmentation are used to generate robust tracking and detect occlusion events. Trajectories are clustered to obtain motion event concepts. Object concepts (image schemas) are abstracted from the combined appearance and motion data. The set of acquired concepts under visual attentive focus is then correlated with contemporaneous commentary to learn the grounded semantics of words and multi-word phrasal concatenations from the narrative. We demonstrate that even based on a mere half hour of video (of a scene involving many obje...
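The correlation step in the abstract (matching concepts under attentive focus against words in the commentary) can be illustrated with a small co-occurrence sketch. The abstract does not state which association measure the authors use, so the score below, the product of the two conditional frequencies P(concept | word) and P(word | concept), is an assumption chosen for illustration. The function names `associate_words` and `best_label` and the episode format are likewise hypothetical.

```python
from collections import Counter


def associate_words(episodes):
    """Score word/concept pairs by co-occurrence.

    episodes: list of (concepts_in_focus, words_uttered) pairs, one per
    time window. This input format is an assumption, not the paper's.
    Returns {(word, concept): score} with
    score = P(concept | word) * P(word | concept).
    """
    word_count = Counter()
    concept_count = Counter()
    pair_count = Counter()
    for concepts, words in episodes:
        concepts, words = set(concepts), set(words)
        word_count.update(words)
        concept_count.update(concepts)
        pair_count.update((w, c) for w in words for c in concepts)

    scores = {}
    for (w, c), n_wc in pair_count.items():
        # n_wc / n_w estimates P(c | w); n_wc / n_c estimates P(w | c).
        scores[(w, c)] = (n_wc * n_wc) / (word_count[w] * concept_count[c])
    return scores


def best_label(scores, concept):
    """Return the highest-scoring word for a concept, or None if unseen."""
    candidates = {w: s for (w, c), s in scores.items() if c == concept}
    return max(candidates, key=candidates.get) if candidates else None


# Toy commentary: each window pairs the attended concept with the words heard.
episodes = [
    ({"BALL"}, ["the", "ball", "rolls"]),
    ({"BALL"}, ["the", "ball", "bounces"]),
    ({"SQUARE"}, ["the", "square", "moves"]),
    ({"SQUARE"}, ["a", "square", "chases"]),
]
scores = associate_words(episodes)
print(best_label(scores, "BALL"))
```

Multiplying the two conditionals penalizes both frequent function words such as "the", which co-occur with every concept, and rare words that appear only once, which a purely ratio-based measure would overrate.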

Prithwijit Guha, Amitabha Mukerjee


Computational Model | Computer Vision | Motion Event Concepts | Multi-word Phrasal Concatenations | WAPCV 2007


Related Content

» Understanding Videos Constructing Plots Learning a Visually Grounded Storyline Model from...

» Understanding videos constructing plots learning a visually grounded storyline model from ...

» Sequence MultiLabeling A Unified Video Annotation Scheme With Spatial and Temporal Context

» Learning TRECVID08 HighLevel Features from YouTube

» Mining Video Associations for Efficient Database Management

» Unsupervised Learning of Event Classes from Video

» SpecifictoGeneral Learning for Temporal Events with Application to Learning Event Definiti...

» Visual Tracking via Weakly Supervised Learning from Multiple Imperfect Oracles

» Learning Deformable Action Templates from Crowded Videos

Post Info

Added	09 Jun 2010
Updated	09 Jun 2010
Type	Conference
Year	2007
Where	WAPCV
Authors	Prithwijit Guha, Amitabha Mukerjee
