COGSCI 2002

Learning words from sights and sounds: a computational model

This paper presents an implemented computational model of word acquisition that learns directly from raw multimodal sensory input. Set in an information-theoretic framework, the model acquires a lexicon by finding and statistically modeling consistent cross-modal structure. The model has been implemented in a system using novel speech processing, computer vision, and machine learning algorithms. In evaluations, the model successfully performed speech segmentation, word discovery, and visual categorization from spontaneous infant-directed speech paired with video images of single objects. These results demonstrate the possibility of using state-of-the-art techniques from sensory pattern recognition and machine learning to implement cognitive models that can process raw sensor data without the need for human transcription or labeling.
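
As a rough illustration of what "consistent cross-modal structure" could mean in an information-theoretic setting, the sketch below scores how strongly a candidate speech segment co-occurs with a visual category. Mutual information is used here as an assumed scoring function; the abstract does not name the exact measure. The function name `mutual_information` and the toy observations are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def mutual_information(pairs):
    """Estimate mutual information (in bits) between two discrete
    variables from a list of (a, b) co-occurrence observations."""
    n = len(pairs)
    joint = Counter(pairs)                  # counts of each (a, b) pair
    left = Counter(a for a, _ in pairs)     # marginal counts of a
    right = Counter(b for _, b in pairs)    # marginal counts of b
    mi = 0.0
    for (a, b), count in joint.items():
        p_ab = count / n
        # p(a, b) / (p(a) * p(b)) simplifies to count * n / (left * right)
        mi += p_ab * math.log2(count * n / (left[a] * right[b]))
    return mi


# Toy data (hypothetical): each observation records whether a candidate
# speech segment was heard and whether a given visual category was seen.
consistent = [(True, True)] * 4 + [(False, False)] * 4
unrelated = [(True, True), (True, False), (False, True), (False, False)] * 2

print(mutual_information(consistent))  # 1.0 bit: segment and shape always co-occur
print(mutual_information(unrelated))   # 0.0 bits: no cross-modal consistency
```

Under this assumption, a segment and category pair with a high score would be a candidate lexical item, while a pair with a score near zero would be discarded.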

Deb Roy, Alex Pentland

COGSCI 2002 | Implemented Computational Model | Raw Multimodal Sensory | Spontaneous Infant-directed Speech

Post Info

Added	17 Dec 2010
Updated	17 Dec 2010
Type	Journal
Year	2002
Where	COGSCI
Authors	Deb Roy, Alex Pentland
