Text to 3D Scene Generation with Rich Lexical Grounding

8 years 5 months ago

Download nlp.stanford.edu

The ability to map descriptions of scenes to 3D geometric representations has many applications in areas such as art, education, and robotics. However, prior work on the text to 3D scene generation task has used manually speciﬁed object categories and language that identiﬁes them. We introduce a dataset of 3D scenes annotated with natural language descriptions and learn from this data how to ground textual descriptions to physical objects. Our method successfully grounds a variety of lexical terms to concrete referents, and we show quantitatively that our method improves 3D scene generation over previous work using purely rule-based methods. We evaluate the ﬁdelity and plausibility of 3D scenes generated with our grounding approach through human judgments. To ease evaluation on this task, we also introduce an automated metric that strongly correlates with human judgments.

Angel X. Chang, Will Monroe, Manolis Savva, Christ

Real-time Traffic

ACL 2015 | Computational Linguistics |

claim paper

Post Info
More Details (n/a)

Added	13 Apr 2016
Updated	13 Apr 2016
Type	Journal
Year	2015
Where	ACL
Authors	Angel X. Chang, Will Monroe, Manolis Savva, Christopher Potts, Christopher D. Manning

Comments (0)

Sciweavers

Text to 3D Scene Generation with Rich Lexical Grounding

ACL 2015 | Computational Linguistics |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers