RIAO
2007

Extracting Useful Information from the Full Text of Fiction

Download riao.free.fr

In this paper, we describe some experiments in large-scale Information Extraction (IE) focusing on book texts. We investigate the scalability of IE techniques to full-sized books, and the utility of IE techniques in extracting useful information from fiction. In particular, we evaluate a variety of Named Entity Recognition (NER) techniques in identifying the central characters in works of fiction. First, we describe the creation of a gold standard for evaluation, which contains ordered lists of characters for a corpus of classic book texts in Project Gutenberg. Second, we describe several approaches to the task of character identification, where our best model achieves an average coverage score of 78.4% across all central characters. Finally, we propose a number of approaches for future work.

Sharon Givon, Maria Milosavljevic
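
As a rough illustration of the evaluation idea in the abstract (predicting a list of central characters and scoring it against a gold-standard list), here is a minimal Python sketch. It is not the authors' method: the capitalized-token frequency heuristic is only a crude stand-in for NER, the `coverage` function assumes the score is the fraction of gold characters found in the predicted list (the abstract does not define it), and the names `NON_NAMES`, `rank_candidate_characters`, `coverage`, and the toy sentence are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical stop list of capitalized words that are not character names.
NON_NAMES = {"The", "A", "An", "He", "She", "It", "They", "But", "And", "I"}


def rank_candidate_characters(text, top_k=10):
    """Rank capitalized tokens by frequency (a crude stand-in for NER)."""
    tokens = re.findall(r"\b[A-Z][a-z]+\b", text)
    counts = Counter(t for t in tokens if t not in NON_NAMES)
    return [name for name, _ in counts.most_common(top_k)]


def coverage(predicted, gold):
    """Assumed metric: fraction of gold characters present in the prediction."""
    if not gold:
        return 0.0
    predicted_set = set(predicted)
    return sum(1 for name in gold if name in predicted_set) / len(gold)


# Toy input for illustration only; not taken from the paper's corpus.
sample = (
    "Elizabeth walked with Darcy. Elizabeth spoke to Jane, and Jane smiled "
    "at Bingley. Darcy watched Elizabeth."
)
gold = ["Elizabeth", "Darcy", "Jane", "Wickham"]

predicted = rank_candidate_characters(sample, top_k=3)
print(predicted, coverage(predicted, gold))
```

A real pipeline would replace the frequency heuristic with an actual NER system and would also need to merge name variants (for example, a first name and a title plus surname) before scoring.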

Post Info

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2007
Where	RIAO
Authors	Sharon Givon, Maria Milosavljevic
