Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

9

COLING
2008

favoriteEmaildiscussreport

116views Computational Linguistics» more COLING 2008»

A Framework for Identifying Textual Redundancy

13 years 5 months ago

A Framework for Identifying Textual Redundancy

Download www.aclweb.org

The task of identifying redundant information in documents that are generated from multiple sources provides a significant challenge for summarization and QA systems. Traditional clustering techniques detect redundancy at the sentential level and do not guarantee the preservation of all information within the document. We discuss an algorithm that generates a novel graph-based representation for a document and then utilizes a set cover approximation algorithm to remove redundant text from it. Our experiments show that this approach offers a significant performance advantage over clustering when evaluated over an annotated dataset.

Kapil Thadani, Kathleen McKeown

Real-time Traffic

COLING 2008 | Computational Linguistics | Cover Approximation Algorithm | Significant Performance Advantage | Traditional Clustering Techniques |

claim paper

Related Content

» Efficient search in large textual collections with redundancy

» Iconizer A Framework to Identify and Create Effective Representations for Visual Informati...

» Recognizing Textual Parallelisms with Edit Distance and Similarity Degree

» Controlling Redundancy in Referring Expressions

» Structural identifiability of generalized constraint neural network models for nonlinear r...

» Identifying adaptation dimensions in digital talking books

» Efficient Spectral Feature Selection with Minimum Redundancy

» Redundancies in Dependently Typed Lambda Calculi and Their Relevance to Proof Search

» Semantic Annotation of Reported Information in Arabic

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	COLING
Authors	Kapil Thadani, Kathleen McKeown

Comments (0)