We present a new approach to automatic summarization based on neural nets, called NetSum. We extract a set of features from each sentence that helps identify its importance in the...
Krysta Marie Svore, Lucy Vanderwende, Christopher ...
In this paper, we propose a new method for correcting distorted document images using a stereo camera pair. Geometric and photometric distortions may occur in document images of t...
This paper describes a new method for the classification of a HTML document into a hierarchy of categories. The hierarchy of categories is involved in all phases of automated docum...
Abstract. Because the notion of context is multi-disciplinary [17], it encompasses lots of issues in Information Retrieval. In this paper, we define the context as the information ...
We describe a methodology for retrieving document images from large extremely diverse collections. First we perform content extraction, that is the location and measurement of reg...