Sciweavers

NAACL
1994
13 years 6 months ago
Integrated Text and Image Understanding for Document Understanding
Because of the complexity of documents and the variety of applications which must be supported, document understanding requires the integration of image understanding with text un...
Suzanne Liebowitz Taylor, Deborah A. Dahl, Mark Li...
ANLP
1994
104views more  ANLP 1994»
13 years 6 months ago
Language Determination: Natural Language Processing from Scanned Document Images
Many documents are available to a computer only as images from paper. However, most natural language processing systems expect their input as character-coded text, which may be di...
Penelope Sibun, A. Lawrence Spitz
PAKM
1998
13 years 6 months ago
Knowledge Management: A Text Mining Approach
Knowledge Discovery in Databases (KDD), also known as data mining, focuses on the computerized exploration of large amounts of data and on the discovery of interesting patterns wi...
Ronen Feldman, Moshe Fresko, Haym Hirsh, Yonatan A...
NIPS
1998
13 years 6 months ago
Restructuring Sparse High Dimensional Data for Effective Retrieval
The task in text retrieval is to find the subset of a collection of documents relevant to a user's information request, usually expressed as a set of words. Classically, docu...
Charles Lee Isbell Jr., Paul A. Viola
ECIR
1998
Springer
13 years 6 months ago
Independence of Contributing Retrieval Strategies in Data Fusion for Effective Information Retrieval
: In information retrieval, data fusion is a technique for combining the outputs of more than one retrieval strategy which rank documents for retrieval. One of the observations oft...
Alan F. Smeaton
BMVC
1998
13 years 6 months ago
Automatic Processing of Document Annotations
A common authoring technique involves making annotations on a printed draft and then typing the corrections into a computer at a later date. In this paper, we describe a system th...
Jacob Stevens, Andrew H. Gee, Chris Dance
ACL
2000
13 years 6 months ago
Headline Generation Based on Statistical Translation
Extractive summarization techniques cannot generate document summaries shorter than a single sentence, something that is often required. An ideal summarization system would unders...
Michele Banko, Vibhu O. Mittal, Michael J. Witbroc...
SDM
2003
SIAM
134views Data Mining» more  SDM 2003»
13 years 6 months ago
Hierarchical Document Clustering using Frequent Itemsets
A major challenge in document clustering is the extremely high dimensionality. For example, the vocabulary for a document set can easily be thousands of words. On the other hand, ...
Benjamin C. M. Fung, Ke Wang, Martin Ester
KRDB
2003
137views Database» more  KRDB 2003»
13 years 6 months ago
Focused Search on the Web using WeQueL
Keyword-based web query languages suffer from a lack of precision when searching for a precise kind of documents. Indeed, some documents cannot be simply characterized by a list o...
Amar-Djalil Mezaour
JISBD
2003
13 years 6 months ago
Coupling the ontology layer with the resource layer: a rule-based approach
Abstract. Ontology languages are being proposed to provide machine-understandable descriptions of resources that permit easy location of these resource. Content managers can also b...
Jon Iturrioz, Oscar Díaz, Sergio Fern&aacut...