text documents | Sciweavers

12

ECAI
2000
Springer

166views Artificial Intelligence» more ECAI 2000»

Background Knowledge, Indexing and Matching Interdependencies of Document Management and Ontology-Maintenance

13 years 8 months ago

Download ol2000.aifb.uni-karlsruhe.de

This position paper presents an algorithm, which determines similarities between text documents. These text documents are indexed with keywords and further background knowledge-ter...

Andreas Faatz, Thomas Kamps, Ralf Steinmetz

claim paper

Read More »

15

click to vote

HT
2003
ACM

102views Internet Technology» more HT 2003»

Untangling compound documents on the web

13 years 9 months ago

Download mccurley.org

Most text analysis is designed to deal with the concept of a “document”, namely a cohesive presentation of thought on a unifying subject. By contrast, individual nodes on the ...

Nadav Eiron, Kevin S. McCurley

claim paper

Read More »

8

click to vote

SPIRE
2004
Springer

82views Information Technology» more SPIRE 2004»

Indexing Text Documents Based on Topic Identification

13 years 10 months ago

Download tigger.cs.uwm.edu

This work provides algorithms and heuristics to index text documents by determining important topics in the documents. To index text documents, the work provides algorithms to gene...

Manonton Butarbutar, Susan McRoy

claim paper

Read More »

21

click to vote

ISI
2004
Springer

160views Security Privacy» more ISI 2004»

Generating Concept Hierarchies from Text for Intelligence Analysis

13 years 10 months ago

Download wkd.iis.sinica.edu.tw

It is important to automatically extract key information from sensitive text documents for intelligence analysis. Text documents are usually unstructured and information extraction...

Jenq-Haur Wang, Chien-Chung Huang, Jei-Wen Teng, L...

claim paper

Read More »

14

click to vote

CIKM
2005
Springer

131views Information Technology» more CIKM 2005»

Inferring document similarity from hyperlinks

13 years 10 months ago

Download david.grangier.info

Assessing semantic similarity between text documents is a crucial aspect in Information Retrieval systems. In this work, we propose to use hyperlink information to derive a simila...

David Grangier, Samy Bengio

claim paper

Read More »

11

click to vote

ICMCS
2005
IEEE

126views Multimedia» more ICMCS 2005»

Protocols for data-hiding based text document security and automatic processing

13 years 10 months ago

Download www.cecs.uci.edu

Text documents, in electronic and hardcopy forms, are and will probably remain the most widely used kind of content in our digital age. The goal of this paper is to overview proto...

Frédéric Deguillaume, Yuriy Rytsar, ...

claim paper

Read More »

10

click to vote

ICDAR
2005
IEEE

112views Document Analysis» more ICDAR 2005»

A Model for Detecting and Merging Vertically Spanned Table Cells in Plain Text Documents

13 years 10 months ago

Download web.science.mq.edu.au

A spanned cell in a table is a single, complete unit that physically occupies multiple columns and/or multiple rows. Spanned cells are common in tables, and they are a signiﬁcan...

Vanessa Long, Robert Dale, Steve Cassidy

claim paper

Read More »

21

click to vote

AH
2008
Springer

264views Internet Technology» more AH 2008»

Collection Browsing through Automatic Hierarchical Tagging

13 years 11 months ago

Download wwwiti.cs.uni-magdeburg.de

In order to navigate huge document collections eﬃciently, tagged hierarchical structures can be used. For users, it is important to correctly interpret tag combinations. In this ...

Korinna Bade, Marcel Hermkes

claim paper

Read More »

13

click to vote

KDD
2007
ACM

136views Data Mining» more KDD 2007»

Information genealogy: uncovering the flow of ideas in non-hyperlinked document databases

14 years 4 months ago

Download www.benyah.net

We now have incrementally-grown databases of text documents ranging back for over a decade in areas ranging from personal email, to news-articles and conference proceedings. While...

Benyah Shaparenko, Thorsten Joachims

claim paper

Read More »

15

click to vote

ICML
2005
IEEE

126views Machine Learning» more ICML 2005»

Hierarchical Dirichlet model for document classification

14 years 5 months ago

Download www.machinelearning.org

The proliferation of text documents on the web as well as within institutions necessitates their convenient organization to enable efficient retrieval of information. Although tex...

Sriharsha Veeramachaneni, Diego Sona, Paolo Avesan...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers