Sciweavers

3090 search results - page 179 / 618
» Document Processing with LinkIT
Sort
View
AAAI
2008
15 years 6 months ago
Automatic Extraction of Data Points and Text Blocks from 2-Dimensional Plots in Digital Documents
Two dimensional plots (2-D) in digital documents on the web are an important source of information that is largely under-utilized. In this paper, we outline how data and text can ...
Saurabh Kataria, William Browuer, Prasenjit Mitra,...
LREC
2008
150views Education» more  LREC 2008»
15 years 5 months ago
Automatic Document Quality Control
This paper focuses on automatically improving the readability of documents. We explore mechanisms relating to content control that could be used (i) by authors to improve the qual...
Neil Newbold, Lee Gillam
INTERACT
2007
15 years 5 months ago
FaericWorld: Browsing Multimedia Events Through Static Documents and Links
This paper describes a novel browsing paradigm, taking benefit of the various types of links (e.g. thematic, temporal, references, etc.) that can be automatically built between mul...
Maurizio Rigamonti, Denis Lalanne, Rolf Ingold
SIGIR
2002
ACM
15 years 3 months ago
Unsupervised document classification using sequential information maximization
We present a novel sequential clustering algorithm which is motivated by the Information Bottleneck (IB) method. In contrast to the agglomerative IB algorithm, the new sequential ...
Noam Slonim, Nir Friedman, Naftali Tishby
CIKM
2010
Springer
15 years 2 months ago
Fast dimension reduction for document classification based on imprecise spectrum analysis
This paper proposes an algorithm called Imprecise Spectrum Analysis (ISA) to carry out fast dimension reduction for document classification. ISA is designed based on the one-sided...
Hu Guan, Bin Xiao, Jingyu Zhou, Minyi Guo, Tao Yan...