Sciweavers

1261 search results - page 142 / 253
» Extracting Text from PostScript
Sort
View
NAACL
2004
14 years 11 months ago
Catching the Drift: Probabilistic Content Models, with Applications to Generation and Summarization
We consider the problem of modeling the content structure of texts within a specific domain, in terms of the topics the texts address and the order in which these topics appear. W...
Regina Barzilay, Lillian Lee
CLEF
2010
Springer
14 years 11 months ago
A Plagiarism Detector for Intrinsic Plagiarism - Lab Report for PAN at CLEF 2010
In this paper, we describe the algorithm that has been used to carry out our plagiarism detection within the context of PAN10 competition. Our system is based on the LempelZiv dist...
Pablo Suárez, José Carlos Gonz&aacut...
MIR
2004
ACM
176views Multimedia» more  MIR 2004»
15 years 3 months ago
Analysing the performance of visual, concept and text features in content-based video retrieval
This paper describes revised content-based search experiments in the context of TRECVID 2003 benchmark. Experiments focus on measuring content-based video retrieval performance wi...
Mika Rautiainen, Timo Ojala, Tapio Seppänen
DRR
2009
14 years 7 months ago
Figure content analysis for improved biomedical article retrieval
Biomedical images are invaluable in medical education and establishing clinical diagnosis. Clinical decision support (CDS) can be improved by combining biomedical text with automa...
Daekeun You, Emilia Apostolova, Sameer Antani, Din...
DCC
2001
IEEE
15 years 9 months ago
LIPT: A Reversible Lossless Text Transform to Improve Compression Performance
Lossless compression researchers have developed highly sophisticated approaches, such as Huffman encoding, arithmetic encoding, the Lempel-Ziv family, Dynamic Markov Compression (D...
Fauzia S. Awan, Nan Zhang 0005, Nitin Motgi, Raja ...