Search Sciweavers | Sciweavers

EMNLP
2007

116 views | Natural Language Processing

Topic Segmentation with Hybrid Document Indexing

13 years 5 months ago

We present a domain-independent unsupervised topic segmentation approach based on hybrid document indexing. Lexical chains have been successfully employed to evaluate lexical cohe...

Irina Matveeva, Gina-Anne Levow

CLEF
2010
Springer

263 views | Information Technology

External and Intrinsic Plagiarism Detection Using a Cross-Lingual Retrieval and Segmentation System - Lab Report for PAN at CLEF

13 years 5 months ago

Download www.uni-weimar.de

We present our hybrid system for the PAN challenge at CLEF 2010. Our system performs plagiarism detection for translated and non-translated externally as well as intrinsically plag...

Markus Muhr, Roman Kern, Mario Zechner, Michael Gr...

CIKM
2004
ACM

105 views | Information Technology

Processing content-oriented XPath queries

13 years 9 months ago

Download staff.science.uva.nl

Document-centric XML collections contain text-rich documents, marked up with XML tags that add lightweight semantics to the text. Querying such collections calls for a hybrid quer...

Börkur Sigurbjörnsson, Jaap Kamps, Maart...

ICDAR
1997
IEEE

143 views | Document Analysis

Representing OCRed documents in HTML

13 years 8 months ago

Download www.cedar.buffalo.edu

OCR is an error-prone process. It is time-consuming and expensive to manually proofread OCR results. The errors remaining in OCRed texts can cause serious problems in rea...

Tao Hong, Sargur N. Srihari

ICIP
2009
IEEE

182 views | Image Processing

Semantic keyword extraction via adaptive text binarization of unstructured unsourced video

13 years 2 months ago

Download www1.cs.columbia.edu

We propose a fully automatic method for summarizing and indexing unstructured presentation videos based on text extracted from the projected slides. We use changes of text in the ...

Michele Merler, John R. Kender
