Search Sciweavers | Sciweavers

804 search results - page 80 / 161

» Text Segmentation Based on Similarity between Words


ICDAR
1997
IEEE

143 views, Document Analysis

Representing OCRed documents in HTML

15 years 2 months ago

Download www.cedar.buffalo.edu

ABSTRACT: OCR is an error-prone process. It is time-consuming and expensive to manually proofread OCR results. The errors remaining in OCRed texts can cause serious problems in rea...

Tao Hong, Sargur N. Srihari


PAKDD
2009
ACM

127 views, Data Mining

Clustering Documents Using a Wikipedia-Based Concept Representation

15 years 4 months ago

Download www.cs.waikato.ac.nz

Abstract. This paper shows how Wikipedia and the semantic knowledge it contains can be exploited for document clustering. We first create a concept-based document representation b...

Anna Huang, David N. Milne, Eibe Frank, Ian H. Wit...


LREC
2010

188 views, Education

How Large a Corpus Do We Need: Statistical Method Versus Rule-based Method

14 years 11 months ago

Download www.lrec-conf.org

We investigate the impact of input data scale in corpus-based learning using a study style of Zipf's law. In our research, Chinese word segmentation is chosen as the study ca...

Hai Zhao, Yan Song, Chunyu Kit


ICIW
2009
IEEE

147 views, Internet Technology

Detecting Ontology Mappings via Descriptive Statistical Methods

14 years 7 months ago

Download cogsci.uni-osnabrueck.de

Instance-based ontology mapping comprises a collection of theoretical approaches and applications for identifying the implicit semantic similarities between two ontologies on the ...

Konstantin Todorov


STACS
1992
Springer

220 views, Theoretical Computer Science

Speeding Up Two String-Matching Algorithms

15 years 2 months ago

Download www-igm.univ-mlv.fr

We show how to speed up two string-matching algorithms: the Boyer-Moore algorithm (BM algorithm), and its version called here the reverse factor algorithm (RF algorithm). The RF al...

Maxime Crochemore, Thierry Lecroq, Artur Czumaj, L...
