Search Sciweavers | Sciweavers

290 search results - page 21 / 58

» Document normalization revisited

142

click to vote

COLING
2000

107views Computational Linguistics» more COLING 2000»

A Method of Measuring Term Representativeness - Baseline Method Using Co-occurrence Distribution

15 years 6 months ago

Download acl.ldc.upenn.edu

This paper introduces a scheme, which we call the baseline method, to define a measure of term representativeness and measures defined by using the scheme. The representativeness ...

Toru Hisamitsu, Yoshiki Niwa, Jun-ichi Tsujii

claim paper

Read More »

149

click to vote

SIGIR
2011
ACM

259views Information Technology» more SIGIR 2011»

When documents are very long, BM25 fails!

14 years 8 months ago

Download sifaka.cs.uiuc.edu

We reveal that the Okapi BM25 retrieval function tends to overly penalize very long documents. To address this problem, we present a simple yet eﬀective extension of BM25, namel...

Yuanhua Lv, ChengXiang Zhai

claim paper

Read More »

130

click to vote

ICPR
2002
IEEE

139views computer vision» more ICPR 2002»

Robust Text Detection from Binarized Document Images

16 years 6 months ago

Download www.ee.oulu.fi

Many document images are rich in color and have complex background. To detect text from them, a standard approach utilizes both color and binary information. This often leads to t...

Oleg Okun, Yu Yan, Matti Pietikäinen

claim paper

Read More »

141

click to vote

ICPR
2010
IEEE

209views Computer Vision» more ICPR 2010»

Text Separation from Mixed Documents Using a Tree-Structured Classifier

15 years 3 months ago

Download www.visionopen.com

In this paper, we propose a tree-structured multiclass classifier to identify annotations and overlapping text from machine printed documents. Each node of the tree-structured cla...

Xujun Peng, Srirangaraj Setlur, Venu Govindaraju, ...

claim paper

Read More »

171

click to vote

DOCENG
2010
ACM

203views Document Analysis» more DOCENG 2010»

Diffing, patching and merging XML documents: toward a generic calculus of editing deltas

15 years 2 months ago

Download www.xrce.xerox.com

This work addresses what we believe to be a central issue in the field of XML diff and merge computation: the mathematical modeling o-called editing deltas and the study of their ...

Jean-Yves Vion-Dury

claim paper

Read More »

« Prev « First page 21 / 58 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers