Search Sciweavers | Sciweavers

95

NAACL
2004

138views Computational Linguistics» more NAACL 2004»

Cross-Document Coreference on a Large Scale Corpus

15 years 4 months ago

In this paper, we will compare and evaluate the effectiveness of different statistical methods in the task of cross-document coreference resolution. We created entity models for d...

Chung Heong Gooi, James Allan

claim paper

Read More »

119

Voted

CLEF
2010
Springer

159views Information Technology» more CLEF 2010»

MapReduce for Information Retrieval Evaluation: "Let's Quickly Test This on 12 TB of Data"

15 years 4 months ago

Download eprints.eemcs.utwente.nl

We propose to use MapReduce to quickly test new retrieval approaches on a cluster of machines by sequentially scanning all documents. We present a small case study in which we use ...

Djoerd Hiemstra, Claudia Hauff

claim paper

Read More »

160

click to vote

ICDAR
2003
IEEE

113views Document Analysis» more ICDAR 2003»

Word Segmentation of Handwritten Dates in Historical Documents by Combining Semantic A-Priori-Knowledge with Local Features

15 years 8 months ago

Download www.cse.salford.ac.uk

The recognition of script in historical documents requires suitable techniques in order to identify single words. Segmentation of lines and words is a challenging task because lin...

Markus Feldbach, Klaus D. Tönnies

claim paper

Read More »

162

click to vote

ACST
2006

274views Computer Science» more ACST 2006»

Distributed hierarchical document clustering

15 years 4 months ago

Download nsm1.nsm.iup.edu

This paper investigates the applicability of distributed clustering technique, called RACHET [1], to organize large sets of distributed text data. Although the authors of RACHET c...

Debzani Deb, M. Muztaba Fuad, Rafal A. Angryk

claim paper

Read More »

130

click to vote

ICDAR
2009
IEEE

214views Document Analysis» more ICDAR 2009»

Metadata Extraction from PDF Papers for Digital Library Ingest

15 years 9 months ago

Download www.cvc.uab.es

In this paper we analyze our recent research on the use of document analysis techniques for metadata extraction from PDF papers. We describe a package that is designed to extract ...

Simone Marinai

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers