Search Sciweavers | Sciweavers

4645 search results - page 72 / 929

» Using Information Extraction to Improve Document Retrieval

158

click to vote

ICDIM
2008
IEEE

351views Information Technology» more ICDIM 2008»

Unsupervised key-phrases extraction from scientific papers using domain and linguistic knowledge

15 years 11 months ago

Download dit.unitn.it

The domain of Digital Libraries presents specific challenges for unsupervised information extraction to support both the automatic classification of documents and the enhancement ...

Mikalai Krapivin, Maurizio Marchese, Andrei Yadran...

claim paper

Read More »

119

click to vote

WSDM
2010
ACM

215views Data Mining» more WSDM 2010»

Boilerplate Detection using Shallow Text Features

16 years 1 months ago

Download www.wsdm-conference.org

In addition to the actual content Web pages consist of navigational elements, templates, and advertisements. This boilerplate text typically is not related to the main content, ma...

Christian Kohlschütter, Peter Fankhauser, Wol...

claim paper

Read More »

137

click to vote

ICDAR
2009
IEEE

222views Document Analysis» more ICDAR 2009»

Enhanced Text Extraction from Arabic Degraded Document Images Using EM Algorithm

15 years 11 months ago

Download www.cvc.uab.es

This paper presents a new enhanced text extraction algorithm from degraded document images on the basis of the probabilistic models. The observed document image is considered as a...

Wafa Boussellaa, Aymen Bougacha, Abderrazak Zahour...

claim paper

Read More »

128

click to vote

SIGIR
2010
ACM

188views Information Technology» more SIGIR 2010»

Hierarchical pitman-yor language model for information retrieval

15 years 8 months ago

Download www.lsv.uni-saarland.de

In this paper, we propose a new application of Bayesian language model based on Pitman-Yor process for information retrieval. This model is a generalization of the Dirichlet distr...

Saeedeh Momtazi, Dietrich Klakow

claim paper

Read More »

140

click to vote

CLIN
2001

103views Computational Linguistics» more CLIN 2001»

Creating a Dutch Information Retrieval Test Corpus

15 years 5 months ago

Download eprints.eemcs.utwente.nl

This paper describes the first large-scale evaluation of information retrieval systems using Dutch documents and queries. We describe in detail the characteristics of the Dutch te...

Djoerd Hiemstra, David van Leeuwen

claim paper

Read More »

« Prev « First page 72 / 929 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers