Search Sciweavers | Sciweavers

336 search results - page 61 / 68

» Content-based language models for spoken document retrieval

118

click to vote

CICLING
2010
Springer

174views Natural Language Processing» more CICLING 2010»

Word Length n-Grams for Text Re-use Detection

15 years 5 months ago

Download users.dsic.upv.es

Abstract. The automatic detection of shared content in written documents –which includes text reuse and its unacknowledged commitment, plagiarism– has become an important probl...

Alberto Barrón-Cedeño, Chiara Basile...

claim paper

Read More »

121

click to vote

IJCNLP
2005
Springer

138views Natural Language Processing» more IJCNLP 2005»

Inversion Transduction Grammar Constraints for Mining Parallel Sentences from Quasi-Comparable Corpora

15 years 7 months ago

Download www.cs.ust.hk

Abstract. We present a new implication of Wu’s (1997) Inversion Transduction Grammar (ITG) Hypothesis, on the problem of retrieving truly parallel sentence translations from larg...

Dekai Wu, Pascale Fung

claim paper

Read More »

107

click to vote

SIGIR
2009
ACM

136views Information Technology» more SIGIR 2009»

Estimating query performance using class predictions

15 years 8 months ago

Download research.microsoft.com

We investigate using topic prediction data, as a summary of document content, to compute measures of search result quality. Unlike existing quality measures such as query clarity ...

Kevyn Collins-Thompson, Paul N. Bennett

claim paper

Read More »

125

click to vote

SIGIR
2009
ACM

101views Information Technology» more SIGIR 2009»

Incorporating prior knowledge into a transductive ranking algorithm for multi-document summarization

15 years 8 months ago

Download eprints.pascal-network.org

This paper presents a transductive approach to learn ranking functions for extractive multi-document summarization. At the ﬁrst stage, the proposed approach identiﬁes topic th...

Massih-Reza Amini, Nicolas Usunier

claim paper

Read More »

122

click to vote

WWW
2010
ACM

257views Internet Technology» more WWW 2010»

CETR: content extraction via tag ratios

15 years 8 months ago

Download www.cs.illinois.edu

We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...

Tim Weninger, William H. Hsu, Jiawei Han

claim paper

Read More »

« Prev « First page 61 / 68 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers