Search Sciweavers | Sciweavers

41 search results - page 6 / 9

» Text Genre Detection Using Common Word Frequencies


TOIS 2002

Burst tries: a fast, efficient data structure for string keys

15 years 6 months ago

Download goanna.cs.rmit.edu.au

Many applications depend on efficient management of large sets of distinct strings in memory. For example, during index construction for text databases a record is held for each d...

Steffen Heinz, Justin Zobel, Hugh E. Williams


AIRWEB 2008, Springer (Internet Technology)

Cleaning search results using term distance features

15 years 9 months ago

Download airweb.cse.lehigh.edu

The presence of Web spam in query results is one of the critical challenges facing search engines today. While search engines try to combat the impact of spam pages on their resul...

Josh Attenberg, Torsten Suel


CIKM 2011, Springer (Information Technology)

Partial duplicate detection for large book collections

14 years 7 months ago

Download www.cs.umass.edu

A framework is presented for discovering partial duplicates in large collections of scanned books with optical character recognition (OCR) errors. Each book in the collection is r...

Ismet Zeki Yalniz, Ethem F. Can, R. Manmatha


AMTA 2004, Springer (Information Technology)

A Fluency Error Categorization Scheme to Guide Automated Machine Translation Evaluation

16 years 12 days ago

Download www.comp.leeds.ac.uk

Abstract. Existing automated MT evaluation methods often require expert human translations. These are produced for every language pair evaluated and, due to this expense, subsequen...

Debbie Elliott, Anthony Hartley, Eric Atwell


EMNLP 2008 (Natural Language Processing)

Relative Rank Statistics for Dialog Analysis

15 years 8 months ago

Download www.aclweb.org

We introduce the relative rank differential statistic which is a non-parametric approach to document and dialog analysis based on word frequency rank-statistics. We also present a...

Juan Huerta
