Search Sciweavers | Sciweavers

» Using the Structure of HTML Documents to Improve Retrieval

ACSC
2006
IEEE

143 views, Theoretical Computer Science

Improvements of TLAESA nearest neighbour search algorithm and extension to approximation search

15 years 9 months ago

Download crpit.com

Nearest neighbour (NN) searches and k nearest neighbour (k-NN) searches are widely used in pattern recognition and image retrieval. An NN (k-NN) search finds the closest object (...

Ken Tokoro, Kazuaki Yamaguchi, Sumio Masuda

WWW
2007
ACM

162 views, Internet Technology

Detecting near-duplicates for web crawling

16 years 4 months ago

Download infolab.stanford.edu

Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...

Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma

GRID
2006
Springer

123 views, Distributed And Parallel Com...

A Parallel Approach to XML Parsing

15 years 3 months ago

Download www.cs.indiana.edu

A language for semi-structured documents, XML has emerged as the core of the web services architecture, and is playing crucial roles in messaging systems, databases, and document p...

Wei Lu, Kenneth Chiu, Yinfei Pan

SIGIR
2008
ACM

114 views, Information Technology

A study of learning a merge model for multilingual information retrieval

15 years 3 months ago

Download nlg3.csie.ntu.edu.tw

This paper proposes a learning approach for the merging process in multilingual information retrieval (MLIR). To conduct the learning approach, we also present a large number of f...

Ming-Feng Tsai, Yu-Ting Wang, Hsin-Hsi Chen

ACL
2010

128 views, Computational Linguistics

Profiting from Mark-Up: Hyper-Text Annotations for Guided Parsing

15 years 1 month ago

Download nlp.stanford.edu

We show how web mark-up can be used to improve unsupervised dependency parsing. Starting from raw bracketings of four common HTML tags (anchors, bold, italics and underlines), we ...

Valentin I. Spitkovsky, Daniel Jurafsky, Hiyan Als...
