Search Sciweavers | Sciweavers

563 search results - page 65 / 113

» Crawling the web for structured documents


ACL
1998

173 views, Computational Linguistics

Automatic Text Summarization Based on the Global Document Annotation

15 years 7 months ago

Download www.aclweb.org

The GDA (Global Document Annotation) project proposes a tag set which allows machines to automatically infer the underlying semantic/pragmatic structure of documents. Its objectiv...

Katashi Nagao, Kôiti Hasida



NAACL
2010

182 views, Computational Linguistics

Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment

15 years 4 months ago

Download research.microsoft.com

The quality of a statistical machine translation (SMT) system is heavily dependent upon the amount of parallel sentences used in training. In recent years, there have been several...

Jason R. Smith, Chris Quirk, Kristina Toutanova



CEEMAS
2005
Springer

88 views, Intelligent Agents

Selection in Scale-Free Small World

15 years 11 months ago

Download www.cse.sc.edu

Abstract. In this paper we compare our selection based learning algorithm with the reinforcement learning algorithm in Web crawlers. The task of the crawlers is to find new inform...

Zsolt Palotai, Csilla Farkas, András Lö...



CIKM
2005
Springer

114 views, Information Technology

Maximal termsets as a query structuring mechanism

15 years 11 months ago

Download homepages.dcc.ufmg.br

Search engines process queries conjunctively to restrict the size of the answer set. Further, it is not rare to observe a mismatch between the vocabulary used in the text of Web p...

Bruno Pôssas, Nivio Ziviani, Berthier A. Rib...



WWW
2003
ACM

171 views, Internet Technology

Improving pseudo-relevance feedback in web information retrieval using web page segmentation

16 years 7 months ago

Download research.microsoft.com

In contrast to traditional document retrieval, a web page as a whole is not a good information unit to search because it often contains multiple topics and a lot of irrelevant inf...

Shipeng Yu, Deng Cai, Ji-Rong Wen, Wei-Ying Ma

