Search Sciweavers | Sciweavers

4234 search results - page 265 / 847

» A Method for Web Information Extraction

ACL
2011

219 views, Computational Linguistics

Joint Annotation of Search Queries

14 years 9 months ago

Download ciir.cs.umass.edu

Marking up search queries with linguistic annotations such as part-of-speech tags, capitalization, and segmentation is an important part of query processing and understanding in ...

Michael Bendersky, W. Bruce Croft, David A. Smith


WWW
2006
ACM

112 views, Internet Technology

Using graph matching techniques to wrap data from PDF documents

16 years 6 months ago

Download rewerse.net

Wrapping is the process of navigating a data source, semiautomatically extracting data and transforming it into a form suitable for data processing applications. There are current...

Tamir Hassan, Robert Baumgartner


CIKM
2009
ACM

127 views, Information Technology

Vetting the links of the web

16 years 24 days ago

Download www.cse.lehigh.edu

Many web links mislead human surfers and automated crawlers because they point to changed content, out-of-date information, or invalid URLs. It is a particular problem for large, ...

Na Dai, Brian D. Davison


SIGIR
2004
ACM

168 views, Information Technology

Block-based web search

15 years 11 months ago

Download research.microsoft.com

Multiple-topic and varying-length of web pages are two negative factors significantly affecting the performance of web search. In this paper, we explore the use of page segmentati...

Deng Cai, Shipeng Yu, Ji-Rong Wen, Wei-Ying Ma


JCDL
2005
ACM

100 views, Education

Automatic extraction of titles from general documents using machine learning

15 years 11 months ago

Download research.microsoft.com

In this paper, we propose a machine learning approach to title extraction from general documents. By general documents, we mean documents that can belong to any one of a number of...

Yunhua Hu, Hang Li, Yunbo Cao, Dmitriy Meyerzon, Q...
