web pages | Sciweavers

IUI
2006
ACM

172 views, Software Engineering

Recovering semantic relations from web pages based on visual cues

13 years 10 months ago

Recovering semantic relations between different parts of web pages is of great importance for multi-platform web interface development, as they make it possible to re-distribute ...

Peifeng Xiang, Yuanchun Shi

HT
2006
ACM

150 views, Internet Technology

Implementation and evaluation of a quality-based search engine

13 years 10 months ago

Download quiew.itc.it

In this paper, an approach for the implementation of a quality-based Web search engine is proposed. Quality retrieval is introduced and an overview on previous efforts to implement...

Thomas Mandl

HT
2006
ACM

150 views, Internet Technology

Hyperlink assessment based on web usage mining

13 years 10 months ago

Download www.zsi.pwr.wroc.pl

One of the basic methods of web usage mining is association rules that indicate relationships among common use of web pages. Positive and confined negative association rules are ...

Przemyslaw Kazienko, Marcin Pilarczyk

HT
2006
ACM

109 views, Internet Technology

Just-in-time recovery of missing web pages

13 years 10 months ago

Download www.cs.odu.edu

We present Opal, a light-weight framework for interactively locating missing web pages (HTTP status code 404). Opal is an example of “in vivo” preservation: harnessing the col...

Terry L. Harrison, Michael L. Nelson

ADC
2006
Springer

130 views, Database

A two-phase rule generation and optimization approach for wrapper generation

13 years 10 months ago

Download crpit.com

Web information extraction is a fundamental issue for web information management and integrations. A common approach is to use wrappers to extract data from web pages or documents...

Yanan Hao, Yanchun Zhang

WEBI
2007
Springer

144 views, Internet Technology

Geographically-Sensitive Link Analysis

13 years 10 months ago

Download www.cs.toronto.edu

Many web pages and resources are primarily relevant to certain geographic locations. For example, in many queries web pages on restaurants, hotels, or movie theaters are only rele...

Hyun Chul Lee, Haifeng Liu, Renée J. Miller

WEBDB
2007
Springer

126 views, Database

Towards a Content-Provider-Friendly Web Page Crawler

13 years 10 months ago

Download leo.saclay.inria.fr

Search engine quality is impacted by two factors: the quality of the ranking/matching algorithm used and the freshness of the search engine’s index, which maintains a “snapsho...

Jie Xu, Qinglan Li, Huiming Qu, Alexandros Labrini...

WEBDB
2007
Springer

133 views, Database

EntityAuthority: Semantically Enriched Graph-Based Authority Propagation

13 years 10 months ago

Download www.cs.columbia.edu

This paper pursues the recently emerging paradigm of searching for entities that are embedded in Web pages. We utilize information-extraction techniques to identify entity candidat...

Julia Stoyanovich, Srikanta J. Bedathur, Klaus Ber...

SOFSEM
2007
Springer

156 views, Theoretical Computer Science

Creating Permanent Test Collections of Web Pages for Information Extraction Research

13 years 10 months ago

Download www.dbai.tuwien.ac.at

In the research area of automatic web information extraction, there is a need for permanent and annotated web page collections enabling objective performance evaluation of differen...

Bernhard Pollak, Wolfgang Gatterbauer

PKDD
2007
Springer

120 views, Data Mining

Site-Independent Template-Block Detection

13 years 10 months ago

Download research.microsoft.com

Detection of template and noise blocks in web pages is an important step in improving the performance of information retrieval and content extraction. Of the many approaches propos...

Aleksander Kolcz, Wen-tau Yih
