Search Sciweavers | Sciweavers

609 search results - page 27 / 122

» Adaptive record extraction from web pages

ITCC
2000
IEEE

Towards Knowledge Discovery from WWW Log Data

15 years 6 months ago

Download media.inhatc.ac.kr

As the result of interactions between visitors and a web site, an HTTP log file contains very rich knowledge about users' on-site behaviors, which, if fully exploited, can better c...

Feng Tao, Fionn Murtagh

WEBDB
2010
Springer

Redundancy-Driven Web Data Extraction and Integration

15 years 7 months ago

Download www.dia.uniroma3.it

A large number of web sites publish pages containing structured information about recognizable concepts, but these data are only partially used by current applications. Although s...

Paolo Papotti, Valter Crescenzi, Paolo Merialdo, M...

WWW
2002
ACM

Using web structure for classifying and describing web pages

16 years 2 months ago

Download dpennock.com

The structure of the web is increasingly being used to improve organization, search, and analysis of information on the web. For example, Google uses the text in citing documents ...

Eric J. Glover, Kostas Tsioutsiouliklis, Steve Law...

HICSS
2008
IEEE

Using Visual Features for Fine-Grained Genre Classification of Web Pages

15 years 8 months ago

Download csdl2.computer.org

The field of automatic genre classification has primarily focused on extracting textual features from documents. The goal of this research is to investigate whether visual feature...

Ryan Levering, Michal Cutler, Lei Yu

EHCI
2004

Finding Iteration Patterns in Dynamic Web Page Authoring

15 years 3 months ago

Download arantxa.ii.uam.es

Most of the current WWW is made up of dynamic pages. The development of dynamic pages is a difficult and costly endeavour, out-of-reach for most users, experts, and content produce...

José A. Macías, Pablo Castells
