Search Sciweavers | Sciweavers

820 search results - page 45 / 164

» Deep web data extraction

151

Voted

WIDM
2003
ACM

97views Internet Technology» more WIDM 2003»

Schema-guided wrapper maintenance for web-data extraction

15 years 11 months ago

Download www.ics.uci.edu

Extracting data from Web pages using wrappers is a fundamental problem arising in a large variety of applications of vast practical interests. There are two main issues relevant t...

Xiaofeng Meng, Dongdong Hu, Chen Li

claim paper

Read More »

157

click to vote

IPM
2007

149views more IPM 2007»

Web page title extraction and its application

15 years 5 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents (web pages). Titles of HTML documents should be correctly defined in the title fields...

Yewei Xue, Yunhua Hu, Guomao Xin, Ruihua Song, Shu...

claim paper

Read More »

190

click to vote

IJBIDM
2011

127views more IJBIDM 2011»

WebUser: mining unexpected web usage

15 years 18 days ago

Download www.lirmm.fr

: Web usage mining has been much concentrated on the discovery of relevant user behaviours from Web access record data. In this paper, we present WebUser, an approach to discover u...

Dong (Haoyuan) Li, Anne Laurent, Pascal Poncelet

claim paper

Read More »

172

click to vote

SIGIR
2005
ACM

156views Information Technology» more SIGIR 2005»

Title extraction from bodies of HTML documents and its application to web page retrieval

15 years 11 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...

Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...

claim paper

Read More »

145

click to vote

EACL
2006
ACL Anthology

143views Natural Language Processing» more EACL 2006»

Web Text Corpus for Natural Language Processing

15 years 7 months ago

Download www.cs.usyd.edu.au

Web text has been successfully used as training data for many NLP applications. While most previous work accesses web text through search engine hit counts, we created a Web Corpu...

Vinci Liu, James R. Curran

claim paper

Read More »

« Prev « First page 45 / 164 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers