Search Sciweavers | Sciweavers

874 search results - page 18 / 175

» Jedi: Extracting and Synthesizing Information from the Web

140

click to vote

WWW
2009
ACM

209views Internet Technology» more WWW 2009»

Incorporating site-level knowledge to extract structured data from web forums

16 years 2 months ago

Download www2009.eprints.org

Web forums have become an important data resource for many web applications, but extracting structured data from unstructured web forum pages is still a challenging task due to bo...

Jiang-Ming Yang, Rui Cai, Yida Wang, Jun Zhu, Lei ...

claim paper

Read More »

114

Voted

CLEF
2010
Springer

164views Information Technology» more CLEF 2010»

Person Attribute Extraction from the Textual Parts of Web Pages

15 years 2 months ago

Download www.clef2010.org

We present the RGAI systems which participated in the third Web People Search Task challenge. The chief characteristics of our approach are that we focus on the raw textual parts o...

István Nagy, Richárd Farkas

claim paper

Read More »

108

click to vote

CIKM
2009
Springer

115views Information Technology» more CIKM 2009»

Data extraction from the web using wild card queries

15 years 6 months ago

Download webdocs.cs.ualberta.ca

This paper presents an overview of our framework for searching and retrieving facts and relationships within natural language text sources. In this framework, an extraction task o...

Davood Rafiei, Haobin Li

claim paper

Read More »

128

Voted

SIGIR
2005
ACM

156views Information Technology» more SIGIR 2005»

Title extraction from bodies of HTML documents and its application to web page retrieval

15 years 7 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...

Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...

claim paper

Read More »

128

click to vote

SYNASC
2006
IEEE

211views Algorithms» more SYNASC 2006»

HTML Pattern Generator--Automatic Data Extraction from Web Pages

15 years 7 months ago

Download www.informatik.tu-cottbus.de

Existing methods of information extraction from HTML documents include manual approach, supervised learning and automatic techniques. The manual method has high precision and reca...

Mirel Cosulschi, Adrian Giurca, Bogdan Udrescu, Ni...

claim paper

Read More »

« Prev « First page 18 / 175 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers