Sciweavers

874 search results - page 18 / 175
» Jedi: Extracting and Synthesizing Information from the Web
WWW
2009
ACM
15 years 10 months ago
Incorporating site-level knowledge to extract structured data from web forums
Web forums have become an important data resource for many web applications, but extracting structured data from unstructured web forum pages is still a challenging task due to bo...
Jiang-Ming Yang, Rui Cai, Yida Wang, Jun Zhu, Lei ...
CLEF
2010
Springer
14 years 10 months ago
Person Attribute Extraction from the Textual Parts of Web Pages
We present the RGAI systems which participated in the third Web People Search Task challenge. The chief characteristics of our approach are that we focus on the raw textual parts o...
István Nagy, Richárd Farkas
CIKM
2009
Springer
15 years 2 months ago
Data extraction from the web using wild card queries
This paper presents an overview of our framework for searching and retrieving facts and relationships within natural language text sources. In this framework, an extraction task o...
Davood Rafiei, Haobin Li
SIGIR
2005
ACM
15 years 3 months ago
Title extraction from bodies of HTML documents and its application to web page retrieval
This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...
Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...
SYNASC
2006
IEEE
15 years 3 months ago
HTML Pattern Generator--Automatic Data Extraction from Web Pages
Existing methods of information extraction from HTML documents include manual approach, supervised learning and automatic techniques. The manual method has high precision and reca...
Mirel Cosulschi, Adrian Giurca, Bogdan Udrescu, Ni...