Sciweavers

» Effective techniques for automatic extraction of Web publica...
COLING
2010
13 years 22 days ago
A Method for Automatically Generating a Mediatory Summary to Verify Credibility of Information on the Web
In this paper, we propose a method for mediatory summarization, which is a novel technique for facilitating users' assessments of the credibility of information on the Web. A...
Hideyuki Shibuki, Takahiro Nagai, Masahiro Nakano,...
AAAI
2006
13 years 7 months ago
Table Extraction Using Spatial Reasoning on the CSS2 Visual Box Model
Tables on web pages contain a huge amount of semantically explicit information, which makes them a worthwhile target for automatic information extraction and knowledge acquisition...
Wolfgang Gatterbauer, Paul Bohunsky
ICDE
2006
IEEE
13 years 11 months ago
Automatic Extraction of Publication Time from News Search Results
The publication time of a page can have a big impact on its relevance to a query, especially for time-sensitive pages such as news items. For news search engines, the publication ...
Yiyao Lu, Weiyi Meng, Wanjing Zhang, King-Lup Liu,...
WWW
2004
ACM
14 years 6 months ago
Automatic web news extraction using tree edit distance
The Web poses itself as the largest data repository ever available in the history of humankind. Major efforts have been made in order to provide efficient access to relevant infor...
Davi de Castro Reis, Paulo Braz Golgher, Altigran ...
PVLDB
2008
13 years 5 months ago
WebTables: exploring the power of tables on the web
The World-Wide Web consists of a huge number of unstructured documents, but it also contains structured data in the form of HTML tables. We extracted 14.1 billion HTML tables from...
Michael J. Cafarella, Alon Y. Halevy, Daisy Zhe Wa...