Sciweavers

» Extraction of Structural Information from the Web
AAAI
2010
14 years 11 months ago
Toward an Architecture for Never-Ending Language Learning
We consider here the problem of building a never-ending language learner; that is, an intelligent computer agent that runs forever and that each day must (1) extract, or read, inf...
Andrew Carlson, Justin Betteridge, Bryan Kisiel, B...
WWW
2005
ACM
15 years 10 months ago
Extracting context to improve accuracy for HTML content extraction
Web pages contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article, which distracts a user from actual content. Extraction of "use...
Suhit Gupta, Gail E. Kaiser, Salvatore J. Stolfo
WWW
2011
ACM
14 years 4 months ago
FACTO: a fact lookup engine based on web tables
Recently answers for fact lookup queries have appeared on major search engines. For example, for the query {Barack Obama date of birth} Google directly shows “4 August 1961” a...
Xiaoxin Yin, Wenzhao Tan, Chao Liu
BIS
2006
14 years 11 months ago
Expected Utility of Content Blocks in Web Content Extraction
In this paper we discuss the possible application of new concepts in web content extraction: utility assessment, utility annealing, and dynamic aggregated document generation. Aft...
Marek Kowalkiewicz
JCDL
2006
ACM
15 years 3 months ago
Automatic extraction of table metadata from digital documents
Tables are used to present, list, summarize, and structure important data in documents. In scholarly articles, they are often used to present the relationships among data and high...
Ying Liu, Prasenjit Mitra, C. Lee Giles, Kun Bai