Sciweavers

119 search results - page 4 / 24
» Learning to Extract Text-Based Information from the World Wi...
CIS
2005
Springer
13 years 12 months ago
A Method for Automating the Extraction of Specialized Information from the Web
The World Wide Web can be viewed as a gigantic distributed database including millions of interconnected hosts, some of which publish information via web servers or peer-to-peer sys...
Ling Lin, Antonio Liotta, Andrew Hippisley
RIAO
1997
13 years 7 months ago
Coupling information retrieval and information extraction: A new text technology for gathering information from the web
The techniques of information retrieval and information extraction are complementary, but to date there has been little concrete work aimed at integrating the two. We describe how...
Robert J. Gaizauskas, Alexander M. Robertson
WWW
2007
ACM
14 years 7 months ago
Answering bounded continuous search queries in the world wide web
Search queries applied to extract relevant information from the World Wide Web over a period of time may be denoted as continuous search queries. The improvement of continuous sea...
Dirk Kukulenz, Alexandros Ntoulas
WWW
2003
ACM
14 years 7 months ago
Efficient URL caching for world wide web crawling
Crawling the web is deceptively simple: the basic algorithm is (a) Fetch a page (b) Parse it to extract all linked URLs (c) For all the URLs not seen before, repeat (a)-(c). Howev...
Andrei Z. Broder, Marc Najork, Janet L. Wiener
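
The basic loop described in the abstract above, steps (a) to (c), can be sketched roughly as follows. This is a minimal illustration only, not the method of the paper: the "seen" test here is a plain in-memory Python set rather than any caching scheme, and the `crawl` function, the `LinkExtractor` class, the `fetch` callback, and the `example.test` pages are all hypothetical names introduced for this example. An in-memory dictionary stands in for the network so the sketch runs without any real requests.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects the href values of <a> tags (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed, fetch):
    """Breadth-first crawl from `seed`.

    `fetch` is an assumed callback: it takes a URL and returns the page's
    HTML as a string, or None if the page is unavailable.
    """
    seen = {seed}              # URLs already discovered (plain set, no cache)
    frontier = deque([seed])   # URLs waiting to be fetched
    order = []                 # URLs fetched successfully, in visit order
    while frontier:
        url = frontier.popleft()
        html = fetch(url)                  # (a) fetch a page
        if html is None:
            continue
        order.append(url)
        parser = LinkExtractor()
        parser.feed(html)                  # (b) parse it to extract linked URLs
        for href in parser.links:
            link = urljoin(url, href)      # resolve relative links
            if link not in seen:           # (c) only URLs not seen before
                seen.add(link)
                frontier.append(link)
    return order


# Hypothetical in-memory "web" used in place of real HTTP requests.
PAGES = {
    "http://example.test/": '<a href="/a">A</a> <a href="/b">B</a>',
    "http://example.test/a": '<a href="/b">B</a> <a href="/">home</a>',
    "http://example.test/b": '<a href="/a">A</a>',
}

visited = crawl("http://example.test/", PAGES.get)
```

Even though the three pages link to each other in a cycle, each is fetched once, because the membership test against `seen` filters out URLs that were already discovered. In this sketch that test is a set lookup held entirely in memory.
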
WWW
2010
ACM
13 years 11 months ago
Web-scale knowledge extraction from semi-structured tables
A wealth of knowledge is encoded in the form of tables on the World Wide Web. We propose a classification algorithm and a rich feature set for automatically recognizing layout tab...
Eric Crestan, Patrick Pantel