Sciweavers

119 search results - page 3 / 24
WWW
2005
ACM
14 years 6 months ago
Thresher: automating the unwrapping of semantic content from the World Wide Web
We describe Thresher, a system that lets non-technical users teach their browsers how to extract semantic web content from HTML documents on the World Wide Web. Users specify exam...
Andrew Hogue, David R. Karger
IC
2000
13 years 6 months ago
A Hyperlink Focused Browse Assistant for the World Wide Web
This paper describes a browse assistant focusing on hyperlinks. It discusses the concept and an accompanying prototype implementation of the assistant. The aim of the assistant is ...
Andreas Heuer, Ernst Georg Haffner, Uwe Roth,...
ICASSP
2008
IEEE
13 years 12 months ago
On-demand new word learning using world wide web
Most of the Web-based methods for lexicon augmenting consist in capturing global semantic features of the targeted domain in order to collect relevant documents from the Web. We s...
Stanislas Oger, Georges Linares, Frédé...
WEBI
2001
Springer
13 years 10 months ago
World Wide Web - A Multilingual Language Resource
Abstract. This paper argues that the World Wide Web could be regarded not only as an information resource but also as a dynamic, multilingual, least controlled, easy to access and ...
Fang Li, Huanye Sheng, Wilhelm Weisweber
KDD
2002
ACM
14 years 5 months ago
Web site mining: a new way to spot competitors, customers and suppliers in the world wide web
When automatically extracting information from the World Wide Web, most established methods focus on spotting single HTML documents. However, the problem of spotting complete web s...
Martin Ester, Hans-Peter Kriegel, Matthias Schuber...