Sciweavers

Extraction of Structural Information from the Web
ECIR
2010
Springer
14 years 11 months ago
Estimating Translation Probabilities from the Web for Structured Queries on CLIR
We present two methods for estimating replacement probabilities without using parallel corpora. The first method proposed exploits the possible translation probabilities latent in ...
Xabier Saralegi, Maddalen Lopez de Lacalle
ITCC
2000
IEEE
15 years 2 months ago
Towards Knowledge Discovery from WWW Log Data
As the result of interactions between visitors and a web site, an HTTP log file contains very rich knowledge about users' on-site behaviors, which, if fully exploited, can better c...
Feng Tao, Fionn Murtagh
WEBI
2004
Springer
15 years 3 months ago
Semi-Structured Complex List Extraction
The semi-structured information available in HTML and similar documents provides valuable information that can be used for information extraction applications. This information tog...
Anders Arpteg
ECML
2005
Springer
15 years 3 months ago
Learning (k, l)-Contextual Tree Languages for Information Extraction
This paper introduces a novel method for learning a wrapper for extraction of information from web pages, based upon (k,l)-contextual tree languages. It also introduces a method to...
Stefan Raeymaekers, Maurice Bruynooghe, Jan Van de...
WIRI
2005
IEEE
15 years 3 months ago
A Fast Linkage Detection Scheme for Multi-Source Information Integration
Record linkage refers to techniques for identifying records associated with the same real-world entities. Record linkage is not only crucial in integrating multi-source databases ...
Akiko N. Aizawa, Keizo Oyama