Sciweavers

» A Method for Web Information Extraction
DEBU
1999
15 years 1 month ago
Data Management for XML: Research Directions
This paper is a July 1999 snapshot of a "whitepaper" that I've been working on. The purpose of the whitepaper, which I initially drafted in April 1999, was to formu...
Jennifer Widom
WWW
2005
ACM
16 years 2 months ago
Web data cleansing for information retrieval using key resource page selection
With the page explosion of WWW, how to cover more useful information with limited storage and computation resources becomes more and more important in web IR research. Using web p...
Yiqun Liu, Canhui Wang, Min Zhang, Shaoping Ma
HICSS
1999
IEEE
15 years 6 months ago
Collaborative Web Crawling: Information Gathering/Processing over Internet
The main objective of the IBM Grand Central Station (GCS) is to gather information of virtually any type of formats (text, data, image, graphics, audio, video) from the cyberspace...
Shang-Hua Teng, Qi Lu, Matthias Eichstaedt, Daniel...
ICDAR
2003
IEEE
15 years 7 months ago
Two Approaches for Text Segmentation in Web Images
There is a significant need to recognise the text in images on web pages, both for effective indexing and for presentation by non-visual means (e.g., audio). This paper presents a...
Dimosthenis Karatzas, Apostolos Antonacopoulos
HPCC
2007
Springer
15 years 8 months ago
DISH - Dynamic Information-Based Scalable Hashing on a Cluster of Web Cache Servers
Caching web pages is an important part of web infrastructure. The effects of caching services are even more pronounced for wireless infrastructures due to their limited bandwidth. ...
Andrew Sohn, Hukeun Kwak, Kyusik Chung