Sciweavers

563 search results - page 66 / 113
» Crawling the web for structured documents
Sort
View
CIDR
2003
164views Algorithms» more  CIDR 2003»
15 years 3 months ago
Capacity Bound-free Web Warehouse
Web cache technologies have been developed as an extension of CPU cache, by modifying LRU (Least Recently Used) algorithms. Actually in web cache systems, we can use disks and ter...
Yahiko Kambayashi, Kai Cheng
DATESO
2004
174views Database» more  DATESO 2004»
15 years 3 months ago
On Efficient Part-match Querying of XML Data
The XML language have been becoming de-facto a standard for representation of heterogeneous data in the Internet. From database point of view, XML is a new approach to data modelli...
Michal Krátký, Marek Andrt
WSE
2003
IEEE
15 years 7 months ago
Resolution of Static Clones in Dynamic Web Pages
Cloning is extremely likely to occur in web sites, much more so than in other software. While some clones exist for valid reasons, or are too small to eliminate, cloning percentag...
Nikita Synytskyy, James R. Cordy, Thomas R. Dean
HICSS
2002
IEEE
115views Biometrics» more  HICSS 2002»
15 years 7 months ago
Fuzzy Rules for HTML Transcoding
With the increasing availability of Web-enabled mobile devices, we are facing the problem to effectively adapt Web content for those devices. For adaptation, Web page structures r...
Robbie Schaefer, Andreas Dangberg, Wolfgang Mü...
WWW
2007
ACM
16 years 2 months ago
Web object retrieval
The primary function of current Web search engines is essentially relevance ranking at the document level. However, myriad structured information about real-world objects is embed...
Zaiqing Nie, Yunxiao Ma, Shuming Shi, Ji-Rong Wen,...