Sciweavers

3705 search results - page 229 / 741
» Building Documentation Generators
Sort
View
CEAS
2007
Springer
15 years 11 months ago
Hardening Fingerprinting by Context
Near-duplicate detection is not only an important pre and post processing task in Information Retrieval but also an effective spam-detection technique. Among different approache...
Aleksander Kolcz, Abdur Chowdhury
WWW
2005
ACM
15 years 11 months ago
Finding the boundaries of information resources on the web
In recent years, many algorithms for the Web have been developed that work with information units distinct from individual web pages. These include segments of web pages or aggreg...
Pavel Dmitriev, Carl Lagoze, Boris Suchkov
IDEAS
2002
IEEE
125views Database» more  IDEAS 2002»
15 years 10 months ago
Integrating HTML Tables Using Semantic Hierarchies And Meta-Data Sets
As the Internet is a global network, there is a demand on accessing closely related data without browsing through di erent Web documents. A signi cant amount of these data are pre...
Seung Jin Lim, Yiu-Kai Ng, Xiaochun Yang
WISE
2000
Springer
15 years 9 months ago
Object-Oriented Mediator Queries to XML Data
The mediator/wrapper approach is used to integrate data from different databases and other data sources by introducing a middleware virtual database that provides high level abstr...
Hui Lin, Tore Risch, Timour Katchaounov
DOCENG
2010
ACM
15 years 4 months ago
From templates to schemas: bridging the gap between free editing and safe data processing
In this paper we present tools that provide an easy way to edit XML content directly on the web, with the usual benefit of valid XML content. These tools make it possible to crea...
Vincent Quint, Cécile Roisin, Stépha...