Sciweavers

502 search results - page 47 / 101
» Extracting Partial Structures from HTML Documents
DAS
1998
Springer
15 years 3 months ago
Group 4 Compressed Document Matching
Numerous approaches, including textual, structural and featural, to detecting duplicate documents have been investigated. Considering document images are usually stored and transm...
Dar-Shyang Lee, Jonathan J. Hull
ICDAR
2005
IEEE
15 years 5 months ago
Enhancement of Layout-based Identification of Low-resolution Documents using Geometrical Color Distribution
This paper proposes a multi-signature document identification method that works robustly with low-resolution documents captured from handheld devices. The proposed method is based ...
Ardhendu Behera, Denis Lalanne, Rolf Ingold
ICADL
2007
Springer
15 years 5 months ago
Automated Template-Based Metadata Extraction Architecture
This paper describes our efforts to develop a toolset and process for automated metadata extraction from large, diverse, and evolving document collections. A number of federal agen...
Paul Flynn, Li Zhou, Kurt Maly, Steven J. Zeil, Mo...
HT
2009
ACM
15 years 5 months ago
2LIPGarden: 3D hypermedia for everyone
The early Web was hailed for being easy to use, and what is more important, giving people a chance to participate in its growth. The Web3D was believed to have potential to be the...
Jacek Jankowski, Izabela Irzynska, Bill McDaniel, ...
WEBDB
2010
Springer
15 years 4 months ago
Redundancy-Driven Web Data Extraction and Integration
A large number of web sites publish pages containing structured information about recognizable concepts, but these data are only partially used by current applications. Although s...
Paolo Papotti, Valter Crescenzi, Paolo Merialdo, M...