Sciweavers

8795 search results - page 206 / 1759
» Measuring Generality of Documents
Sort
View
154
Voted
SIGCOMM
1996
ACM
15 years 8 months ago
Removal Policies in Network Caches for World-Wide Web Documents
World-Wide Web proxy servers that cache documents can potentially reduce three quantities: the number of requests that reach popular servers, the volume of network trac resulting ...
Marc Abrams, Charles R. Standridge, Ghaleb Abdulla...
186
Voted
ICDAR
2011
IEEE
14 years 4 months ago
Ternary Entropy-Based Binarization of Degraded Document Images Using Morphological Operators
—A vast number of historical and badly degraded document images can be found in libraries, public, and national archives. Due to the complex nature of different artifacts, such p...
T. Hoang Ngan Le, Tien D. Bui, Ching Y. Suen
ICDM
2006
IEEE
132views Data Mining» more  ICDM 2006»
15 years 10 months ago
High Quality, Efficient Hierarchical Document Clustering Using Closed Interesting Itemsets
High dimensionality remains a significant challenge for document clustering. Recent approaches used frequent itemsets and closed frequent itemsets to reduce dimensionality, and to...
Hassan H. Malik, John R. Kender
WWW
2007
ACM
16 years 5 months ago
Extensible schema documentation with XSLT 2.0
XML Schema documents are defined using an XML syntax, which means that the idea of generating schema documentation through standard XML technologies is intriguing. We present X2Do...
Felix Michel, Erik Wilde
ICDAR
2003
IEEE
15 years 10 months ago
Gabor Filter Based Multi-class Classifier for Scanned Document Images
When scanning documents with a large number of pages such as books, it is often feasible to provide a minimal number of training samples to personalize the system to compensate fo...
Huanfeng Ma, David S. Doermann