Sciweavers

382 search results - page 9 / 77
» Using symbolic objects to cluster web documents
Sort
View
IC
2003
14 years 11 months ago
Internet Collaboration Using the W3C Document Object Model
The Internet makes it possible to share information (e.g. text, image, audio, video and other formats of data) across the globe. In this paper we look at collaborative Internet en...
Xiaohong Qiu, Bryan Carpenter, Geoffrey Fox
ICCS
2009
Springer
15 years 4 months ago
Frequent Itemset Mining for Clustering Near Duplicate Web Documents
A vast amount of documents in the Web have duplicates, which is a challenge for developing efficient methods that would compute clusters of similar documents. In this paper we use ...
Dmitry I. Ignatov, Sergei O. Kuznetsov
78
Voted
WWW
2005
ACM
15 years 3 months ago
Finding the boundaries of information resources on the web
In recent years, many algorithms for the Web have been developed that work with information units distinct from individual web pages. These include segments of web pages or aggreg...
Pavel Dmitriev, Carl Lagoze, Boris Suchkov
WIDM
2003
ACM
15 years 2 months ago
Clustering documents in a web directory
Hierarchical categorization of documents is a task receiving growing interest due to the widespread proliferation of topic hierarchies for text documents. The worst problem of hie...
Giordano Adami, Paolo Avesani, Diego Sona
ICDAR
2007
IEEE
15 years 4 months ago
Representing and Characterizing Handwritten Mathematical Symbols through Succinct Functional Approximation
We model on-line ink traces for a set of 219 symbols to “best fit” low-degree polynomial series. Using a collection of mathematical writing samples, we find that in many cas...
Bruce W. Char, Stephen M. Watt