Sciweavers

» Distributed hierarchical document clustering
IPM
2007
15 years 3 months ago
Cluster-based patent retrieval
Through the recent NTCIR workshops, patent retrieval has posed many challenging issues to the information retrieval community. Unlike newspaper articles, patent documents are very long an...
In-Su Kang, Seung-Hoon Na, Jungi Kim, Jong-Hyeok L...
PDP
2008
IEEE
15 years 10 months ago
Bulk-Synchronous On-Line Crawling on Clusters of Computers
This paper describes the design of a crawler devised to perform the periodic retrieval of Web documents for a search engine able to accept on-line updates in a concurrent manner. ...
Mauricio Marín, Carolina Bonacic
ICML
2007
IEEE
16 years 4 months ago
Mixtures of hierarchical topics with Pachinko allocation
The four-level pachinko allocation model (PAM) (Li & McCallum, 2006) represents correlations among topics using a DAG structure. It does not, however, represent a nested hiera...
David M. Mimno, Wei Li, Andrew McCallum
LAWEB
2003
IEEE
15 years 9 months ago
On the Evolution of Clusters of Near-Duplicate Web Pages
This paper expands on a 1997 study of the amount and distribution of near-duplicate pages on the World Wide Web. We downloaded a set of 150 million web pages on a weekly basis ove...
Dennis Fetterly, Mark Manasse, Marc Najork
MONET
1998
15 years 3 months ago
Hierarchically-Organized, Multihop Mobile Wireless Networks for Quality-of-Service Support
MMWN is a modular system of link- and network-layer algorithms that enables a multihop mobile wireless network to support distributed, real-time multimedia applications. In this pa...
Ram Ramanathan, Martha Steenstrup