Sciweavers

285 search results - page 54 / 57
» Ontology-based Text Document Clustering
Sort
View
73
Voted
HICSS
2003
IEEE
102views Biometrics» more  HICSS 2003»
15 years 5 months ago
Prototype-Matching System for Allocating Conference Papers
Conferences on applied research require more complicated taxonomy than traditional organization of conferences by tracks. A topic of a paper, submitted to a conference on the appl...
Antonina Kloptchenko, Barbro Back, Hannu Vanharant...
ICDE
2010
IEEE
273views Database» more  ICDE 2010»
16 years 3 days ago
WikiAnalytics: Ad-hoc Querying of Highly Heterogeneous Structured Data
Searching and extracting meaningful information out of highly heterogeneous datasets is a hot topic that received a lot of attention. However, the existing solutions are based on e...
Andrey Balmin, Emiran Curtmola
WSDM
2010
ACM
204views Data Mining» more  WSDM 2010»
15 years 7 months ago
Learning URL patterns for webpage de-duplication
Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...
Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...
107
Voted
SIGIR
2009
ACM
15 years 6 months ago
Compressing term positions in web indexes
Large search engines process thousands of queries per second on billions of pages, making query processing a major factor in their operating costs. This has led to a lot of resear...
Hao Yan, Shuai Ding, Torsten Suel
KDD
2005
ACM
160views Data Mining» more  KDD 2005»
16 years 23 days ago
Consistent bipartite graph co-partitioning for star-structured high-order heterogeneous data co-clustering
Heterogeneous data co-clustering has attracted more and more attention in recent years due to its high impact on various applications. While the co-clustering algorithms for two t...
Bin Gao, Tie-Yan Liu, Xin Zheng, QianSheng Cheng, ...