Sciweavers

1950 search results - page 143 / 390
» Informative sampling for large unbalanced data sets
Sort
View
159
Voted
BIOINFORMATICS
2011
14 years 7 months ago
When the Web meets the cell: using personalized PageRank for analyzing protein interaction networks
Motivation: Enormous, and constantly increasing quantity of biological information is represented in protein interaction network databases. Most of these data are freely accessibl...
Gábor Iván, Vince Grolmusz
238
Voted
CIKM
2003
Springer
15 years 8 months ago
Using titles and category names from editor-driven taxonomies for automatic evaluation
Evaluation of IR systems has always been difficult because of the need for manually assessed relevance judgments. The advent of large editor-driven taxonomies on the web opens the...
Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury...
116
Voted
IQIS
2004
ACM
15 years 9 months ago
A Framework for Analysis of Data Freshness
Data freshness has been identified as one of the most important data quality attributes in information systems. This importance increases particularly in the context of distribute...
Mokrane Bouzeghoub, Verónika Peralta
128
Voted
ITNG
2010
IEEE
15 years 8 months ago
A Fast and Stable Incremental Clustering Algorithm
— Clustering is a pivotal building block in many data mining applications and in machine learning in general. Most clustering algorithms in the literature pertain to off-line (or...
Steven Young, Itamar Arel, Thomas P. Karnowski, De...
170
Voted
WWW
2002
ACM
16 years 4 months ago
OCTOPUS: aggressive search of multi-modality data using multifaceted knowledge base
An important trend in Web information processing is the support of multimedia retrieval. However, the most prevailing paradigm for multimedia retrieval, content-based retrieval (C...
Jun Yang 0003, Qing Li, Yueting Zhuang