Sciweavers

1950 search results - page 161 / 390
» Informative sampling for large unbalanced data sets
Sort
View
123
Voted
DATAMINE
2006
89views more  DATAMINE 2006»
15 years 3 months ago
Scalable Clustering Algorithms with Balancing Constraints
Clustering methods for data-mining problems must be extremely scalable. In addition, several data mining applications demand that the clusters obtained be balanced, i.e., be of ap...
Arindam Banerjee, Joydeep Ghosh
143
Voted
IJCAI
1997
15 years 5 months ago
Toward Structured Retrieval in Semi-structured Information Spaces
A semi-structured information space consists of multiple collections of textual documents containing fielded or tagged sections. The space can be highly heterogeneous, because eac...
Scott B. Huffman, Catherine Baudin
126
Voted
WIESS
2000
15 years 5 months ago
Operational Information Systems: An Example from the Airline Industry
Our research is motivated by the scaleability, availability, and extensibility challenges in deploying open systems based, enterprise operational applications. We present Delta�...
Van Oleson, Karsten Schwan, Greg Eisenhauer, Beth ...
118
Voted
SIGIR
2006
ACM
15 years 9 months ago
Improving web search ranking by incorporating user behavior information
We show that incorporating user behavior data can significantly improve ordering of top results in real web search setting. We examine alternatives for incorporating feedback into...
Eugene Agichtein, Eric Brill, Susan T. Dumais
112
Voted
BMCBI
2010
165views more  BMCBI 2010»
15 years 3 months ago
Multivariate meta-analysis of proteomics data from human prostate and colon tumours
Background: There is a vast need to find clinically applicable protein biomarkers as support in cancer diagnosis and tumour classification. In proteomics research, a number of met...
Lina Hultin Rosenberg, Bo Franzén, Gert Aue...