Sciweavers

1950 search results - page 127 / 390
» Informative sampling for large unbalanced data sets
Sort
View
154
Voted
BMCBI
2008
175views more  BMCBI 2008»
15 years 3 months ago
Synonym set extraction from the biomedical literature by lexical pattern discovery
Background: Although there are a large number of thesauri for the biomedical domain many of them lack coverage in terms and their variant forms. Automatic thesaurus construction b...
John McCrae, Nigel Collier
139
Voted
BMCBI
2004
206views more  BMCBI 2004»
15 years 3 months ago
Combining gene expression data from different generations of oligonucleotide arrays
Background: One of the important challenges in microarray analysis is to take full advantage of previously accumulated data, both from one's own laboratory and from public re...
Kyu Baek Hwang, Sek Won Kong, Steven A. Greenberg,...
194
Voted

Publication
197views
13 years 11 months ago
Convex non-negative matrix factorization for massive datasets
Non-negative matrix factorization (NMF) has become a standard tool in data mining, information retrieval, and signal processing. It is used to factorize a non-negative data matrix ...
C. Thurau, K. Kersting, M. Wahabzada, and C. Bauck...
CVPR
2005
IEEE
16 years 5 months ago
Semi-Supervised Cross Feature Learning for Semantic Concept Detection in Videos
For large scale automatic semantic video characterization, it is necessary to learn and model a large number of semantic concepts. But a major obstacle to this is the insufficienc...
Rong Yan, Milind R. Naphade
ICML
2007
IEEE
16 years 4 months ago
Self-taught learning: transfer learning from unlabeled data
We present a new machine learning framework called "self-taught learning" for using unlabeled data in supervised classification tasks. We do not assume that the unlabele...
Rajat Raina, Alexis Battle, Honglak Lee, Benjamin ...