Sciweavers

1950 search results - page 209 / 390
» Informative sampling for large unbalanced data sets
AAAI
2008
15 years 6 months ago
Clustering via Random Walk Hitting Time on Directed Graphs
In this paper, we present a general data clustering algorithm which is based on the asymmetric pairwise measure of Markov random walk hitting time on directed graphs. Unlike tradi...
Mo Chen, Jianzhuang Liu, Xiaoou Tang
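The hitting-time measure named in this abstract can be illustrated with a small sketch. It is not the paper's clustering algorithm, only the standard expected hitting time of a Markov chain, computed by solving h(i) = 1 + sum_k P[i][k] * h(k) with h(target) = 0. The function name `hitting_times` and the three-node example graph are assumptions made for illustration.

```python
def hitting_times(P, target):
    """Expected number of steps to first reach `target` from every node.

    P is a row-stochastic transition matrix given as a list of lists.
    Illustrative helper (hypothetical name), not taken from the paper.
    """
    n = len(P)
    others = [i for i in range(n) if i != target]
    m = len(others)
    # Augmented system (I - Q) h = 1, where Q is P restricted to non-target nodes.
    A = [[(1.0 if i == j else 0.0) - P[i][j] for j in others] + [1.0]
         for i in others]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(m):
        pivot = max(range(col, m), key=lambda r: abs(A[r][col]))
        if abs(A[pivot][col]) < 1e-12:
            raise ValueError("target is not reachable from every node")
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(m):
            if r != col:
                f = A[r][col] / A[col][col]
                for c in range(col, m + 1):
                    A[r][c] -= f * A[col][c]
    h = [0.0] * n  # h[target] stays 0 by definition
    for k, i in enumerate(others):
        h[i] = A[k][m] / A[k][k]
    return h


# Directed 3-cycle 0 -> 1 -> 2 -> 0 (assumed toy graph).
P = [[0.0, 1.0, 0.0],
     [0.0, 0.0, 1.0],
     [1.0, 0.0, 0.0]]
h_to_1 = hitting_times(P, target=1)  # h_to_1[0] is the hitting time 0 -> 1
h_to_0 = hitting_times(P, target=0)  # h_to_0[1] is the hitting time 1 -> 0
```

On this directed cycle the two directions differ (one step from node 0 to node 1, two steps back), which is the asymmetry the abstract refers to.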
BMCBI
2004
15 years 4 months ago
Rank Difference Analysis of Microarrays (RDAM), a novel approach to statistical analysis of microarray expression profiling data
Background: A key step in the analysis of microarray expression profiling data is the identification of genes that display statistically significant changes in expression signals ...
Dietmar E. Martin, Philippe Demougin, Michael N. H...
BICOB
2009
Springer
15 years 1 month ago
A Biclustering Method to Discover Co-regulated Genes Using Diverse Gene Expression Datasets
We propose a two-step biclustering approach to mine co-regulation patterns of a given reference gene to discover other genes that function in a common biological process. Currently...
Doruk Bozdag, Jeffrey D. Parvin, Ümit V. Ç...
BMCBI
2002
15 years 3 months ago
ORFer - retrieval of protein sequences and open reading frames from GenBank and storage into relational databases or text files
Background: Functional genomics involves the parallel experimentation with large sets of proteins. This requires management of large sets of open reading frames as a prerequisite ...
Konrad Büssow, Steve Hoffmann, Volker Sievert
CPM
2000
Springer
15 years 8 months ago
Identifying and Filtering Near-Duplicate Documents
Abstract. The mathematical concept of document resemblance captures well the informal notion of syntactic similarity. The resemblance can be estimated using a fixed size “sketch...
Andrei Z. Broder
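As a rough illustration of the idea in this abstract, the sketch below computes resemblance as the Jaccard coefficient of word-shingle sets and estimates it from a fixed-size min-hash sketch. This is a common min-hash variant and may differ from the construction in the paper; the function names, the shingle width `w`, the sketch size `k`, and the use of salted SHA-1 as a stand-in for random permutations are all assumptions.

```python
import hashlib


def shingles(text, w=3):
    """Set of contiguous w-word sequences in `text` (w is an assumed default)."""
    words = text.lower().split()
    return {" ".join(words[i:i + w]) for i in range(max(len(words) - w + 1, 1))}


def resemblance(a, b, w=3):
    """Exact resemblance: |S(a) & S(b)| / |S(a) | S(b)| over shingle sets."""
    sa, sb = shingles(a, w), shingles(b, w)
    return len(sa & sb) / len(sa | sb)


def _salted_hash(seed, shingle):
    # Salted SHA-1 stands in for an independent random permutation (assumption).
    digest = hashlib.sha1(f"{seed}:{shingle}".encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def sketch(text, k=64, w=3):
    """Fixed-size sketch: the minimum hash value under each of k salted hashes."""
    sh = shingles(text, w)
    return [min(_salted_hash(seed, s) for s in sh) for seed in range(k)]


def estimate_resemblance(sketch_a, sketch_b):
    """Fraction of sketch positions that agree; approximates the resemblance."""
    matches = sum(x == y for x, y in zip(sketch_a, sketch_b))
    return matches / len(sketch_a)


doc_a = "the quick brown fox jumps over the lazy dog"
doc_b = "the quick brown fox jumps over the sleepy dog"
exact = resemblance(doc_a, doc_b)
approx = estimate_resemblance(sketch(doc_a), sketch(doc_b))
```

The sketch size is independent of document length, so two documents can be compared by their sketches alone. A larger `k` gives a tighter estimate at the cost of more hashing.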