Sciweavers

2383 search results - page 79 / 477
» Finding Representative Set from Massive Data
Sort
View
KDD
2008
ACM
206views Data Mining» more  KDD 2008»
15 years 10 months ago
Identifying biologically relevant genes via multiple heterogeneous data sources
Selection of genes that are differentially expressed and critical to a particular biological process has been a major challenge in post-array analysis. Recent development in bioin...
Zheng Zhao, Jiangxin Wang, Huan Liu, Jieping Ye, Y...
BMCBI
2006
159views more  BMCBI 2006»
14 years 10 months ago
MultiSeq: unifying sequence and structure data for evolutionary analysis
Background: Since the publication of the first draft of the human genome in 2000, bioinformatic data have been accumulating at an overwhelming pace. Currently, more than 3 million...
Elijah Roberts, John Eargle, Dan Wright, Zaida Lut...
MSR
2010
ACM
15 years 3 months ago
The Ultimate Debian Database: Consolidating bazaar metadata for Quality Assurance and data mining
—FLOSS distributions like RedHat and Ubuntu require a lot more complex infrastructures than most other FLOSS projects. In the case of community-driven distributions like Debian, ...
Lucas Nussbaum, Stefano Zacchiroli
AAAI
2007
15 years 9 days ago
The Impact of Time on the Accuracy of Sentiment Classifiers Created from a Web Log Corpus
We investigate the impact of time on the predictability of sentiment classification research for models created from web logs. We show that sentiment classifiers are time dependen...
Kathleen T. Durant, Michael D. Smith
EDBT
2004
ACM
268views Database» more  EDBT 2004»
15 years 10 months ago
DBDC: Density Based Distributed Clustering
Abstract. Clustering has become an increasingly important task in modern application domains such as marketing and purchasing assistance, multimedia, molecular biology as well as m...
Eshref Januzaj, Hans-Peter Kriegel, Martin Pfeifle