Sciweavers

922 search results - page 143 / 185
» A data mining approach to database compression
Sort
View
KDD
2004
ACM
196views Data Mining» more  KDD 2004»
15 years 10 months ago
Adversarial classification
Essentially all data mining algorithms assume that the datagenerating process is independent of the data miner's activities. However, in many domains, including spam detectio...
Nilesh N. Dalvi, Pedro Domingos, Mausam, Sumit K. ...
WWW
2009
ACM
15 years 10 months ago
Mapping the world's photos
We investigate how to organize a large collection of geotagged photos, working with a dataset of about 35 million images collected from Flickr. Our approach combines content analy...
David J. Crandall, Lars Backstrom, Daniel P. Hutte...
PKDD
2009
Springer
118views Data Mining» more  PKDD 2009»
15 years 4 months ago
Protein Identification from Tandem Mass Spectra with Probabilistic Language Modeling
This paper presents an interdisciplinary investigation of statistical information retrieval (IR) techniques for protein identification from tandem mass spectra, a challenging probl...
Yiming Yang, Abhay Harpale, Subramaniam Ganapathy
BIBE
2006
IEEE
138views Bioinformatics» more  BIBE 2006»
15 years 3 months ago
Assigning Schema Labels Using Ontology And Hueristics
Bioinformatics data is growing at a phenomenal rate. Besides the exponential growth of individual databases, the number of data depositories is increasing too. Because of the comp...
Xuan Zhang, Ruoming Jin, Gagan Agrawal
SIGMOD
2006
ACM
219views Database» more  SIGMOD 2006»
15 years 10 months ago
Modeling skew in data streams
Data stream applications have made use of statistical summaries to reason about the data using nonparametric tools such as histograms, heavy hitters, and join sizes. However, rela...
Flip Korn, S. Muthukrishnan, Yihua Wu