Database queries can be broadly classified into two categories: reporting queries and aggregation queries. The former retrieves a collection of records from the database that mat...
Probabilistic Latent Semantic Indexing is a novel approach to automated document indexing which is based on a statistical latent class model for factor analysis of count data. Fit...
Affymetrix high-density oligonucleotide microarrays measure expression of DNA transcripts using probesets, i.e. multiple probes per transcript. Usually, these multiple measurement...
Dick de Ridder, Frank J. T. Staal, Jacques J. M. v...
We present an analysis of user conversations in online social media and their evolution over time. We propose a dynamic model that predicts the growth dynamics and structural pro...
Recent experiments and analysis suggest that there are about 800 million publicly indexable Web pages. However, unlike books in a traditional library, Web pages continue to change...