Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

11

GCB
1997
Springer

favoriteEmaildiscussreport

77views Biometrics» more GCB 1997»

Statistics of large scale sequence searching

13 years 8 months ago

Statistics of large scale sequence searching

Download bioinformatics.oxfordjournals.org

Motivation: Database search programs such as FASTA, BLAST or a rigorous Smith–Waterman algorithm produce lists of database entries, which are assumed to be related to the query. The computation of statistical significance of similarity scores is well established for single pairs of sequences and using purely random models. However, the multi-trial context of a database search poses new problems. The credibility of a certain score obtained in a database search decreases with the amount of data that is compared. To improve p-value computation for database search experiments, statistical properties of the databases, such as the distribution of sequence length and effects induced by frequently repeated sequence patterns, need to be taken into account. Results: We investigated the SWISS-PROT protein database

Rainer Spang, Martin Vingron

Real-time Traffic

Biometrics | Database Search | Database Search Programs | GCB 1997 | Rigorous Smith–waterman Algorithm |

claim paper

Related Content

» A Scalable Parallel Approach for Peptide Identification from LargeScale Mass Spectrometry ...

» Structure Learning on Large Scale Common Sense Statistical Models of Human State

» MultiFaceted Information Retrieval System for Large Scale Email Archives

» Local sequence alignments statistics deviations from Gumbel statistics in the rareevent ta...

» Tuffy Scaling up Statistical Inference in Markov Logic Networks using an RDBMS

» An approach to large scale identification of nonobvious structural similarities between pr...

» HiLighter Automatically Building Robust Signatures of Performance Behavior for Small and L...

» Modified Logistic Regression An Approximation to SVM and Its Applications in LargeScale Te...

» P4P Practical LargeScale PrivacyPreserving Distributed Computation Robust against Maliciou...

Post Info
More Details (n/a)

Added	07 Aug 2010
Updated	07 Aug 2010
Type	Conference
Year	1997
Where	GCB
Authors	Rainer Spang, Martin Vingron

Comments (0)