Large scale genomic sequence SVM classifiers

14 years 10 months ago

Download www.machinelearning.org

In genomic sequence analysis tasks like splice site recognition or promoter identification, large amounts of training sequences are available, and indeed needed to achieve sufficiently high classification performances. In this work we study two recently proposed and successfully used kernels, namely the Spectrum kernel and the Weighted Degree kernel (WD). In particular, we suggest several extensions using Suffix Trees and modifications of an SMO-like SVM training algorithm in order to accelerate the training of the SVMs and their evaluation on test sequences. Our simulations show that for the spectrum kernel and WD kernel, large scale SVM training can be accelerated by factors of 20 and 4 times, respectively, while using much less memory (e.g. no kernel caching). The evaluation on new sequences is often several thousand times faster using the new techniques (depending on the number of Support Vectors). Our method allows us to train on sets as large as one million sequences.

Bernhard Schölkopf, Gunnar Rätsch, S&oum

Real-time Traffic

ICML 2005 | Machine Learning | Spectrum Kernel | WD Kernel | Weighted Degree Kernel |

claim paper

» Tree Decomposition for LargeScale SVM Problems

» IsoSVM Distinguishing isoforms and paralogs on the protein level

» Modified Logistic Regression An Approximation to SVM and Its Applications in LargeScale Te...

» MPrime efficient large scale multiple primer and oligonucleotide design for customized gen...

» Gene prediction in metagenomic fragments A large scale machine learning approach

» HighPerformance Direct Pairwise Comparison of Large Genomic Sequences

» Feature shaping for linear SVM classifiers

» Classification of arrayCGH data using fused SVM

Post Info
More Details (n/a)

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2005
Where	ICML
Authors	Bernhard Schölkopf, Gunnar Rätsch, Sören Sonnenburg

Comments (0)

Sciweavers

Large scale genomic sequence SVM classifiers

ICML 2005 | Machine Learning | Spectrum Kernel | WD Kernel | Weighted Degree Kernel |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers