Weave amino acid sequences for protein secondary structure prediction

Given a known protein sequence, predicting its secondary structure can help us understand its three-dimensional (tertiary) structure, i.e., its folding. In this paper, we present an approach for predicting protein secondary structures. Unlike existing prediction methods, our approach proposes an encoding schema that weaves physicochemical information into the encoded vectors and a prediction framework that combines context information with secondary structure segments. We trained a Support Vector Machine (SVM) on the CB513 and RS126 data sets, which are collections of protein secondary structure sequences, using sevenfold cross-validation to uncover the structural differences among protein secondary structures. We then apply a sliding-window technique to test a set of protein sequences based on the group classification learned from the training set. Our approach achieves 77.8% segment overlap accuracy (SOV) and 75.2% three-state overall per-residue accuracy (Q...
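The abstract does not spell out the encoding schema, so the following is only a minimal sketch of the general idea: each residue is encoded as a one-hot vector with one physicochemical value appended, and a sliding window concatenates the encodings of neighbouring residues into one feature vector per position. The window size of 13, the use of the Kyte-Doolittle hydropathy scale as the physicochemical property, and all function names are assumptions made for illustration, not the authors' method. A per-residue accuracy helper is included to show how a three-state prediction string could be scored against a reference.

```python
from typing import List

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Kyte-Doolittle hydropathy values, used here only as a stand-in
# physicochemical property (an assumption, not the paper's schema).
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

DIM = len(AMINO_ACIDS) + 1  # 20 one-hot slots plus one property value


def encode_residue(aa: str) -> List[float]:
    """One-hot encode a residue and append its hydropathy value.

    Unknown residues are encoded as an all-zero vector.
    """
    vec = [0.0] * DIM
    idx = AMINO_ACIDS.find(aa)
    if idx >= 0:
        vec[idx] = 1.0
        vec[-1] = HYDROPATHY[aa]
    return vec


def window_features(seq: str, window: int = 13) -> List[List[float]]:
    """Return one feature vector per residue, built from a centred window.

    Positions that fall outside the sequence are padded with zeros.
    """
    if window % 2 == 0:
        raise ValueError("window size must be odd")
    half = window // 2
    features = []
    for i in range(len(seq)):
        vec: List[float] = []
        for j in range(i - half, i + half + 1):
            if 0 <= j < len(seq):
                vec.extend(encode_residue(seq[j]))
            else:
                vec.extend([0.0] * DIM)
        features.append(vec)
    return features


def per_residue_accuracy(predicted: str, observed: str) -> float:
    """Fraction of positions whose predicted state matches the observed one."""
    if len(predicted) != len(observed) or not observed:
        raise ValueError("strings must be non-empty and of equal length")
    matches = sum(p == o for p, o in zip(predicted, observed))
    return matches / len(observed)
```

Feature vectors of this shape could then be passed to any SVM implementation, with the labelled sequences split into seven folds for cross-validation as the abstract describes.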
Xiaochun Yang, Bin Wang
Added 05 Jul 2010
Updated 05 Jul 2010
Type Conference
Year 2003
Where DMKD
Authors Xiaochun Yang, Bin Wang