Solving the Protein Secondary Structure Prediction problem with the Hessian Free Optimization algorithm

Konstantinos Charalampous, Michalis Agathocleous, Chris Christodoulou, Vasilis Promponas

Research output: Contribution to journalArticlepeer-review

Abstract

Trying to extract features from complex sequential data for classification and prediction problems is an extremely difficult task. This task is even more challenging when both the upstream and downstream information of a time-series is important to process the sequence at a specific time-step. One typical problem which falls in this category is Protein Secondary Structure Prediction (PSSP). Recurrent Neural Networks (RNNs) have been successful in handling sequential data. These methods are demanding in terms of time and space efficiency. On the other hand, simple Feed-Forward Neural Networks (FFNNs) can be trained really fast with the Backpropagation algorithm, but in practice they give poor results in this category of problems. The Hessian Free Optimization (HFO) algorithm is one of the latest developments in the field of Artificial Neural Network (ANN) training algorithms which can converge faster and more accurately. In this paper, we present the implementation of simple FFNNs trained with the powerful HFO second-order learning algorithm for the PSSP problem. In our approach, a single FFNN trained with the HFO learning algorithm can achieve an approximately 79.6% per residue (Q3) accuracy on the PISCES dataset. Despite the simplicity of our method, the results are comparable to some of the state of the art methods which have been designed for this problem. A majority voting ensemble method and filtering with Support Vector Machines have also been applied, which increase our results to 80.4% per residue (Q3) accuracy. Finally, our method has been tested on the CASP13 independent dataset to achieve 78.14% per residue (Q3) accuracy. Moreover, the HFO does not require tuning of any parameters which makes training much faster than other state of the art methods, a very important feature with big datasets and facilitates fast training of FFNN ensembles.

Original languageEnglish (US)
JournalIEEE Access
DOIs
StateAccepted/In press - 2022

Keywords

  • Approximation algorithms
  • Hafnium oxide
  • Hessian Free Optimization
  • Neural Networks
  • Optimization
  • Predictive models
  • Protein Secondary Structure Prediction
  • Proteins
  • Second Order Learning Algorithms
  • Three-dimensional displays
  • Training

ASJC Scopus subject areas

  • Computer Science(all)
  • Materials Science(all)
  • Engineering(all)

Fingerprint

Dive into the research topics of 'Solving the Protein Secondary Structure Prediction problem with the Hessian Free Optimization algorithm'. Together they form a unique fingerprint.

Cite this