Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

MotivationIn vitro and in vivo selection of vaccines is time consuming, expensive and the selected vaccines may not be able to provide protection against broad-spectrum viruses because of emerging antigenically novel disease strains. A powerful computational model that incorporates these protein/DNA or RNA level fluctuations can effectively predict antigenically variant strains, and can minimize the amount of resources spent on exclusive serological testing of vaccines and make wide spectrum vaccines possible for many diseases. However, in silico vaccine prediction remains a grand challenge. To address the challenge, we investigate the use of linear and non-linear regression models to predict the antigenic similarity in foot-and-mouth disease virus strains and in influenza strains, where the structure and parameters of the non-linear model are optimized using an evolutionary algorithm (EA). In addition, we examine two different scoring methods for weighting the type of amino acid substitutions in the linear and non-linear models. We also test our models with some unseen data.ResultsWe achieved the best prediction results on three datasets of SAT2 (Foot-and-Mouth disease), two datasets of serotype A (Foot-and-Mouth disease) and two datasets of influenza when the scoring method based on biochemical properties of amino acids is employed in combination with a non-linear regression model. Models based on substitutions in the antigenic areas performed better than those that took the entire exposed viral capsid proteins. A majority of the non-linear regression models optimi Z: ed with the EA: performed better than the linear and non-linear models whose parameters are estimated using the least-squares method. In addition, for the best models, optimi Z: ed non-linear regression models consist of more terms than their linear counterparts, implying a non-linear nature of influences of amino acid substitutions. Our models were also tested on five recently generated FMDV datasets and the best model was able to achieve an 80% agreement rate.

Original publication

DOI

10.1093/bioinformatics/btu768

Type

Journal article

Journal

Bioinformatics (Oxford, England)

Publication Date

03/2015

Volume

31

Pages

834 - 840

Addresses

Department of Computing, University of Surrey, Guildford GU2 7XH, UK, The Pirbright Institute, Pirbright GU24 0NF, UK and Department of Microbial and Cellular Sciences, University of Surrey, Guildford GU2 7XH, UK.

Keywords

Animals, Foot-and-Mouth Disease Virus, Foot-and-Mouth Disease, Capsid Proteins, Influenza Vaccines, Antigens, Viral, Computational Biology, Antigenic Variation, Algorithms, Nonlinear Dynamics, Computer Simulation, Antibodies, Neutralizing, Biological Evolution