Prediction Machines: Applied Machine Learning for Therapeutic Protein Design and Development.


Mid America


Overland Park Regional Medical Center

Document Type


Publication Date



machine learning, pharmaceutical protein, chemistry


Medical Biochemistry | Medical Pharmacology | Pharmaceutical Preparations


The rapid growth in technological advances and quantity of scientific data over the past decade has led to several challenges including data storage and analysis. Accurate models of complex datasets were previously difficult to develop and interpret. However, improvements in machine learning algorithms have since enabled unparalleled classification and prediction capabilities. The application of machine learning can be seen throughout diverse industries due to their ease of use and interpretability. In this review, we describe popular machine learning algorithms and highlight their application in pharmaceutical protein development. Machine learning models have now been applied to better understand the nonlinear concentration dependent viscosity of protein solutions, predict protein oxidation and deamidation rates, classify sub-visible particles and compare the physical stability of proteins. We also applied several machine learning algorithms using previously published data and describe models with improved predictions and classification. The authors hope that this review can be used as a resource to others and encourage continued application of machine learning algorithms to problems in pharmaceutical protein development.

Publisher or Conference

Journal of Pharmaceutical Sciences