2018 article

Modeling Dependence in Evolutionary Inference for Proteins

RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, RECOMB 2018, Vol. 10812, pp. 122–137.

By: G. Larson*, J. Thorne* & S. Schmidler n

author keywords: Protein structure; Evolution; Dynamic programming; Phylogeny; Diffusion process
TL;DR: It is demonstrated that this extended, site-dependent model (SDM), extended to account for correlated evolutionary drift among neighboring amino acid positions, yields a significant reduction of bias in estimated evolutionary distances and helps further improve phylogenetic tree reconstruction. (via Semantic Scholar)
UN Sustainable Development Goal Categories
15. Life on Land (Web of Science)
Source: Web Of Science
Added: March 18, 2019

Protein structure alignment is a classic problem of computational biology, and is widely used to identify structural and functional similarity and to infer homology among proteins. Previously a statistical model for protein structural evolution has been introduced and shown to significantly improve phylogenetic inferences compared to approaches that utilize only amino acid sequence information. Here we extend this model to account for correlated evolutionary drift among neighboring amino acid positions, resulting in a spatio-temporal model of protein structure evolution. The result is a multivariate diffusion process convolved with a spatial birth-death process, which comes with little additional computational cost or analytical complexity compared to the site-independent model (SIM). We demonstrate that this extended, site-dependent model (SDM) yields a significant reduction of bias in estimated evolutionary distances and helps further improve phylogenetic tree reconstruction.