Protein structure informs functional mechanisms and the risk of disease

Publication date

2021-09-30



Abstract

Classifying proteins into evolutionary families is important for identifying conserved sequence and structure features that are key to the functional mechanisms of these proteins. Our in-house CATH classification currently classifies ~450,000 protein structures and nearly 150 million protein domain sequences into ~5500 evolutionary families. The recent success in protein structure prediction by DeepMind’s AlphaFold2 (AF2) method and the expected release of hundreds of thousands of AF2 models, will change the scientific landscape by massively extending the structural data available for these protein evolutionary families. We have developed a strategy to bring this extensive new 3D data into CATH families and are examining how this data will expand our understanding of structure – function relationships and our ability to detect functional sites. Functional site predictions can be enhanced by combining structural features and evolutionary conservation patterns and some examples will be given of the application of CATH functional site data to understand protein splice events, the risk of Covid infection and the development of more aggressive lung cancer following whole genome duplication.

Document Type

Conference report

Language

English

Publisher

Barcelona Supercomputing Center

Recommended citation

This citation was generated automatically.

Rights

http://creativecommons.org/licenses/by-nc-nd/4.0/

Open Access

Attribution-NonCommercial-NoDerivatives 4.0 International

This item appears in the following Collection(s)

Congressos [11156]