/
D irect- C oupling  A nalysis (DCA) and Its Applications in Protein Structure and Protein-Protein D irect- C oupling  A nalysis (DCA) and Its Applications in Protein Structure and Protein-Protein

D irect- C oupling A nalysis (DCA) and Its Applications in Protein Structure and Protein-Protein - PowerPoint Presentation

alyssa
alyssa . @alyssa
Follow
66 views
Uploaded On 2023-10-28

D irect- C oupling A nalysis (DCA) and Its Applications in Protein Structure and Protein-Protein - PPT Presentation

Prediction Wang Yang 201413 Outline Molecular Coevolution phenomenon A pplications of Coevolution in protein structure prediction and PPI prediction Coevolution measurement ID: 1026043

direct protein evolution information protein direct information evolution interactions model prediction statistical pair coupling correlations couplings contacts structure constraints

Share:

Link:

Embed:

Download Presentation from below link

Download Presentation The PPT/PDF document "D irect- C oupling A nalysis (DCA) and ..." is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.


Presentation Transcript

1. Direct-Coupling Analysis (DCA) and Its Applications in Protein Structure and Protein-Protein Interaction Prediction Wang Yang2014.1.3

2. OutlineMolecular Co-evolution phenomenonApplications of Co-evolution in protein structure prediction and PPI prediction.Co-evolution measurement: Local model-Mutual Information (MI) measure of coupling.Global model-Direct Coupling Analysis (DCA).Principle of Direct Coupling Analysis (DCA).Summary 2

3. Molecular Co-evolution3What is Molecular coevolution?Two (or more) genes/ proteins/ residues : exert selective pressures on each otherevolve in response to each other Molecular co-evolution can be due to specific co-adaptation between the two co-evolving elements, where changes in one of them are compensated by changes in the other, or by a less specific external force affecting the evolutionary rates of both elements in a similar magnitude. Co-evolutionary signatures between proteins serve as markers of physical interactions and/or functional relationshipsFor this reason, computational methods emerged for studying co-evolution at the protein or residue level so as to predict features such as protein-protein interactions, residue contacts within protein structures and protein functional sites.

4. Native contact by co-evolution analysis4Co-evolution information for protein structure predictionMarks, D. S.; Colwell, L. J.; Sheridan, R.; Hopf, T. A.; Pagnani, A.; Zecchina, R.; Sander, C. PLoS One 2011, 6, e28766.

5. Use Co-evolution to predict protein 3D structure5De Juan, D.; Pazos, F.; Valencia, A. Nat Rev Genet 2013, 14, 249-61.

6. Groups of co‑evolving residues are implicated in functional specificity and structure–function coordinationSpecificity‑determining positions (SDPs) are groups of positions that coordinately mutate in the context of subfamily divergence.6De Juan, D.; Pazos, F.; Valencia, A. Nat Rev Genet 2013, 14, 249-61.

7. Co-evolution measurement7Local statistical model : calculate correlation of each residue pair (i, j) in the multiple sequence alignment independently.Mutual Information(MI):Global statistical model : Coupling of the pair i and j depends on the rest of the alignment. To compute a set of direct residue couplings that best explains all pair correlations observed in the multiple sequence alignment .Direct-coupling Information(DI):

8. Shortcomings of Local statistical model 8Correlation in amino acid substitution may arise from direct as well as indirect interactions. Local covariance methods are unable to distinguish between direct and indirect correlation. However:All direct interactions are contained in the local correlations.All detected correlations in substitutions are generated by the set of direct interactionsABCDirect InteractionDirect InteractionIndirect Interaction

9. Direct information VS. Mutual information9Intradomain contacts prediction using DI and MI pairs.

10. Direct information VS. Mutual information10Intradomain contacts (<=8Å) prediction using DI and MI pairs. Interdomain contacts prediction using DI and MI pairs.Morcos, F.; Pagnani, A.; Lunt, B.; Bertolino, A.; Marks, D. S.; Sander, C.; Zecchina, R.; Onuchic, J. N.; Hwa, T.; Weigt, M. Proc Natl Acad Sci U S A 2011, 108, E1293-301.Weigt, M.; White, R. A.; Szurmant, H.; Hoch, J. A.; Hwa, T. Proc Natl Acad Sci U S A 2009, 106, 67-72.

11. Principle of DCA11To find a minimal set of pair interactions that, through transitivity, will produce all the observed pair correlations. More precisely, to seek a general model, the full joint-probability distribution P(A1…AL), for a particular amino acid sequence A1…AL to be a member of the family under consideration that the marginals probability Pij(Ai,Aj) for pair occurrences are consistent with the observation of the MSA: Where:

12. Maximum-entropy Modeling12Information is the reduction of uncertainty.When you have only limited information, the best and safest guess is to model all that is known and assume nothing about the uncertainty. satisfy a set of constraints that must holdchoose the most “uniform” distribution Choose the one with maximum entropyConstraints:Maximum S: For our case:

13. Why Maximum-entropy Modeling can find the direct interactions(coupling)?13Actually, we are not finding the direct couplings!What we have observed in the MSA (Mutual information) is self-redundant.We are finding the minimum set of couplings that can deduce all observed couplings. The model that can reflect all our observations.But we reduce all pair-wise couplings to as low as possible. Thus, indirect couplings are removed! No addition assumption (information) was added to the system, thus our guess has the lowest risk!

14. Maximization of the entropy14Constraints:Our goal is Maximum S and keep all the constraints: Lagrange multipliersWhere Z is the partition function:MCMC samplingMessage passing samplingMean field approximation…

15. Example1: Use DI to predict protein-protein binding interface15Low MI implies low DI, but high MI does not necessarily imply high DI.

16. 16High DI pairs are physical interactions.Example1: Use DI to predict protein-protein binding interfaceWeigt, M.; White, R. A.; Szurmant, H.; Hoch, J. A.; Hwa, T. Proc Natl Acad Sci U S A 2009, 106, 67-72.Direct Information is inversely correlated with residue distance of pairs in the Spo0B/Spo0F cocrystal structure

17. Example2: Use DI to predict protein 3D structure without using template information17Top-ranked predicted structures can make correct contacts in the absence of constraints and avoid incorrect contacts in spite of false positive constraintsMarks, D. S.; Colwell, L. J.; Sheridan, R.; Hopf, T. A.; Pagnani, A.; Zecchina, R.; Sander, C. PLoS One 2011, 6, e28766.

18. How many distance constraints are needed for fold prediction?18

19. When would it have been possible to fold from sequence?19

20. Summary20Until recently, co-evolution information has not been effectively used. Local statistical models are not good enough.DCA is a powerful global statistical model to find direct interactions.However, statistical background noise (e.g. low statistical resolution in the empirical correlations due to an insufficient number of proteins in the family or due to global correlations from phylogenetic bias in the frequency counts) and functional constraints may not be spatially close, such as functional constraints imposed by protein-protein or protein-ligand interactions.But it can be used to improve the current structure prediction, refinement and identify potential binding proteins!Thank you very much!