/
Decomposition for Reasoning with Biological Network Decomposition for Reasoning with Biological Network

Decomposition for Reasoning with Biological Network - PowerPoint Presentation

anastasia
anastasia . @anastasia
Follow
28 views
Uploaded On 2024-02-02

Decomposition for Reasoning with Biological Network - PPT Presentation

Gauvain Bourgne Katsumi Inoue ISSSB11 Shonan Village November 13 th 17 th 2011 Motivation In bioinformatics need to reason on huge amount of data Huge networks eg metabolic ID: 1043677

problem decomposition clausal based decomposition problem based clausal problems algorithmautomated results cycle overviewreasoning taskpartition theory decompositionexperimental set carc partition

Share:

Link:

Embed:

Download Presentation from below link

Download Presentation The PPT/PDF document "Decomposition for Reasoning with Biologi..." is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.


Presentation Transcript

1. Decomposition for Reasoning with Biological NetworkGauvain Bourgne, Katsumi InoueISSSB’11, Shonan Village, November 13th -17th 2011

2. MotivationIn bioinformatics, need to reason on huge amount of dataHuge networks (e.g. metabolic pathways, signaling pathways…)On such problems, centralized methodsLong computation timeMemory overflowProblem decompositionDivide into smaller problems or steps to recompose a global solutionNeed for (1) an automated process to decompose and (2) an algorithm to solve local problems and recompose global solution2Automated Problem Decomposition /33

3. Example Problem (Krebs Cycle)3succinateformaldehydecreatininecreatinebeta-alanine2-oxe-glutaratel-lysinel-2-aminoadipateisocitratetrans-aconitatetaurinenmndnmnahippurateformatesarcosinel-ascitrullineornithinearginineureamethylaminetmaolactateglucoseacetateacryloyl-coapyruvateFumaratefumarate2.6.1.391.1.1.422.3.1.614.2.1.34.2.1.21.3.99.11.13.11.162.1.1.12.1.1.76.3.4.52.1.3.32.1.1.23.5.3.13.5.3.33.5.2.101.5.99.11.1.99.81.4.99.34.1.2.324.2.1.544.3.1.62.1.3.14.1.1.202.6.1.141.2.1.31glycolisis1.1.1.274.3.2.13.5.1.592.6.1.-acetylcoa2.3.3.11.2.4.16.2.1.1citrate3Automated Problem Decomposition /33

4. Example Problem (Krebs Cycle)4succinateformaldehydecreatininecreatinebeta-alanine2-oxe-glutaratel-lysinel-2-aminoadipateisocitratetrans-aconitatetaurinenmndnmnahippurateformatesarcosinel-ascitrullineornithinearginineureamethylaminetmaolactateglucoseacetateacryloyl-coapyruvateFumaratefumarate2.6.1.391.1.1.422.3.1.614.2.1.34.2.1.21.3.99.11.13.11.162.1.1.12.1.1.76.3.4.52.1.3.32.1.1.23.5.3.13.5.3.33.5.2.101.5.99.11.1.99.81.4.99.34.1.2.324.2.1.544.3.1.62.1.3.14.1.1.202.6.1.141.2.1.31glycolisis1.1.1.274.3.2.13.5.1.592.6.1.-acetylcoa2.3.3.11.2.4.16.2.1.1citrateAg2Ag0Ag4Ag1Ag3Ag54.2.1.21.1.1.424.1.1.202.3.3.14.3.1.62.1.3.12.1.3.33.5.3.11.5.99.11.3.99.14Automated Problem Decomposition /33

5. OverviewReasoning taskPartition-based algorithmAutomated decompositionExperimental evaluationConclusion5Automated Problem Decomposition /33

6. OverviewReasoning taskPartition-based algorithmAutomated decompositionExperimental evaluationConclusion6Automated Problem Decomposition /33

7. Logical representationMetabolic pathways: set of reactions Ri: Ri: m1,m2,…,mp  p1,p2,…,pn Such reactions can be represented as an activation rule¬m1v¬m2v…v¬mp v Ri n production rules¬Ri v p1¬Ri v p2…¬Ri v pn Clausal theory7Automated Problem Decomposition /33

8. Problems(Conditional) accessibility problemsSources (si), Conditional sources (ci), Targets (ti)Find which ti can be produced from si, possibly with the addition of ci as a new sourceFind all consequences of the form ¬civ…v¬ckv tjExtraction of sub-networksPathways completion (abduction)Find reactions (set of clauses)Hypothesis on state of reaction given experiments Consequence finding (with specific form)8Automated Problem Decomposition /33

9. Main reasoning taskConsequence Finding (CF) in clausal theoriesInputA clausal theory TA production field P=<L,Cond>L is a list of literalsCond is a condition (maximal length of the consequences, or number of occurrences of some literals)Output All the consequences of T that are subsumption-minimal and belongs to P (formed with literals of L respecting condition Cond). Carc(T,P)9Automated Problem Decomposition /33

10. OverviewReasoning taskPartition-based algorithmAutomated decompositionExperimental evaluationConclusion10Automated Problem Decomposition /33

11. Partition-based CFThe taskConsequence Finding (CF) in clausal theoriesInputA set of clausal theory Ti such that UTi=T, and a set of reasoners ai associated with each partitionA production field P=<L,Cond>Output Carc(T,P)WhereThe output should be produced through local computations and interactions between reasoners (message exchange) 11Automated Problem Decomposition /33

12. Partition-based Consequence FindingGeneralization of Partition-based Theorem Proving [Amir & McIlraith, 2005]Based on Craig’s Interpolation Theorem: If C entails D, then there is a formula F involving only symbols common to C et D such that C entails F and F entails D.PrinciplesIdentify common symbols (communication languages)Build a tree structure (cycle-cut)Forward relevant consequences from leaf to rootCDF12Automated Problem Decomposition /33

13. Communication languagesGraph induced from the partitionProblem : eliminate cycles from it while ensuring a proper labeling.Cycle-cutWhile (G not acyclic)Take a minimal cycle S=(i1,i2),(i2,i3),…,(ip,i1).Choose (i,j) in S s.t. is minimalFor each (q,r)≠(i,j) in S, l(q,r)l(q,r)Ul(i,j)Remove (i,j) from EabcbfgadeacdfaacbfadbbAutomated Problem Decomposition /3313

14. Forward Message-passing Algorithm(Sequential)PreprocessingDetermine initial l(i,j)Apply Cut-cyclesDetermine PiNon-root agents ai (with parent aj): Pi=<LUl(i,j)>Root ak: Pk=PConsequence-FindingFrom leaves to rootDetermine Cni=Carc(∑i,Pi)Forward CniCarcCarcCarcCarc14Automated Problem Decomposition /33

15. Parallel VariantCarcCarcCarcCarcNewcarcNewcarcIncremental computations: Newcarc(TUC,P)=Carc(TUC,P)\Carc(T,P)15Automated Problem Decomposition /33

16. OverviewReasoning taskPartition-based algorithmAutomated decompositionExperimental evaluationConclusion16Automated Problem Decomposition /33

17. Decomposition of clausal theoriesGiven a Clausal Theory TFind a set of partitions Ti, such thatUTi=TReasoning is easier ie the application of partition-based algorithm to this decomposition is as efficient as possible.Minimize the size of the communication languagesEnsure that some simplification can be done locally Partitions should be cohesive and loosely coupled. 17Automated Problem Decomposition /33

18. c1: ¬b∨c∨e∨fc2: ¬a∨d∨ec3: ¬d∨g∨hc4: ¬e∨gc5: ¬g∨¬h∨ic2c1c4c3c5adhigecfbc2c1c4c3c5adhigecfbc2c1c4c3c5eedg,hgGraph representationClausal theory can be represented as graphFocus on common symbols18Automated Problem Decomposition /33c2c1c4c3c511121

19. ArchitectureInitial Theory.sol fileReduced graph representationPartitioned graphPartitioned clausal theory.dcf fileRootSolutionkmetisNumber of partitionsPartition-based CFbuildGraphgraph2dcfRoot choice heuristicChoose root with maximal average clause size19Automated Problem Decomposition /33

20. Problem Decompositionsuccinateformaldehydecreatininecreatinebeta-alanine2-oxe-glutaratel-lysinel-2-aminoadipateisocitratetrans-aconitatetaurinenmndnmnahippurateformatesarcosinel-ascitrullineornithinearginineureamethylaminetmaolactateglucoseacetateacryloyl-coapyruvateFumaratefumarate2.6.1.391.1.1.422.3.1.614.2.1.34.2.1.21.3.99.11.13.11.162.1.1.12.1.1.76.3.4.52.1.3.32.1.1.23.5.3.13.5.3.33.5.2.101.5.99.11.1.99.81.4.99.34.1.2.324.2.1.544.3.1.62.1.3.14.1.1.202.6.1.141.2.1.31Glycolisis path1.1.1.274.3.2.13.5.1.592.6.1.-acetylcoa2.3.3.11.2.4.16.2.1.1citrateag1ag3ag2ag5ag4ag020Automated Problem Decomposition /33

21. OverviewReasoning taskPartition-based algorithmAutomated decompositionExperimental evaluationConclusion21Automated Problem Decomposition /33

22. Benchmark ProblemsBiological networksTPTP problemsProduction field : Vocabulary of conjecture (+ removing conjecture)Full vocabulary with length limitSAT problemsProduction fieldBased on frequency of literalsN% most/less frequent literalsSizeProblems still not tractable as CF problemsSolving only a cohesive sub-problem (obtained by partition of the clause graph)22Automated Problem Decomposition /33

23. Problems characteristics23Automated Problem Decomposition /33

24. Results – Biological Networks2 682 252 (3 321 857)24Automated Problem Decomposition /33

25. Results – SAT problems25Automated Problem Decomposition /33

26. Results – TPTP problems26Automated Problem Decomposition /33

27. Results - summary27Automated Problem Decomposition /33

28. Results - summary28Automated Problem Decomposition /33

29. ResultsFor almost all problems, decomposition can reduce the number of resolve operations needed.Especially, it can solve some problems that could not be solvedTime is no often improvedDue to communication time (parsing, and such)Approached decomposition with metis: ok.Root choice heuristic: still insufficient, though not bad for biological networks problems.Automated Problem Decomposition /3329

30. OverviewReasoning taskPartition-based algorithmAutomated decompositionExperimental evaluationConclusion30Automated Problem Decomposition /33

31. ConclusionA sound and complete algorithm combined with automated problem decompositionCan increase efficiency (nb of operation) for almost all problemsBut, results dependent on the choice of root31Automated Problem Decomposition /33

32. Future worksPartition-based algorithmVariant for Newcarc computationsCommon Theories for 1st order representationsOrdered partitions to break cycle (without removing links)DecompositionDirectly from metabolic pathwayRoot choice heuristicLearning preference relation on root choiceChoosing the number of partition32Automated Problem Decomposition /33

33. Thank you for your attentionAny question ? /3333Automated Problem Decomposition