/
The Complexity of Causality and Responsibility The Complexity of Causality and Responsibility

The Complexity of Causality and Responsibility - PowerPoint Presentation

danika-pritchard
danika-pritchard . @danika-pritchard
Follow
421 views
Uploaded On 2016-09-14

The Complexity of Causality and Responsibility - PPT Presentation

f or Query Answers and nonAnswers Alexandra Meliou Wolfgang Gatterbauer Katherine Moore and Dan Suciu httpdbcswashingtoneducausality 1 Motivating E xample Explanations ID: 466058

washington causality responsibility http causality washington http responsibility tuples queries query ptime linear contingency complexity counterfactual endogenous dichotomy conjunctive

Share:

Link:

Embed:

Download Presentation from below link

Download Presentation The PPT/PDF document "The Complexity of Causality and Responsi..." is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.


Presentation Transcript

Slide1

The Complexity of Causality and Responsibility

for Query Answers and non-Answers

Alexandra Meliou, Wolfgang Gatterbauer, Katherine Moore, and Dan Suciu

http://db.cs.washington.edu/causality/

1Slide2

Motivating

E

xample: Explanations

?

Query

IMDB Database Schema

Relevant lineage

:

137 tuples !!

“What genres does Tim Burton

direct

?”

http://db.cs.washington.edu/causality/

2Slide3

Example cont. (Musicals)

Ranking Provenance

i

mportant tuples

unimportant tuple

Goal:

Rank tuples in order of importance

http://db.cs.washington.edu/causality/

3Slide4

Solution: Causality

The fundamental question of causality:

“What is the cause of an effect?”Causality theory has long been studied in AI and philosophy.[Lewis73, EiterLucasiewicz02

, HalpernPearl05, Menzies08]Offers a metric (

responsibility

) for measuring the contribution of a variable to an outcome

ranking

[ChocklerHalpern04]

http://db.cs.washington.edu/causality/

4Slide5

Contributions

We suggest responsibility as an effective measure for ranking provenance.Explanations

Error tracingWe define causality and responsibility in a database context.Complete complexity analysis for computing causality and responsibility for the case of conjunctive queries without self-joinsInteresting dichotomy result.Non-trivial algorithm for computing responsibility in the PTIME cases.

http://db.cs.washington.edu/causality/

5Slide6

Endogenous/exogenous tuples

Partition the data into 2 groups:

Exogenous tuples

(denoted by )

tuples that we consider correct/verified/

trusted. They are

not

candidate causes

E.g.

the

Genre

, and Movie_Director tablesEndogenous tuples (denoted by )Untrusted tuples,

or simply of interest to the user. They are potential causesE.g.

the Director and Movie tables

http://db.cs.washington.edu/causality/

6Slide7

Counterfactuals

A variable is a counterfactual cause if a change in its value, changes the value of the result

E.g.Limitations: disjunctive causesE.g.

A and B are both

counterfactual

causes of C

http://db.cs.washington.edu/causality/

7Slide8

Contingencies

Generalize counterfactual causes

A contingency is a hypothetical setting of the endogenous variables that makes a tuple counterfactual

A is a cause under the contingency B=0

http://db.cs.washington.edu/causality/

8Slide9

Responsibility (intuition)

Measures the degree of causality, the contribution of a tuple

A larger contingency, means a tuple has smaller degree of causalityCounterfactual causes have the most contribution (empty contingency set)

http://db.cs.washington.edu/causality/

9Slide10

Causality for Conjunctive Queries

Definition:

Causality

(contingency)

Definition:

Responsibility

Intuition:

If the removal of t removes the answer, then t is counterfactual

If there is a set of tuples whose removal makes t counterfactual, t is a cause

Intuition:

The more tuples that need to be removed, the less important t is

(an answer to q)

(endogenous tuple)

(database)

(endogenous tuples)

http://db.cs.washington.edu/causality/

10Slide11

Example

Query:

Database:

Lineage expression:

(

Datalog

notation)

Responsibility:

Assume all endogenous

http://db.cs.washington.edu/causality/

11

NOTE:

If is exogenous,

is not a cause.Slide12

Complexity Results (Data Complexity)

dichotomy

a

nswers

n

on-answers

http://db.cs.washington.edu/causality/

12Slide13

Responsibility: PTIME Queries

Assume conjunctive queries with no self joinsA simple case:

The lineage of q will be of the form:

What is the responsibility of

PTIME

http://db.cs.washington.edu/causality/

13Slide14

Responsibility: PTIME Queries

More interesting:

easy

Intuition:

a cut in the graph interrupts the s-t flow. The addition of t re-instantiates it.

t becomes counterfactual

*

*

(R tuples)

(S tuples)

http://db.cs.washington.edu/causality/

14Slide15

Responsibility: Hard Queries

endogenous

If unspecified, it could be either

Theorem:

The following queries are NP-hard:

http://db.cs.washington.edu/causality/

15Slide16

Query Dual Hypergraph

Query

hypergraph

Query dual

hypergraph

Definition:

Linear Queries

There exists an ordering of the nodes of the dual

hypergraph

, such that every

hyperedge

is a consecutive subsequence.

Theorem:

Computing responsibility for all linear queries is in PTIME.

None of these are linear

http://db.cs.washington.edu/causality/

16Slide17

Weakenings

R is exogenous, and therefore its tuples cannot be part of the contingency set

Expand R with the domain of z. Responsibility of T tuples is not affected!

Dissociation

http://db.cs.washington.edu/causality/

17

PTIME

NP-hardSlide18

Responsibility Dichotomy

Dichotomy Theorem:

(data complexity)

If q is

weakly linear

, then computing responsibility for q is in PTIME

If

q

is

not

weakly linear, then it is NP-hard

Definition: Weakly Linear QueriesA query is weakly linear, if there exists a set of

weakenings that leads to a linear queryhttp://db.cs.washington.edu/causality/

18Slide19

Conclusions

Defined causality and responsibility for conjunctive queriesComplete complexity analysis for CQ without self-joins

Interesting dichotomy resultNon-trivial algorithm for PTIME casesOpen problem:Self-joinshttp://db.cs.washington.edu/causality/

19