/
Colocation Pattern Discovery Colocation Pattern Discovery

Colocation Pattern Discovery - PowerPoint Presentation

marina-yarberry
marina-yarberry . @marina-yarberry
Follow
396 views
Uploaded On 2017-07-10

Colocation Pattern Discovery - PPT Presentation

Zhe Jiang zjiangcsuaedu Colocation Pattern and Examples Colocation a set of spatial features that frequently occur in together Example Ecology symbiotic relationship in animals or plants ID: 568613

event colocation pattern instance colocation event instance pattern size prevalent spatial relationship table candidate types patterns based coarse threshold

Share:

Link:

Embed:

Download Presentation from below link

Download Presentation The PPT/PDF document "Colocation Pattern Discovery" is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.


Presentation Transcript

Slide1

Colocation Pattern Discovery

Zhe Jiang

zjiang@cs.ua.eduSlide2

Colocation Pattern and Examples

Colocation: a

set of

spatial features that frequently occur in together Example:Ecology: symbiotic relationship in animals or plantsPublic health: environmental factors and cancersPublic safety: crime generators and crime events

Nile Crocodiles and Egyptian Plover

Bar closing

events and

crimes

http://www.startribune.com/

http://

www.alamy.com

/Slide3

Basic Concepts

Spatial event type

Example: Bar closing, drunk driving

Spatial event instance Belong to an event type, associated with a locationExample: one specific drunk driving eventColocation pattern :A subset of spatial event types: (bar closing, drunk driving)

Instances of these event types frequently occur together

 Slide4

Basic Concepts

Neighbor relationship

Binary relationship on two event instances

Determined by adjacency or a distance threshold-proximity neighborhood

A clique of multiple event instancesAny pair of instances are neighbors under

Row instance of a colocation pattern

An

-proximity neighborhoodEach event type in appear only once

Table instance of a colocation pattern Collection of all row instances of

 Slide5

Basic Concept Example

Spatial event types

A, B, C

Spatial event instancesA.1, A.2, A.3,

… ...

Candidate Colocation(A, B), (B, C) …

Neighbor relationship (solid line)

(A.1, B.1), (A.1, C.2) …

Table instance of (A, B)(A.1, B.1)(A.2, B.4)(A.3, B.4)

Q: Table instance of (A, B, C)?Slide6

Interestingness Measure

Participation ratio

Given colocation pattern

Participation index

Example:

 

T1

T2

T3

A

B

C

A.1

B.1

C.1

A.2

B.2

C.2

A.3

B.3

C.3

A.4

B.4

B.5

T7

ABC

A.3

B.4 C.1

 

 

 

 Slide7

Problem Definition

Input:

A set of spatial event types

A table instance for each event type

Spatial neighbor relationship across instances

A participation index threshold

Find:

All colocation patterns

such that

 Slide8

Problem Example

 

Input:

Output:

{A,C} with

{B,C} with

 Slide9

Colocation Mining Algorithm: Baseline

Starting

with

Iterative until no prevalent patternGenerate size

colocation patterns {

Generate table instance of each

Compute each

, add to result if prevalent

 

T1

T2

T3

A

B

C

A.1

B.1

C.1

A.2

B.2

C.2

A.3

B.3

C.3

A.4

B.4

B.5

T4

T5

T6

A B

A C

B C

A.1

, B.1

A.1,

C.2B.2, C.1

A.2,

B.4

A.3, C.1

B.4, C.1

A.3,

B.4

B.5,

C.3

T7

A B C

A.3, B.4, C.1

A

B

C

AB

AC

BC

ABC

k=1

k=2

k=3

 

 

 

 Slide10

Colocation Mining Algorithm: Filter-Based

Starting

with

Iterative until no prevalent patternGenerate size

candidate patterns

from prevalent patterns

For each candidate

Check all subset patterns

If any subset pattern not prevalent,

prune out

Generate coarse table

instance of each remaining

Compute each

If

based on coarse resolution below threshold,

prune out

Generate table instance of each remaining

Compute each

, if above threshold, add

to set

 

Symbol

Description

candidate colocation of size k

all candidate colocation of size k

all prevalent colocation of size k

Symbol

Description

candidate colocation of size k

all candidate colocation of size k

all prevalent colocation of size kSlide11

Prevalence-based Pruning

Lemma (

apriori

property):If a colocation pattern is not prevalent, then any superset of

is also not prevalent

Example

 

T1

T2

T3

A

B

C

A.1

B.1

C.1

A.2

B.2

C.2

A.3

B.3

C.3

A.4

B.4

B.5

T4

T5

T6

A B

A C

B C

A.1

, B.1

A.1,

C.2B.2, C.1

A.2,

B.4A.3, C.1B.4, C.1

A.3,

B.4

B.5,

C.3

 

 

 

A

B

C

AB

AC

BC

k=1

k=2

Don’t need to check (A,B,C)Slide12

Multi-resolution Pruning

Key idea:

Overlay a grid of size h

Each grid cell is a coarse instance of event types inside itNeighbor relationship is imposed on the same cell or touching cellsPropertyParticipation index

based on coarse resolution is upper bound

of true valueCandidate pattern can be pruned if based on coarse resolution is below the threshold

 Slide13

Reference

[1] Huang

, Yan, Shashi

Shekhar, and Hui Xiong. "Discovering colocation patterns from spatial data sets: a general approach." IEEE Transactions on Knowledge and data engineering 16.12 (2004): 1472-1485.