Slide1
Mining Top-K Association Rules
Comparison
Group 11 (LEUNG Chung Yin)
Please go to the following link for the most updated version:
https://www.dropbox.com/s/2u6g3z6ichv3t6c/TopKRules.pptx?dl=0
Slide2
Once upon a time…
Philippe Fournier-Viger¹
Cheng-Wei Wu²
Vincent S. Tseng²
Slide3
Traditional way
Difficult to set minsup
⬇0.1: No Rules Found…
⬆0.1: >10000 Rules…
How to tune?
Read 10K rules?
Slide4
Traditional way
Find Frequent Itemsets (Big Task)
Find Association Rules using frequent itemsets (Another Big Task)
Ex. ABCD is frequent
A → BCD, B → ACD, …
AB → CD, AC → BD, …
Store every count?
Exponential!
Slide5
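To make the "Exponential!" point concrete, here is a minimal Python sketch that enumerates every rule obtainable from a single frequent itemset. The helper name `rules_from_itemset` is hypothetical and is not taken from the slides.

```python
from itertools import combinations

def rules_from_itemset(itemset):
    """Yield every rule LHS -> RHS that splits `itemset` into two non-empty parts."""
    items = sorted(itemset)
    for size in range(1, len(items)):          # LHS sizes 1 .. n-1
        for lhs in combinations(items, size):
            rhs = tuple(i for i in items if i not in lhs)
            yield lhs, rhs

rules = list(rules_from_itemset("ABCD"))
print(len(rules))  # 2**4 - 2 = 14 candidate rules from one 4-item itemset
```

An itemset with n items yields 2^n - 2 candidate rules, which is why generating rules from frequent itemsets is a big task of its own.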
TopKRules (Idea)
Scan DB: Record each item's tidset (set of transaction IDs)
Generate base rules r into L
Loop over all rules in L:
LeftExpansion
RightExpansion
Until L is empty
Slide6
TopKRules (Idea)
An auto-increased minsup
Slide7
TopKRules (Idea)
Always set minsup = lowest support of the top-k rules
Slide8
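The auto-increased minsup from the TopKRules (Idea) slides can be sketched as a small top-k container. This is a simplified illustration, not the authors' implementation: the class name `TopK`, the method `offer`, and the sample rules are all hypothetical, and ties at the minsup boundary are dropped here for brevity.

```python
import heapq

class TopK:
    """Keep the k best rules by support and raise minsup once the list is full."""

    def __init__(self, k):
        self.k = k
        self.heap = []    # min-heap of (support, rule)
        self.minsup = 0   # starts at 0 and only ever increases

    def offer(self, rule, support):
        if support < self.minsup:
            return                             # cannot enter the top k
        heapq.heappush(self.heap, (support, rule))
        if len(self.heap) > self.k:
            heapq.heappop(self.heap)           # drop the weakest rule
        if len(self.heap) == self.k:
            self.minsup = self.heap[0][0]      # lowest support among the top k

top = TopK(2)
for rule, sup in [("A->B", 3), ("A->C", 5), ("B->C", 4), ("C->D", 2)]:
    top.offer(rule, sup)
print(top.minsup, sorted(top.heap))
```

Because minsup rises as better rules arrive, later candidates such as "C->D" are rejected without the user ever choosing a threshold.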
TopKRules (Base)
Items: A, B, C, D
Base Rules (12 in total, 2 × C(4,2)):
A → B, A → C, A → D, B → C, B → D, C → D,
B → A, C → A, D → A, C → B, D → B, D → C
Slide9
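The base rules are simply all ordered pairs of distinct items, so they can be generated with `itertools.permutations`. A short sketch (the variable names are illustrative):

```python
from itertools import permutations

items = ["A", "B", "C", "D"]

# One base rule {x} -> {y} for every ordered pair of distinct items.
base_rules = [((x,), (y,)) for x, y in permutations(items, 2)]
print(len(base_rules))  # 4 * 3 = 12
```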
TopKRules (Base)
Ex. Rule A → B
Record 2 things:
tidset(A) ∩ tidset(B)
tidset(A)
Count Support Efficiently
(Covered Transactions)
(Evidenced Transactions)
Slide10
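A minimal sketch of counting support from tidsets, using a hypothetical four-transaction database. Using tidset(A) as the denominator for confidence is an assumption here; the slide only says that both sets are recorded.

```python
from collections import defaultdict

# Hypothetical toy database: transaction ID -> set of items.
db = {1: {"A", "B", "C"}, 2: {"A", "B"}, 3: {"A", "C", "D"}, 4: {"B", "D"}}

tidset = defaultdict(set)
for tid, items in db.items():        # a single scan of the database
    for item in items:
        tidset[item].add(tid)

both = tidset["A"] & tidset["B"]     # transactions containing A and B
support = len(both)                  # absolute support of A -> B
confidence = len(both) / len(tidset["A"])   # assumed use of tidset(A)
print(sorted(both), support, confidence)
```

After the initial scan, the support of any rule comes from set intersections alone, with no further pass over the database.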
TopKRules (Left Expansion)
Ex. Rule A → B
∀ distinct item c from covered transactions in Rule A → B
If c ∉ RHS AND c > all items in LHS
Then put A ∪ {c} → B into L
Slide11
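The left-expansion conditions can be sketched as follows. This assumes that the "covered transactions" of a rule are the transactions containing every item of the rule, and that items are ordered alphabetically; the toy database and the function name `left_expand` are hypothetical.

```python
# Hypothetical toy database: transaction ID -> set of items.
db = {1: {"A", "B", "C"}, 2: {"A", "B"}, 3: {"A", "C", "D"}, 4: {"B", "D"}}

tidset = {}
for tid, items in db.items():
    for item in items:
        tidset.setdefault(item, set()).add(tid)

def left_expand(lhs, rhs):
    """Return candidate rules (lhs ∪ {c}) -> rhs for every admissible item c."""
    # Assumption: covered transactions = those containing all items of the rule.
    tids = set.intersection(*(tidset[i] for i in lhs | rhs))
    seen = set().union(*(db[t] for t in tids))
    return [(lhs | {c}, rhs)
            for c in sorted(seen)
            if c not in rhs and all(c > i for i in lhs)]

candidates = left_expand(frozenset("A"), frozenset("B"))
print(candidates)
```

Here only C qualifies, giving AC → B. Right expansion is symmetric: swap the roles of LHS and RHS in the two conditions. A full implementation would also check each candidate against the current minsup before adding it to L.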
TopKRules (Right Expansion)
Ex. Rule A → B
∀ distinct item c from covered transactions in Rule A → B
If c ∉ LHS AND c > all items in RHS
Then put A → B ∪ {c} into L
Slide12
TopKRules (Duplicated Rules)
Duplication Examples:
Left Expand, then Right Expand: A → C ⇒ AB → C ⇒ AB → CD
Right Expand, then Left Expand: A → C ⇒ A → CD ⇒ AB → CD
Same!
Slide13
TopKRules (Duplicated Rules)
If a rule has been Right Expanded,
don't Left Expand it.
Slide14
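A small sketch of why this policy removes duplicates. Starting from A → C over the items A to D, it counts how often AB → CD is generated with and without the restriction. The function names and the `forbid_left_after_right` flag are hypothetical, and support checks are left out to keep the focus on the search order.

```python
def children(rule, items, forbid_left_after_right=True):
    """Yield the left and right expansions of `rule`."""
    lhs, rhs, right_expanded = rule
    for c in items:
        if c in lhs or c in rhs:
            continue
        may_left = not (forbid_left_after_right and right_expanded)
        if may_left and all(c > i for i in lhs):
            yield (lhs | {c}, rhs, right_expanded)   # left expansion
        if all(c > i for i in rhs):
            yield (lhs, rhs | {c}, True)             # right expansion

def count_hits(target, forbid):
    """Count how many times `target` is generated starting from A -> C."""
    frontier = [(frozenset("A"), frozenset("C"), False)]
    hits = 0
    while frontier:
        rule = frontier.pop()
        for child in children(rule, "ABCD", forbid):
            hits += child[:2] == target
            frontier.append(child)
    return hits

target = (frozenset("AB"), frozenset("CD"))
print(count_hits(target, forbid=False), count_hits(target, forbid=True))
```

Without the restriction AB → CD is reached twice (once per order of expansion); with it, only the path that left-expands first survives.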
In the same year…
Philippe Fournier-Viger¹
Vincent S. Tseng²
Slide15
Something weird…
Slide16
1. AB → C
2. AB → CD
3. A → CD
Slide17
1. AB → C
2. AB → CD
3. A → CD
Redundant!
Least Items
Most Items
Slide18
TNR
Top-K Non-redundant Rules
Slide19
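A redundancy test can be sketched as below. The slides do not spell out the definition, so this assumes the usual one: a rule is redundant with respect to another rule that has the same support and confidence, a smaller-or-equal LHS ("least items"), and a larger-or-equal RHS ("most items"). The `Rule` type and the support and confidence values are hypothetical.

```python
from typing import FrozenSet, NamedTuple

class Rule(NamedTuple):
    lhs: FrozenSet[str]
    rhs: FrozenSet[str]
    support: int
    confidence: float

def is_redundant(ra, rb):
    """True if ra is redundant with respect to rb (assumed definition)."""
    return (ra != rb
            and ra.support == rb.support
            and ra.confidence == rb.confidence
            and rb.lhs <= ra.lhs      # rb needs no more items on the left
            and ra.rhs <= rb.rhs)     # rb concludes at least as much

# Hypothetical measurements: all three rules share support and confidence.
r1 = Rule(frozenset("AB"), frozenset("C"), 2, 1.0)
r2 = Rule(frozenset("AB"), frozenset("CD"), 2, 1.0)
r3 = Rule(frozenset("A"), frozenset("CD"), 2, 1.0)
print(is_redundant(r1, r3), is_redundant(r2, r3), is_redundant(r3, r1))
```

Under these assumptions A → CD makes the other two rules redundant, since it says the most with the fewest conditions.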
TNR (Strategies)
New Rule A → B
TopKRules
Attempt to add
Slide20
TNR (Case1)
New Rule A → B
TopKRules
Throw away if redundant
Slide21
TNR (Case2)
New Rule A → B
TopKRules
Throw away a redundant rule and add the new one
Slide22
TNR (Case2)
Old Rule
Throw away a redundant rule and add the new one
Can we throw this away??
Slide23
TNR (Case2)
Can we throw this away??
The search space was already pruned using the old rule's minsup!
Slide24
TNR (Case2)
New Parameter Δ
k' = k + Δ
Slide25
TNR (Case2)
If # of Case2 > Δ,
then the result is an approximation
Slide26
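The role of Δ can be summarised in a few lines. The function name `tnr_budget` is hypothetical; it only restates the two slides above (search for k' = k + Δ rules, and treat the answer as exact only while Case 2 has occurred at most Δ times).

```python
def tnr_budget(k, delta, case2_count):
    """Return (k', exact) for a TNR run, as described on the Case2 slides."""
    k_prime = k + delta                 # enlarged list size used during the search
    exact = case2_count <= delta        # more than delta Case-2 events: approximation
    return k_prime, exact

print(tnr_budget(k=100, delta=10, case2_count=7))
print(tnr_budget(k=100, delta=10, case2_count=12))
```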
Results
Slide27
Results
Slide28
Results
TR is very fast, but full of redundant rules
TNR is slow, but has no redundant rules
Never get a good Δ…
For k > 2000, TNR is super slow.
Acceptable RAM usage
Slide29
Can we do better?
Slide30
ATNR
Approximate Top-K Non-redundant Rules
Slide31
ATNR (Reason)
Recall Case2 in TNR:
Redundant rules are added to the top-k list
Prevent it?
Search Space Reordering
Slide32
ATNR (Idea)
Fix LHS
Keep Right Expanding Until Not Frequent
Repeat
Reduce Case 2
Slide33
ATNR (Property)
Ex. Rule A → B
Support =
What if A is fixed?
Then support is monotonic (it can only shrink as the RHS grows)!
Slide34
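The monotonic property can be checked on a toy example: with the LHS fixed, every item added to the RHS can only keep or reduce the number of supporting transactions. The database and names below are hypothetical.

```python
# Hypothetical toy database: transaction ID -> set of items.
db = {1: {"A", "B", "C", "D"}, 2: {"A", "B", "C"}, 3: {"A", "B"}, 4: {"B", "D"}}

def support(lhs, rhs):
    """Number of transactions containing every item of lhs and rhs."""
    return sum(1 for items in db.values() if lhs | rhs <= items)

lhs = {"A"}                                      # fixed left-hand side
chain = [{"B"}, {"B", "C"}, {"B", "C", "D"}]     # growing right-hand side
supports = [support(lhs, rhs) for rhs in chain]
print(supports)  # [3, 2, 1]
```

This is what lets ATNR keep right-expanding a fixed LHS until the rule stops being frequent: once support falls below minsup, no further right expansion can recover it.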
ATNR (Problem)
minsup NOT set perfectly
Slide35
ATNR (Problem)
Reasons: (Like EM)
minsup does not increase immediately
Right Expand skips good minsup
An Approximation Again…
But NO need to tune Δ!
Slide36
Limitations
Measures Limited:
Lift/Interest/Gini
Slide37
Very Similar Papers…
2015 IJIRT | Volume 1 Issue 12 | ISSN: 2349-6002:
TECHNIQUE FOR MINING TOP-K ASSOCIATION RULES
http://www.ijirt.org/vol1/paperpublished/IJIRT102265_PAPER.pdf
2016 IJETST | Vol. 03 | Issue 01 | Pages 3491-3500 | January | ISSN 2348-9480:
An Efficient Algorithm to Mine Non Redundant Top K Association Rules
http://dx.doi.org/10.18535/ijetst/v3i01.12
Slide38
Thank You