/
Community detection via random walk Community detection via random walk

Community detection via random walk - PowerPoint Presentation

myesha-ticknor
myesha-ticknor . @myesha-ticknor
Follow
414 views
Uploaded On 2017-07-04

Community detection via random walk - PPT Presentation

Draft slides Background Consider a social graph GV E where V n and E m Girvan and Newmans algorithm for community detection runs in Om 2 n time and On 2 space The ID: 566426

distance random community walk random distance walk community communities probability walktrap based vertices time algorithm pons high detection graph

Share:

Link:

Embed:

Download Presentation from below link

Download Presentation The PPT/PDF document "Community detection via random walk" is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.


Presentation Transcript

Slide1

Community detection via random walk

Draft slidesSlide2

Background

Consider a social graph G=(V, E), where |V|= n and |E|= m

Girvan and Newman’s algorithm for community detection runs

in O(m2n) time, and O(n2) space.The Walktrap algorithm by Pons et al. computes a community structure (dendogram) in O(mnH) time, where H is the height of the dendogram – more scalable. The worst case is O(m2n) time.

HSlide3

Random walk

= probability that a random walk from j reaches

a neighbor

k, where A is the graph matrix (0-1) The probability of going fromi to j through a random walk oflength t is Slide4

Random Walk

If two

vertices are

in the same community, the probability then will surely be high. But the fact that is high does not necessarily imply that are in the same community.Slide5

Ward’s agglomerative clustering

Well known statistical method that estimates the distance

between two clusters C1 and C2 (see Wikipedia).

Walktrap uses this idea, but defines its own measure of distance basedon random walk and probability.Slide6

Random Walk

Intuition behind

Walktrap

Random walkers tend to get ‘trapped’ into densely connected parts (communities).Establish a distance measure between vertices (and between clusters) based on PtSlide7

Distance between nodes

Let

be two vertices

of the graph. Pons et al. defined distancewhere is the probability of reaching j from through a random walk of length t

High degree nodes trap

most

random

walksSlide8

Distance between two communities

Consider

the probability that a

random walk from a random vertex in community C to reach a vertex in steps. Call itThen the distance between two communities isSlide9

Distance between two communitiesSlide10

The Algorithm

Initially there is one partition

In each step, choose two communities

(based on the distance between them) and create a new partition Where is the new communityUpdate the distance between them. Slide11

Communities to merge

This choice plays a central role in the quality of the community detection. At each step

k

, merge two communities that minimize the mean of the squared distance between themNP-hard for a given kCan be reduced to theK—median problemLSlide12

Conclusion

Walktrap

(Pons et al.

) has been implemented in iGraph Library