Collaborators on a Research Grant vs Those Posting on SASL Larry Hoyle Institute for Policy and Social Research University of Kansas 1 SGF2009 paper 229 Larry Hoyle Visualize These Data SGF2009 paper 229 Larry Hoyle ID: 760458
Download Presentation The PPT/PDF document "Visualizing Two Social Networks Across T..." is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.
Slide1
Visualizing Two Social Networks Across Time with SAS®:Collaborators on a Research Grant vs. Those Posting on SAS-L
Larry HoyleInstitute for Policy and Social ResearchUniversity of Kansas
1
SGF2009 paper 229, Larry Hoyle
Slide2Visualize These Data
SGF2009 paper 229, Larry Hoyle
2
Nodes
Links
Slide3A Social Network
SGF2009 paper 229, Larry Hoyle
3
Slide4Constellation Chart: Nodes
SGF2009 paper 229, Larry Hoyle
4
Nodes Have:Size (age)Color(gender)Tip (text)
Slide5Constellation Chart Links
SGF2009 paper 229, Larry Hoyle
5
Links Have:Width (Hours)Color(family)Tip (text)
Slide6Social Network Graph
SGF2009 paper 229, Larry Hoyle
6
Two SAS tools:
Constellation Chart Applet (and Macro)
Annotate File
Slide7Constellation Chart Slider
SGF2009 paper 229, Larry Hoyle
7
Slider set to show only links with 19 or more hours spent together
Slide8Constellation Chart Slider
SGF2009 paper 229, Larry Hoyle
8
Slider set to show only links with 14 or more hours spent together
Slide9Constellation Code
title 'Mean Hours Spent Together';%ds2const( ndata=Flints, ldata=FlintTimes, datatype=assoc, minlnkwt=30, height=360, width=480, codebase=&jarpath, htmlfile=&outfile, colormap=y, fntsize=12, nid=Person, nlabel=Person, nvalue=age, ncolor=gender, ncolfmt=Gcolor., ntip=ntip, lfrom=PersonFrom, lto=PersonTo, lvalue=MeanHours, linktype=line, lcolor=linktype, lcolfmt=Lcolor., ltip=ltip, sclnkwt=N);
SGF2009 paper 229, Larry Hoyle
9
Files
Appearance
Nodes
Links
Slide10Two Different Sets of DataEach With Their Own Challenges
SAS-L (the SAS Listserv)Nodes are email addresses of posts (23,827)Links are posts to the same thread in the same year (267,209 messages to 82,279 threads ).Kansas NSF EPSCoR GrantNodes are projects and nodes are peoplePeople have different roles (PI, researcher, support staff)Multiple types of links, together on:authorship, proposals, listed together in narrativeChanges across time
SGF2009 paper 229, Larry Hoyle
10
Slide11SAS-L Data – Available on the Web
SGF2009 paper 229, Larry Hoyle
11
Linked-
posting to the same thread
Data Cleaning –
Addresses Change
Slide12SAS-L - Too Many Nodes for AppletApproach: Limit the number of nodes
SGF2009 paper 229, Larry Hoyle
12
Slide13SAS-L Those With Over 100 Posts
SGF2009 paper 229, Larry Hoyle
13
Slide14Most Links are With a Core Group
SGF2009 paper 229, Larry Hoyle
14
Slide15Too Many Nodes for AppletApproach: Display All w/ SAS Annotate File
SGF2009 paper 229, Larry Hoyle
15
Slide16SAS Annotate File – Arrange Nodes
SGF2009 paper 229, Larry Hoyle
16
How do you arrange the nodes in some meaningful way?
All Nodes Around a Circle orMultidimensional Scaling of some or all nodes
proc
mds
data=SGF2009.
TOPPOSTERSSIMILARITY
out=SGF2009.TopPosters2D
similar
dimension = 2
level=ordinal;
run;
Slide17Problem: MDS on 23K nodes?
SGF2009 paper 229, Larry Hoyle
17
Scale the nodes with the most links
(shown in red)Arrange the others randomly in a circle around them (shown in gray)Links to red nodes in blue, others in black
Slide18Zoom and Pan With Applet
SGF2009 paper 229, Larry Hoyle
18
With annotate – Vector output (E.G.) RTF would allow zoom, but not tip on links
Slide193D with PROC G3D and AnnotateActiveX and Java Devices Only
SGF2009 paper 229, Larry Hoyle
19
Slide203D with PROC G3D and AnnotateGenerated in SAS 9.2
SGF2009 paper 229, Larry Hoyle
20
Slide213D with PROC G3D and AnnotateGenerated From EG 4.1
SGF2009 paper 229, Larry Hoyle
21
Slide223D with PROC G3D and AnnotateActiveX and Java Devices Only
SGF2009 paper 229, Larry Hoyle
22
Slide23Kansas NSF EPSCoR Phase VVisualization Needs
Show relationships among 247 people And among 50 projectsShow change in collaboration across timeDifferentiate core peopleDifferentiate principal investigators (Pis)Differentiate institutionsAnimate across time
SGF2009 paper 229, Larry Hoyle
23
Slide24Projects Layer Arranged by People in Common Across all Years
SGF2009 paper 229, Larry Hoyle
24
Slide25Core People Layer Arranged by Centroid of Projects to Which They Belong
SGF2009 paper 229, Larry Hoyle
25
Slide26People and Links
SGF2009 paper 229, Larry Hoyle
26
People
Color indicates institution
White dot is Principal Investigator
Size is count (e.g. publications)
Large tan dot indicates core person
Links
Width represents count in common
Slide27People in Fixed Positions Allows Animation Across Time (2006)
SGF2009 paper 229, Larry Hoyle
27
Slide28People in Fixed Positions Allows Animation Across Time (2007)
SGF2009 paper 229, Larry Hoyle
28
Slide29People in Fixed Positions Allows Animation Across Time (2008)
SGF2009 paper 229, Larry Hoyle
29
Slide30Other Comparisons – All Proposals and Submissions
SGF2009 paper 229, Larry Hoyle
30
Slide31Other Comparisons – Successful Proposals
SGF2009 paper 229, Larry Hoyle
31
Slide32Other Comparisons – Proposals
SGF2009 paper 229, Larry Hoyle
32
Slide33Other Comparisons – Scientific Product
SGF2009 paper 229, Larry Hoyle
33
Slide34Other Comparisons – Combined
SGF2009 paper 229, Larry Hoyle
34
Slide35Method Comparisons
AppletCoding is QuickSliderLink TipsMemory LimitsScreen Capture to PublishDynamic Pan and ZoomData Driven Color and Size
AnnotateAdditional Data StepsAnimated GIFHTML Link Tips (Difficult)Many Nodes PossibleHigh Quality ReproductionNo Tips (ODS Vector Output)Richer Symbology
SGF2009 paper 229, Larry Hoyle
35
Slide36Animation Issues – Fix Node Position
Fix the position of nodes across all framesArrange in circleDimension reduction (MDS?)Example: KNEGIF.htm
SGF2009 paper 229, Larry Hoyle
36
Slide37Animation Issues - Interpolation
Dimension reduction that preserves orientation - then interpolate between observations SAS Example:could do something likeKansas Data Archive Bubble Plots
SGF2009 paper 229, Larry Hoyle
37
Chart from http://www.ipsr.ku.edu/ksdata/Inspired by Trendalyzer Software http://www.gapminder.org
Slide38Other Tools
SAS Graph NV WorkshopEnterprise Miner See paper 109-2009 Barry de Ville, Discover and Drive Brand Activity in Social Networks
SGF2009 paper 229, Larry Hoyle
38
Slide39Statistics - Clustering
Clustering CoefficientGlobal Proportion of triads that have third link
SGF2009 paper 229, Larry Hoyle
39
B
A
C
?
When BA and BC are present,
Is AC present?
Slide40Statistics - Betweenness
Betweenness CentralityIndividualSum of proportion of shortest paths that go through a given link
SGF2009 paper 229, Larry Hoyle
40
w
x
v
y
z
Contributing to Centrality
for v –
wvz
and
wxz
– v is central 1 of 2 shortest w-z paths
Slide41Statistics - Betweenness
Betweenness CentralityIndividualSum of proportion of shortest paths that go through a given link
SGF2009 paper 229, Larry Hoyle
41
w
x
v
y
z
Contributing to Centrality
for v –
wvz and wxz – v is central in 1 of 2 shortest w-z paths
wvy
- v is central in 1 of 1 shortest w-y paths
Slide42Statistics - Betweenness
Betweenness CentralityIndividualSum of proportion of shortest paths that go through a given link
SGF2009 paper 229, Larry Hoyle
42
Contributing to Centrality for v – wvz and wxz – v is central in 1 of 2 shortest w-z pathswvy - v is central in 1 of 1 shortest w-y pathswx – v is central in 0 of 1 shortest w-paths
w
x
v
y
z
Slide43Questions?
Larry HoyleLarryHoyle@ku.edu
SGF2009 paper 229, Larry Hoyle
43