This morning Context A Data Science Environment Data Science Studio Pilot Incubator Program Discussion 2 A 5year 378 million crossinstitutional collaboration 3 Establish a virtuous cycle ID: 934562
Download Presentation The PPT/PDF document "ß Data Science Incubator" is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.
Slide1
ß
Data Science Incubator
Slide2This morningContext: A Data Science Environment
Data Science StudioPilot Incubator Program
Discussion2
Slide3A 5-year, $37.8 million cross-institutional collaboration
3
Slide4Establish a virtuous cycle
6 working groups, each with
3-6 faculty from each institution
Slide5Pilot Program OrganizersAndrew Whitaker, Research Scientist
Dan Halperin, Director of Research, Scalable Data AnalyticsJake Vanderplas, Director of Research, Physical Sciences
Bill Howe, Associate Director
5
Slide6The Data Science StudioAn open collaborative research space
A resident data science teamPermanent staff of ~5 data scientists – applied research and development
~15-20 data science fellows (research scientists, visitors, postdocs, students) How to Engage:Drop-in open workspaceStudio “Office Hours”
Incubation Program
…plus seminars, sponsored lunches, workshops, bootcamps, joint proposals...
6
Slide77
6
th
floor Physics Astronomy Building
A partnership among …
Provost
UW Libraries
Physics, Astronomy,
Arts & Sciences
eScience Institute
Slide88
Estimated Timeline:
Design Phase Jan-June
Construction June – Sep
Target: October 1, 2014
Slide9Incubator Program OverviewGoal
: Create watercooler opportunities and scale our efforts by co-locating collaborations from different fields in the studio
Protocol: ~
1-page proposals for 1-quarter, on-site data science collaborations with usWhat we're looking for
:
Projects where fruitful collaboration is possible, with potential for significant impact, and that have sustained engagement
This meeting
: Pilot program for Spring Quarter to inform full launch Fall 2014.
9
http://data.uw.edu/incubator
Slide10Spring Incubator Pilot Program LogisticsApplications due online 3/10
Each proposal identifies a Project Lead (PL) The person doing the work, not the thesis advisorIncubator participants join the studio 2 days/week
Days decided collectively by participants and teamPilot program operates out of Sieg 326Milestones at 3, 6, 9 weeksblog posts + demo, visualization, IPython notebook, dataset, GitHub repo, preliminary results, etc.Networking/poster session during 9
th week
10
Slide11Areas of interestscalable data management and analytics
learning and predictive modelsinteractive visualizationparallel algorithms
code review, publishing, and reproducibilityonline teaching materials, tutorials
11
Slide12A Live SeaFlow Dashboard
12
Laser
Microscope Objective
Pine Hole
Lens
Nozzle
d1
d2
FSC
(Forward scatter)
Orange fluo
Red fluo
Francois Ribalet
Jarred Swalwell
Ginger Armbrust
Slide13SeaFlow AmbitionsSeaFlow is a huge success! NSF wants one on every R/V
13
Slide14SeaFlow AmbitionsUnderway biology
should enable adaptive sampling - a sort of “holy grail”
How can remote collaborators participate?What about citizen science?
14
“Wait! We saw a population
change
between
P3 and P4!”
“Let’s go back!”
Slide15A Live SeaFlow Dashboard
15
Is the instrument
working?
Where is the ship?
What is it doing?
What phytoplankton populations
are we seeing?
Slide16The AscotDB Project
16
A multi-year collaboration between UW Astronomy and UW Computer Science researchers and students
ASCOT = the
AS
tronomy
CO
llaborative
T
oolkit
Goal:
Provide an
interactive
and
collaborative
environment for analysis of astronomical data
.
Slide17The AscotDB Project
17
Interacting browser-based widgets for generating database queries & associated visualization.
The resulting visualizations can be shared with collaborators through a browser URL
Slide18Pilot cohort desideratagood clusteringalignment with sponsor and program goals
new directions, new questionsavailability, engagement, commitment“do only what we can only do together”
with apologies to Djikstraclarity and shovel-readinesscapacity for measurable outcomes
18
Slide19Spring Schedule3/10: Proposals due3/14: Follow-up requests3/21: Pilot participants notified
3/31: Spring program start date4/21: First milestone5/12: Second milestone6/2: Third milestone
6/6: Poster/networking event
19