Data Assembly Center Rob Bochenek Axiom Data Science Define cyberinfrastructure Talk about data lifecycle Animal t elemetry c ase s tudies Summary The National Science Foundation defines cyberinfrastructure as ID: 784700
Download The PPT/PDF document "Architecting the Next Generation ATN" is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.
Slide1
Architecting the Next Generation ATN Data Assembly Center
Rob Bochenek - Axiom Data Science
Slide2Define cyberinfrastructure
Talk about data lifecycle
Animal telemetry
case
studies
Summary
Slide3The National Science Foundation defines cyberinfrastructure as:
In scientific usage, cyberinfrastructure is a technological and sociological solution to the problem of efficiently connecting laboratories, data, computers, and people with the goal of enabling derivation of novel scientific theories and knowledge.
Understanding of the existing community developed data standards, protocols and software
Scalable compute and storage infrastructure (HPC)
Human capacity - data scientists, data librarians, data coordinators, software engineers…
Science community that can benefit from support
Supporting ATN through Cyberinfrastructure
Slide4Data Lifecycle
DATA CREATION & QUALITY CONTROL
Science/Lab Teams
DATA STORAGE
CENTRALIZATION
& ORGANIZATION
DATA DESCRIPTION
Metadata
PUBLICATION &
ARCHIVE
DOI Generation &
Repository submission
DATA ACCESS & DISCOVERY
Data portals & search catalogs
REUSE & TRANSFORMATION
Synthesis
Slide5Data Lifecycle
DATA CREATION & QUALITY CONTROL
Science teams
DATA STORAGE
Upload to Workspace or Tag Manufacturer Integration
DATA DESCRIPTION
Metadata Editor
ARCHIVE & PRESERVATION
Repository submission pathway
DATA ACCESS & DISCOVERY
Data portals & search catalogs
REUSE & TRANSFORMATION
Jupyter Notebook & data analyses
Slide6Organize into projects, research campaigns and organizations
Coordinate data exchange across networks, groups, programsISO 19110/19115-2 standards metadata editor
Execute server side R and Python numeric workflows (Jupyter) on uploaded data AND any data in IOOS stack (State Space Models and other Analysis)
Archive pathway to DataONE, NCEI & DOI minting (Emerging)
Slide7Examples and Case Studies
Marine Arctic Ecosystem Study (MARES)
AFSC Marine Mammal Lab
Florida Atlantic Coast Telemetry Project
Slide8Slide9Slide10Slide11Slide12Slide13Demos
https://docs.google.com/document/d/1UWZaTcIYXjsJahFkPE-vd3RIINZZBcQ-T8b01bBRcx4/edit?usp=sharing