The RECODE Project perspective Lorenzo bigagli and the recode team CNR IIA florence division Florence Division bigagliiiacnrit Centre International de Conférences de Genève CICG ID: 556378
Download Presentation The PPT/PDF document "Infrastructure and technology challenges..." is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.
Slide1
Infrastructure and technology challenges in Open Access to research data –The RECODE Project perspective
Lorenzo bigagli and the recode teamCNR IIA (florence division )
Florence
Division
bigagli@iia.cnr.it
Centre International de Conférences de Genève (CICG)
17, rue de
Varembé
Geneva (
Switerland
)
Tenth Plenary Session of the Group on Earth Observations (GEO-X) & Geneva Ministerial SummitSlide2
OutlineIntroduction
The RECODE ProjectKey findings on Infrastructure and Technology challengesOnline questionnaire, case studies interviews, literature review, workshopContacts and next datesSlide3
The RECODE projectPolicy RECommendations
for Open access to research Data in Europe (RECODE)1 February 2013 – 31 January 2015 (24 months)Eight partners across five countries
Grant agreement no: 321463Slide4
Objectives
Produce policy guidelinesSlide5
Case StudiesSlide6
Grand challengesSlide7
The GEOSS lesson Global Earth Observation System of Systems
EuroGEOSS EGIDACoordinating Earth and Environmental cross-disciplinary projects to promote GEOSSEGIDA MethodologySlide8
EGIDA MethodologyGeneral
approach for (re-) engineering national Science & Technology infrastructures for a sustainable contribution to GEOSSSystem of Systems (SoS) approachMobilization of resources made available by national, European and global initiatives Slide9
RECODE Stakeholder TaxonomyWe congregate the disseminator and curator
rolesWe assume they share similar concerns for our purposeCitizens may be considered as research data user/producers (cf. Citizen Science)We assume their involvement in use and production of data is mediated by appropriate applications (e.g. mobile apps) that practically isolate them from the implied technological and infrastructural
issuesSlide10
Online questionnaireClosed, concise questions~50 responses so far
Still openhttp://goo.gl/raafCFSlide11
Online questionnaireSlide12
InterviewsExperience with OA publications, but not with data publications or data preservationMetadata are considered crucial to enable retrieval, re-use and preservation of research data
Financial and legal barriers are considered higher priority then technical onesData management plans are being developed, but still at an early stageSolutions for data management and preservation are not common nor centralizedSlide13
EC Public Consultation on Open Research Data2
nd July 2013, in Brussels~130 attendants stakeholders from the research community, industry, funders, libraries, publishers, infrastructure developers, etc.Focused on five questionsHow can we define research data and what types of research data should be open?When and how does openness need to be limited?How should the issue of data re-use be addressed?Where should research data be stored and made accessible?How can we enhance data awareness and a culture of sharing?Slide14
H2020 Pilot on Open Access to Research DataRequirements
Detailed data management plan covering individual datasets (within 6 months)Deposit the research data, preferably into a research data repository Take measures to enable for third parties to access, mine, exploit, reproduce and disseminate (free of charge for any user) this research dataE.g., Creative Commons License (CC-BY, CC0)Provide information about tools and instruments at the disposal of the beneficiaries and necessary for validating the resultsE.g., specialized software, algorithms, analysis protocols. Where possible, they should provide the tools and instruments themselves Slide15
Big DataThe Big Data issue (the 4 V’s)Volume - data at rest
Velocity - data in motion (streaming)Variety - many types, forms and structures or no structuresVeracity - trustworthiness, provenance, lineage, qualityValidity - data that is correctVisualization - data in patternsVulnerability - data at riskValue - data that is meaningfulSlide16
Workshop questionsWhere should open research data be stored and made accessible?How can we mitigate the technological barriers to Open Access to research Data?
How can we cope with technological sustainability and obsolescence?What emerging technologies could be optimized to promote ease of deposit and retrieval of research data?Slide17
Infrastructure and Technology challenges – Overview
CreatorDisseminator / CuratorFunderEnd user
Heterogeneitystandardization,
semanticsinteroperability, reuse, organizational arrangements, i18ndiscoverability, accessibility, usabilitySustainabilityobsolescence, persistence (PID), preservation, reuse
preservation, persistence, energy footprint
obsolescenceVolume
bandwidthbandwidth, storage, scalability (data deluge)scalability
discoverability, accessibility, bandwidthQuality
description, training, fitness for usedescription, completeness, peer
reviewcertification
description, peer review, fitness for useSecurityauthorization, licensingaccountinglicensing
authentication, privacy, trustSlide18
Inputs from the workshopImportance of metadata
For quality, etc.Heterogeneity is here to stayHeterogeneity of user requirements makes it difficult to identify a unique solution to be adoptedTo build on the existing research infrastructure, supplementing and not supplanting itTo be flexible applying extensible technological and organizational solutionsTo share does not mean to give everything to everyoneTo distinguish among different
approaches to Open Access
To discuss new professional roles stemming from the Data Science and Open Data in particularSlide19
Contacts and next dates http://recodeproject.eu/
@RECODE_Project3rd RECODE Workshop “Ethical and legal issues in open access to research data”14 March 2014 Meertens Institute, Joan Muyskenweg 25, 1096 CJ Amsterdam (NL) Slide20
Fill in our questionnaire! http
://goo.gl/raafCFSlide21
Visit us at the Exhibition!RECODE booth (booth 15)
GEO Members Area
GEO Participant
Organisations Area
GEO Projects Area
European Commission:
RTD + JRC
European Commission:
Copernicus
ESA
GCI Test Area
RECODE boothSlide22
Contribute to our session!European Geosciences Union General Assembly Meeting 2014
Vienna, 27 April-2 May 2014ESSI 2.13 - Open Access to Research Data and Public Sector Information: perspective, drivers, and barriersConvener: Lorenzo Bigagli Co-Conveners: Stefano Nativi , Paolo Mazzetti , Kush Wadhwa , José Miguel Rubio IglesiasCall for Abstract deadline:January 16th 2014, 13:00 CETSlide23
Infrastructure and technology challenges in Open Access to research data – the RECODE Project perspectiveLorenzo bigagli
and the recode teamCNR-IIA (florence division )
Florence
Division
bigagli@iia.cnr.it
Centre International de Conférences de Genève (CICG)
17, rue de
Varembé
Geneva (
Switerland
)
Tenth Plenary Session of the Group on Earth Observations (GEO-X) & Geneva Ministerial SummitSlide24
Extra slidesSlide25
Online questionnaireSlide26
Online questionnaireSlide27
Online questionnaireSlide28
Online questionnaireSlide29
Online questionnaireSlide30
Online questionnaireSlide31
Online questionnaireSlide32
Online questionnaireSlide33
Online questionnaireSlide34
EC Public Consultation on Open Research DataHow can we define research data and what types of research data should be
open?Definitions vary, with some contributions defining research data as potentially all data (including public sector information), and some limiting it to data that is the product of researchFor researchers, research data includes all data from an experiment, study or measurement, including metadata and details on processing dataFor publishers, data linked to publications is part of the publicationSlide35
EC Public Consultation on Open Research DataHow should the issue of data re-use be addressed
?Discussions about licensing, but also about technical aspects of open research data Not just on whether and how data should be re-used, but also on the adequacy of e-infrastructures for data re-useDirective on the re-use of public sector information (2003/98/EC, currently under revision) was mentioned several timesWhile public sector information (PSI) is distinct from research data and governed by a specific directive, it is important to remember that this type of information can also be useful for researchSlide36
EC Public Consultation on Open Research DataWhere should research data be stored and made accessible
?The need for improved data management practices and better data accessibility is a key concernIssues closely linked with data preservation and sustainability of data repositoriesThe readiness of professionals to engage in data curation was also highlightedAll stakeholders agreed that any funding body policy on open research data must call on researchers to take the issue of data management seriously by developing data management plans (DMPs) for their research projectsSlide37
H2020 Guidelines on Data ManagementScientific research data should be easily Discoverable
AccessibleAssessable and intelligibleUseable beyond the original purpose for which it was collectedInteroperable to specific quality standards