Slide1
Developing our Metadata: Technical Considerations & Approach
Ray Plante
NIST
4/14/16
NMI Registry Workshop BIPM, Paris
…don’t worry ;-)
or How we concentrate on concepts
Slide2
Creating & Curating Records
Descriptions of an NMI's data assets will be stored in a registry
An NMI will be able to create and update their own records
  Can operate own registry or use a remote one
  NIST can provide a registry application, or NMI can create or adapt their own
    e.g. to connect their local infrastructure
Options will be described tomorrow
Slide3
Collecting Records for Searching
Propose using OAI-PMH as the protocol for exchanging metadata
  Community standard
  Widely used (including in the Virtual Observatory)
  Well supported by open software
Searchable Registry
  Wants to collect records for all data resources from all NMIs
  Uses OAI-PMH to pull the records from the NMIs
  Provide a means to search
    Web page GUI
    Scriptable (REST) interface
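The harvesting step described on this slide can be sketched roughly as follows. This is a minimal illustration, not the registry's actual implementation: it assumes the standard OAI-PMH `ListRecords` verb, the `oai_dc` metadata prefix, and resumption tokens for paging. The function names, the base URL, and the injected `fetch` callable are hypothetical.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespace of OAI-PMH 2.0 response elements
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"


def list_records_url(base_url, metadata_prefix="oai_dc", resumption_token=None):
    """Build an OAI-PMH ListRecords request URL (function name is hypothetical)."""
    if resumption_token:
        # resumptionToken is an exclusive argument: it is sent with the verb only
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def parse_list_records(xml_text):
    """Return (record identifiers, resumption token or None) for one response page."""
    root = ET.fromstring(xml_text)
    identifiers = [el.text for el in root.iter(OAI_NS + "identifier")]
    token_el = root.find(f"{OAI_NS}ListRecords/{OAI_NS}resumptionToken")
    # An empty or absent resumptionToken element means there are no more pages
    token = token_el.text if token_el is not None and token_el.text else None
    return identifiers, token


def harvest(base_url, fetch):
    """Pull all record identifiers; `fetch` maps a URL to the response text."""
    identifiers, token = [], None
    while True:
        url = list_records_url(base_url, resumption_token=token)
        page_ids, token = parse_list_records(fetch(url))
        identifiers.extend(page_ids)
        if token is None:
            return identifiers
```

Passing `fetch` in as a parameter keeps the sketch testable without a network; in practice it could wrap `urllib.request.urlopen`, and a searchable registry would run one such harvest per publishing registry.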
Slide4
NMI Registry Federation
[Diagram: several NMIs, each with datasets, databases, portals, or a data repository, and each running either a publishing registry or a full searchable registry. Labeled flows: "harvest (pull)" and "manual entry (push)". A "Registry of Registries" also appears.]
Slide5
Record Format
We will eventually decide the record encoding format
  Leading choices: XML, JSON, JSON-LD
  Choice is not critical
    At NIST, we have been developing conventions for defining schemas in all forms with mechanisms to convert between them as needed.
  Is there an opportunity to leverage local infrastructure, tools by picking a particular format?
Today, we want to concentrate on…
  What kinds of data resources we want to discover
  What concepts are needed to describe them
  Which concepts are important for discovering resources through a query
  What information we need in order to access and use them
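To illustrate why the choice of record encoding need not be critical, here is a small sketch that serializes one record to both JSON and XML and reads it back. It is not the NIST conversion mechanism mentioned above; the field names (`title`, `publisher`, `subject`), the `resource` root tag, and the helper functions are all assumptions made for illustration.

```python
import json
import xml.etree.ElementTree as ET


def to_json(record):
    """Encode a flat record (string or list-of-string values) as JSON."""
    return json.dumps(record, sort_keys=True)


def to_xml(record, root_tag="resource"):
    """Encode the same record as XML, one child element per value."""
    root = ET.Element(root_tag)
    for key, value in record.items():
        values = value if isinstance(value, list) else [value]
        for item in values:
            ET.SubElement(root, key).text = str(item)
    return ET.tostring(root, encoding="unicode")


def from_xml(xml_text, list_fields=()):
    """Decode XML back into a record; `list_fields` names the repeatable terms."""
    record = {}
    for child in ET.fromstring(xml_text):
        if child.tag in list_fields:
            record.setdefault(child.tag, []).append(child.text)
        else:
            record[child.tag] = child.text
    return record


# Hypothetical record; none of these values come from a real registry
record = {
    "title": "Example reference dataset",
    "publisher": "Example NMI",
    "subject": ["chemistry", "thermodynamics"],
}
```

Both encodings carry the same concepts, which is why the discussion can focus on the concepts first. Note that the XML decoder needs to be told which terms are repeatable; that kind of detail is what a schema convention would settle.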
Slide6
Defining a Schema
Schema = the set and organization of the terms (representing concepts) that we will use to describe our data resources
Schema framework = the techniques and patterns we use to define our schema
Key requirement: Extensibility
  Allows us to evolve the schema with extensions: add new terms as needed
  Don't need to solve the entire metadata problem today (or ever)!
  Introducing extensions must not break existing systems
  Successful strategy for extension demonstrated in the Virtual Observatory
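One way to read "introducing extensions must not break existing systems" is that a consumer written against the core terms should ignore terms it does not recognize rather than reject the record. The sketch below shows that idea only; the core term names and both functions are hypothetical, and it is not the Virtual Observatory's extension mechanism.

```python
# Hypothetical core schema: the terms every consumer is assumed to understand
CORE_TERMS = {"title", "publisher", "identifier"}


def read_core(record):
    """Read the core terms; unknown (extension) terms are ignored, not rejected."""
    missing = CORE_TERMS - set(record)
    if missing:
        raise ValueError(f"missing core terms: {sorted(missing)}")
    return {term: record[term] for term in CORE_TERMS}


def extension_terms(record):
    """List the terms that a newer schema extension has added to this record."""
    return sorted(set(record) - CORE_TERMS)
```

A record written before an extension existed and one written after it yield the same core view, so older software keeps working while newer software can use the added terms.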
Slide7
What do we want to find?
Each record will describe something that we want to be able to discover
  Datasets
    Standard Reference Data, Reference Data, Data associated with publications, …
  Databases
  Portals and web sites
  Other tools and services
  Our Member Institutes
  Participating Registries
Our discussion of sample queries will help tease this out
Slide8
Different types of resources
Resource = something we want to find
We expect to have a set of metadata attributes that are common to all resources
We can add additional metadata to describe specific kinds of resources
A model being developed for the materials science community
Slide9
Kinds of metadata
Identity -- how we recognize it
Curation -- who is responsible
Content -- what it is about
Access -- how to get at it
Applicability -- how it applies to different domains
  Examples: Physics, Chemistry, Biology, Materials Science
  Can have multiple entries, each containing metadata specific to a different domain
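As a rough sketch of how these five kinds of metadata might sit together in one record, the example below groups hypothetical fields under each kind and carries two applicability entries, one per domain. Every field name, value, and helper function here is an assumption for illustration; the actual schema is still to be defined.

```python
# Hypothetical record grouped by the five kinds of metadata
record = {
    "identity": {"title": "Example property data", "identifier": "example:0001"},
    "curation": {"publisher": "Example NMI"},
    "content": {"subjects": ["thermophysical properties"]},
    "access": {"url": "https://data.example.org/0001"},
    # One entry per domain, each free to carry domain-specific terms
    "applicability": [
        {"domain": "Chemistry", "compound_classes": ["hydrocarbons"]},
        {"domain": "Materials Science", "material_types": ["polymers"]},
    ],
}


def domains(rec):
    """Return the domains a record declares applicability entries for."""
    return [entry["domain"] for entry in rec.get("applicability", [])]


def applies_to(rec, domain):
    """Case-insensitive check used as a very simple discovery query."""
    return domain.lower() in (d.lower() for d in domains(rec))
```

Keeping the domain-specific terms inside separate applicability entries means a chemistry-oriented search and a materials-oriented search can each look only at the entry they understand.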
Slide10
Strategy
Collaborate on a demonstration
  NMIs can participate at whatever level they are able
    Refining the metadata schema: conceptually or technically
    Software implementations
  Leverage on-going registry development at NIST
What do we want to find and how
  Sample queries
  Issues