Research Semantic Tagging Discoverability and Accessibility Terri Mitton OECD Publishing terrimittonoecdorg 2 Strategies for improving findabilty T agging Discoverability ID: 267136
Download Presentation The PPT/PDF document "OECD’s Approach to Facilitating" is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.
Slide1
OECD’s Approach to Facilitating Research
Semantic
Tagging, Discoverability and Accessibility
Terri
Mitton
,
OECD Publishing
terri.mitton@oecd.orgSlide2
2
Strategies
for
improving findabilty
Tagging
Discoverability
Make it easier…
…for readers to find, use and understand data
AccessibilitySlide3
3
Accessibility
Make it easier…
…for readers to find, use and understand data
AccessibilitySlide4
Discover statistics
in various formats
(search, browse by topic, country)
Quick access
to
OECD.Stat
and tools for statisticians
API services
(for those who understand such things)
DATA PORTAL
Home page
https://data.oecd.org
4Slide5
Quick access to datasets,
ready-made
tables
Quick access to charts, maps and publications
DATA PORTAL
Topic pageSlide6
Real-time data in easy to use charts
Definition
Link to source database
Links to related indicators and publications
Go from main indicator to more detail…
Easy to use, understand
6Slide7
7
Trends and
country
ranking for selected indicators
Compare countries
Trends and rankings
And on mobile tooSlide8
8
How do we make data easy to find and understand?Slide9
9
Discoverability
DiscoverabilitySlide10
From data to discoverable contentSlide11
From data to discoverable
contentSlide12
Discovery via
OECD
Search
12
General public
Researcher
Search…
Chapters
Tables
Indicators
Databases
…
Search…
Indicators
Databases
Publications
www.oecd-ilibrary.org
d
ata.oecd.orgSlide13
Is it enough still ? No !!
Source: Gardner and Inger (2012): How readers discover content in scholarly journalsSlide14
Discovery usual suspects
For professionals . . .
. . . and the publicSlide15
Intensive work with industry key players
Now indexing our 170.000 published objects including datasets, tables…..Slide16
But we have also learned to
let go
we now encourage anyone to
read
and then
share and embed
our publications in
their websites
and blogs
for freeSlide17
Embedded full books and chartsSlide18
Searching
Tagging
18
Semantic taggingSlide19
Text
analysis tools that
identifies pertinent information
Combs
through documents and extracts conceptsTo enrich the documents we use ‘skill cartridges’ which contain taxonomies and specific business rules.
19Semantic Enrichment…Slide20
20
Example
taxonomy
« disabled students »
« disabled students »
« disabled students »
«
handicapped
students
» ?
« disabled children »?Slide21
21
Not
only
…Slide22
22
Or…Slide23
23
OECD Taxonomy
2. Geographical areas
3. Document
metadata
1. Topics
Different
ways
of
classifying
OECD
workSlide24
<XML>
xxxxxxxx
Xxxxxxxxxxx
Xxxxxxxxxxx
Xxxxxxxxxxx
Xxxxxxxxxxx
</XML>
Content
Documents,Publications
Vocabularies
Business
Logic
Finance
Fiscal
affairs
Fraud
Education
Skills
Attainment
P
C1
E
T2
S
R
P
T1
C2
OECD
Subjects
Semantic
Enrichment
<
xml
>
xxxxxxx
</
xml
>
Fragment
<<Temis Luxid>>
Annotations
Triples
Linking
,
Inferencing
<<RDF>>
Linked
Business Data
Triples
<XML>
xxxxxxxx
Xxxxxxxxxxx
Xxxxxxxxxxx
Xxxxxxxxxxx
Xxxxxxxxxxx
</XML>
External
Sources
P
olicy
E
vidence
S
tate
of
Affairs
R
ecommendation
C
ountry
T
heme
…
<<Triple Store>>
<<Triple Store>>
Candidate
Terms
(RSS)
24Slide25
Annotation
Factory
Fragment
Classification
…
…
<XML>
xxxxxxxx
Xxxxxxxxxxx
Xxxxxxxxxxx
Xxxxxxxxxxx
Xxxxxxxxxxx
</XML>
Documents,
Publications
<
xml
>
xxxxxxx
</
xml
>
Fragment
<<Luxid Annotation
Factory
>>
Triple Store
Annotation
Web Service
<
xml
>
Industry
</
xml
>
<
xml
>Finance</
xml
Annotations
Extract
« fragments »
Submit
« fragments »
to Luxid
Store annotations as triples
Document Classification
P.E.R.S.
(*)
Taxonomy
Enrichment
<<
cartridge
>>
<<
cartridge
>>
<<
cartridge
>>
(*)
P
olicy/
E
vidence
/
R
ecommendation
/
S
tate-of-
Affairs
OECD Semantic enrichment factory
25Slide26
<XML>
xxxxxxxx
Xxxxxxxxxxx
Xxxxxxxxxxx
Xxxxxxxxxxx
Xxxxxxxxxxx
</XML>
Content
Annotations
Linked
Data
OECD.Discover
26Slide27
27
« Education
policy
» use cases
UC-8
UC-1
UC-6
UC-5
UC-4a
UC-3
UC-2
UC-4bSlide28
28
OECD.Discover
statistics
132 OECD publications
321,000 fragments
195,800 PERS
objects
89,200 state of
affairs
28,000 policies
72,300 evidences
6,300 recommendationsSlide29
Integration
29
Web services and connectors have been developed to facilitate the integration of the taxonomy and semantic tools with the different OECD applications
Luxid
(Temis) and
Sharepoint 2010 integration tested and the principle validated Luxid (Temis
) and OpenText Content Server (OECD.Records) integration tested and in productionSlide30
Thank you & Questions?
Terri.mitton@oecd.org