A case study in normalization Abigail Elbow Breena Krick Laura Kelly NIHNLMNCBIPMC JATSCon 9272011 PMC Overview What do those people do with data anyway But first The PMC process ID: 418856
Download Presentation The PPT/PDF document "PMC Tagging Guidelines" is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.
Slide1
PMC Tagging Guidelines
A case study in normalization
Abigail Elbow, Breena Krick, Laura
Kelly
NIH/NLM/NCBI/PMC
JATS-Con
| 9.27.2011Slide2
PMC Overview
What do those people do with data, anyway?
But first…Slide3
The PMC process:35 schemas
Validate against declared DTDTransform into JATS XML (Green Archiving DTD)Check validityRun Style Checker
Load to PMC databaseSlide4
What’s that look like?Slide5
Q: Why have a style checker?
A: PMC is simply a user of the JATS / NLM DTDsSlide6
Can you be more specific?
More than one way to tag a structureNeed for normalizationStart with the basic & most inconsistently-tagged:
Article metadata
Figures
Tables
RelaxNG
schema used first
Replaced with XSL
stylesheets
Allow flexibility, reporting, and varying file outputSlide7
Q: So how does anyone know how to tag in PMC style?
A: The PMC Tagging Guidelines…Slide8
The Tagging GuidelinesHTML prose form of the style rules
General Tagging Practice, Document Objects, ElementsIntroduction and Update HistoryXML backbone
Covers PMC, NIHMS, and Bookshelf
Covers both 2.3 and 3.0Slide9
Tagging Guideline XML: @versionSlide10
Tagging Guideline HTMLSlide11
Tagging Guidelines: an elementSlide12
Q: How do I know if my file is compliant?
A: The PMC Style Checker Slide13
Five common style errorsMathML
<
related-article>
tagging
<
xref
>
and @
ref-type
DOIs
Empty
elements
Demo time:
http://
www.pubmedcentral.nih.gov/utils/style_checker/stylechecker.cgi
Slide14
Q: What if I have lots of files?
A: NLM Style Checker stylesheets
(v4.3.4)Slide15
The Style Checker Stylesheets
Main file: nlm-stylechecker.xslIt
xsl:include
(s):
stylecheck-match-templates.xsl
stylecheck-named-tests.xsl
stylecheck-helper-templates.xsl
Reports:
style-reporter.xsl
Generates an HTML Error/Warning reportSlide16
badstyle.XMLSlide17
Another report view: (PMC Production)Slide18
Special thanksLaura Kelly
Breena KrickJeff BeckSlide19
Resources:
PMC Tagging Guidelines:http://www.ncbi.nlm.nih.gov/pmc/pmcdoc/tagging-guidelines/article/style.html
PMC
Online Style Checker:
http://
www.pubmedcentral.nih.gov/utils/style_checker/stylechecker.cgi
Downloadable Style Checker
stylesheets
and instructions:
http://
www.ncbi.nlm.nih.gov/pmc/pmcdoc/tagging-guidelines/stylechecker/stylecheck-README.html
PMC Utilities:
http://www.ncbi.nlm.nih.gov/pmc/pub/validation/
Tagging
Guidelines email list:
http://www.ncbi.nlm.nih.gov/mailman/listinfo/pmc-tagging-guidelines