Slide1
THE HEBREW BIBLE AS DATA
Wido van Peursen
Eep Talstra Centre for Bible and Computer
@shebanq_ / @PeursenWTvan
Slide2
The corpus
Hebrew Bible
Ca. 400,000 words
Probably composed over a period of about 1000 years (1200-200 BC)
Complex transmission history
Oldest complete manuscript: Codex Leningradensis, 1008/9 AD
Various linguistic layers (e.g. vowel signs)
No native speakers
Slide3
The database
WIVU database of the Hebrew Bible [WIVU = Werkgroep Informatica Vrije Universiteit]
Created since the 1970s
Linguistic levels:
Morphology (encoding rather than tagging!)
Words
Phrases
Clauses
Sentences
Text hierarchy
Slide4
The data structure
Slide5
EMDROS
Central concept: objects with features
Each object can carry unlimited features
Objects can be aggregated arbitrarily into new objects
Structure that can deal with overlapping hierarchies
Query language: MQL
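The "objects with features" idea can be illustrated with a small Python sketch. This is not EMDROS code and does not use MQL: the names TextObject, aggregate, overlaps and select are hypothetical, and the word positions and feature values are toy data invented for the illustration. It assumes that each object is described by the set of word positions it covers, which is one simple way to let hierarchies overlap.

```python
from dataclasses import dataclass, field


@dataclass
class TextObject:
    """A typed text object: the word positions it covers plus free-form features."""
    otype: str                                      # e.g. "word", "phrase", "clause"
    positions: frozenset                            # word positions covered by the object
    features: dict = field(default_factory=dict)    # any number of features


def aggregate(otype, parts, **features):
    """Build a new object that covers the union of the parts' positions."""
    positions = frozenset().union(*(p.positions for p in parts))
    return TextObject(otype, positions, dict(features))


def overlaps(a, b):
    """True when two objects share positions but neither contains the other."""
    shared = a.positions & b.positions
    return bool(shared) and not (a.positions <= b.positions or b.positions <= a.positions)


def select(objects, otype, **wanted):
    """Return the objects of one type whose features match all wanted values."""
    return [
        o for o in objects
        if o.otype == otype and all(o.features.get(k) == v for k, v in wanted.items())
    ]


# Toy data (hypothetical): five words with an invented part-of-speech feature.
toy_pos = ["prep", "noun", "verb", "noun", "noun"]
words = [TextObject("word", frozenset({i}), {"pos": p})
         for i, p in enumerate(toy_pos, start=1)]

phrase = aggregate("phrase", words[0:2], function="toy-adjunct")   # positions 1-2
clause = aggregate("clause", words[0:4])                           # positions 1-4
line = aggregate("line", words[2:5])                               # positions 3-5

nouns = select(words, "word", pos="noun")
```

Here the clause and the line share positions 3 and 4 without either containing the other, so two hierarchies (syntactic and layout) coexist over the same words, while the phrase sits entirely inside the clause.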
Slide6
HOWEVER...
No dedicated space on the web where an authorized version of this resource is guaranteed to exist.
No possibility to annotate it, link to it or build (open source) tools around it.
Results of existing queries cannot be shown on the web.
EMDROS is maintained by a one-person private company.
Mainly used by specialists in Bible & Computer.
Slide7
shebanq
To build a bridge between the linguistically annotated Hebrew text corpus and biblical scholars.
Three steps:
1. Make text and annotations available to scholars.
2. Demonstrate how queries can function to address research questions: a repository of saved queries.
3. Give textual scholarship a more empirical basis by creating the opportunity of unique identifiers referring to saved queries.
Slide8