Stanbol Overview Features overview Components Stanbol Content Enhancer Stanbol Entity Hub Stanbol Content Hub Stanbol Ontology Technologies 2 Features Apache Stanbol provides a set of reusable components for semantic ID: 265979
Download Presentation The PPT/PDF document "Apache" is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.
Slide1
Apache
Stanbol
Slide2
Overview
Features overview
ComponentsStanbol Content EnhancerStanbol Entity HubStanbol Content HubStanbol OntologyTechnologies
2Slide3
Features
Apache
Stanbol provides a set of reusable components for semantic content management.Apache Stanbol's main features are:Content EnhancementServices that add semantic information to “non-semantic” pieces of content.ReasoningServices that are able to retrieve additional semantic information about the content based on the semantic information retrieved via content enhancement.Knowledge ModelsServices that are used to define and manipulate the data models (e.g. ontologies) that are used to store the semantic information.PersistenceServices that store (or cache) semantic information, i.e. enhanced content, entities, facts, and make it searchable.3Slide4
Components
Enhancer
: Extracts Knowledge from parsed ContentEntityhub: Manage Entities and Topics of Interest to your DomainContenthub: Semantic Indexing / Search over your - semantic enhanced - ContentCMS Adapter: Sync. your CMS with Apache Stanbol (JCR/CMIS)Ontology Manager: Manage you formal Domain Knowledge Reasoners & Rules: Apply Domain Knowledge to improve / validate extracted.Information. Refactor / refine knowledge to align it to public schemas such as schema.org4Slide5
Stanbol
Content
EnhancerEntity Tagging - replacing text based tags such as "Bob Marley" with entities - dbpedia:Bob_Marley - to improve content search and categorization. Entity Disambiguation - enhance the entity tagging experience by explicit support for disambiguation between different suggested entities. This allows users to explicitly link to Paris (Texas), Bob Marley (Comedian) or in between any other entities that do share similar labels.Entity Checker - interact with extracted entities similar as with todays spellchecker: Show extracted/suggested dirtily within the content; Allow users to interact with suggestions and to disambiguate between different matches if necessary; Support search for additional/other entities.5Slide6
Stanbol
Content Enhancer (II)
6Slide7
Stanbol
Content Enhancer (III
)Support for domain specific vocabularies7Slide8
Stanbol
Content Enhancer (IV)
The following Languages are supported for Named Entity Recognition - and can therefore be used for Named entity Linking:English (via NamedEntityTaggingEngine, OpenCalais)Spansh (via NamedEntityTaggingEngine, OpenCalais)Dutch ((via NamedEntityTaggingEngine)French (via CELI NER engine, OpenCalais)Italien (via CELI NER engine)For the following languages NLP support is available to improve results when using the Keyword Extraction Engine:DanishDutchEnglishGermanPortugueseSpanishSwedish
8Slide9
Stanbol
Content Enhancer (V
)9Slide10
Stanbol
Entity Hub
Responsible for providing the information about Entities relevant to the users domain. The following figure tries to provide an overview about the features of the Entityhub.10Slide11
Stanbol
Content Hub
Add Semantic Search to your CMSRESTful Faceted Search InterfaceRelated Keyword Search using Entityhub, Ontonet or WordnetImprove Search by Semantic IndexingUse the Stanbol Contenthub for semantic indexing11Slide12
Stanbol
Ontology
Manage your Ontologiesand use/combine them in Scopes Reasoningon volatile Data loaded into a Sessions consistency check / classification / enrichmentRDFS, OWL and OWL - 2Support for background Jobs for long running reasoning tasks12Slide13
Stanbol
Ontology
13Slide14
Stanbol
Ontology (Rules)
Stanbol RulesRecipes: Manage a set of Rules that are executed togetherRules are converted to SWRL,Jena Rules or SPARQL CONSTRUCT depending on the available RuleEngineTypical Use Casesintegrity checks for imported Dataharmonize Vocabularies e.g. simple SEO by using schema.org14Slide15
Technologies
Functionalities
are provided as RESTful services returning results as RDF (Resource Description Language) and JSON. Apache Stanbol also supports the use of JSON-LD.Apache Stanbol can be run as a standalone application (packaged as a runable JAR) or as an web application (packaged as a WAR file) deployable in servlet containers such as Apache Tomcat.Written in Java based on the OSGi as component framework.Implemented using frameworks such as Apache Solr - for semantic search; Apache Tika - for plain text and metadata extraction; Apache OpenNLP - for natural language processing; Apache Clerezza and
Apache Jena
- as RDF and storage frameworks;
Apache
Felix
as
OSGi
framework and
Apache
Sling
for deployment
.
15Slide16
Technologies (II)
Stanbol
Components provideRESTful APIJava API and OSGI servicesStanbol Components do NOT depend on each otherhowever they can be easily combined16Slide17
Live DEMO
http://dev.iks-project.eu:8081
17