[ 
https://issues.apache.org/jira/browse/STANBOL-335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13119927#comment-13119927
 ] 

Rupert Westenthaler commented on STANBOL-335:
---------------------------------------------

As a first step, let me describe which properties are currently included in 
the indices created for entity extraction (NamedEntityTaggingEngine and 
KeywordLinkingEngine) and for the user interface of the Stanbol Enhancer.

### CORE properties

* rdfs:label (in all languages)
* rdfs:comment (only in English)
* rdf:type
* rdfs:seeAlso (however I think this property is not used by DBpedia)
* dbp-ont:wikiPageRedirects mapped to rdfs:seeAlso

### SPATIAL properties

* geo:lat
* geo:long
* geo:alt
* dbp-ont:elevation is mapped to geo:alt and values are converted to xsd:int
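
The elevation mapping in the last item can be illustrated with a minimal Python sketch. It is not the Stanbol indexing configuration itself: the function name `map_elevation`, the representation of an entity as a dict of property lists, the choice to round to the nearest integer, and keeping the source property are all assumptions made for illustration.

```python
def map_elevation(entity):
    """Return a copy of `entity` where dbp-ont:elevation values are also
    available under geo:alt, converted to integers (as for xsd:int).

    Assumptions: `entity` maps property names to lists of values, values
    are rounded to the nearest integer, and the source property is kept.
    """
    # Copy the value lists so the input entity is left untouched.
    result = {prop: list(values) for prop, values in entity.items()}
    for value in entity.get("dbp-ont:elevation", []):
        altitude = int(round(float(value)))
        altitudes = result.setdefault("geo:alt", [])
        if altitude not in altitudes:  # avoid duplicate values
            altitudes.append(altitude)
    return result
```

Calling `map_elevation({"dbp-ont:elevation": ["171.0"]})` would yield an entity that additionally carries `geo:alt` with the integer value `171`.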

### CATEGORY properties

**Category assignment**

* dc-terms:subject (because it is used to store the participation of Entities 
in Categories)
* dc-elements:subject is mapped to dc-terms:subject

**Category data**

* skos:* (all skos properties - such properties are only used for categories)
* skos:prefLabel is copied to rdfs:label
* skos:altLabel is copied to rdfs:label
* skos:hiddenLabel is copied to rdfs:label
* skos:relatedMatch is copied to skos:related
* skos:broadMatch is copied to skos:broader
* skos:narrowMatch is copied to skos:narrower
* skos:narrowerTransitive is copied to skos:narrower
* skos:broaderTransitive is copied to skos:broader
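
The copy rules for category data listed above can be sketched in Python as follows. This only illustrates the intended effect and is not how the Stanbol indexer is implemented or configured; `COPY_RULES`, `apply_copy_rules`, and the dict-of-lists entity representation are hypothetical names and structures, and the sketch assumes that "copied" means the source property keeps its values.

```python
# (source property, target property) pairs taken from the list above.
COPY_RULES = [
    ("skos:prefLabel", "rdfs:label"),
    ("skos:altLabel", "rdfs:label"),
    ("skos:hiddenLabel", "rdfs:label"),
    ("skos:relatedMatch", "skos:related"),
    ("skos:broadMatch", "skos:broader"),
    ("skos:narrowMatch", "skos:narrower"),
    ("skos:narrowerTransitive", "skos:narrower"),
    ("skos:broaderTransitive", "skos:broader"),
]


def apply_copy_rules(entity, rules=COPY_RULES):
    """Return a copy of `entity` with source values added to the targets.

    Assumption: `entity` maps property names to lists of values, and the
    source properties are kept alongside the copied values.
    """
    result = {prop: list(values) for prop, values in entity.items()}
    for source, target in rules:
        # Read from the original entity so copies are not copied again.
        for value in entity.get(source, []):
            targets = result.setdefault(target, [])
            if value not in targets:  # avoid duplicate values
                targets.append(value)
    return result
```

With such a rule table, a category that only has SKOS labels also becomes searchable via rdfs:label, which is the property listed under the CORE properties.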

### OTHER properties

* foaf:homepage (link to the homepage of the entity)
* foaf:depiction (link to the thumbnail)
* dbp-ont:thumbnail is mapped to foaf:depiction (because this property is also 
used for thumbnails)
* dbp-ont:birthDate
* dbp-ont:deathDate
* dbp-ont:populationTotal

This configuration produces an index of about 5 GByte for 4.4 million 
entities. The index can be downloaded at 
http://dev.iks-project.eu/downloads/stanbol-indices/dbpedia-3.6-multiple-languages/



                
> Creating new DBPedia indexes including new properties that might be useful
> --------------------------------------------------------------------------
>
>                 Key: STANBOL-335
>                 URL: https://issues.apache.org/jira/browse/STANBOL-335
>             Project: Stanbol
>          Issue Type: Improvement
>          Components: Entity Hub
>            Reporter: Suat Gonul
>            Priority: Minor
>              Labels: DBPedia, Index
>
> New DBPedia indexes could include additional useful properties, such as 
> country information for cities, or birth place or company information for 
> persons ...
> Once a worthwhile number of needed properties (e.g. 10) has been collected 
> under this issue, a new DBPedia index can be created.


        
