[ 
https://issues.apache.org/jira/browse/STANBOL-1148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rupert Westenthaler updated STANBOL-1148:
-----------------------------------------

    Description: 
The dbpedia default data index will be updated to:

* use the current dbpedia.org version. Currently dbpedia 3.6 is used by the 
default index. The new one will be based on dbpedia 3.8.
* do no longer index entities that are redirects. Rather generate a 
'dbp-ont:surfaceForm' field that indexes all labels of entities that redirect 
to indexed one (e.g 'US', 'USA', 'U.S.A' … -> 'United States')
* make the index compatible to the FST linking engine
* include generated FST models

As a lot of unit tests and integration test do depend on the data contained in 
the index this will also require to adapt those test.

  was:
The dbpedia default data index should be updated so that it can be used with 
the FST linking engine. 

This is an own issue as this change in data will most likely also require to 
change existing unit and integration test that relay on the current data 
present in the dbpedia default data index.

The current dbpedia default data index is based on dbpedia version 3.6. The new 
one will use version 3.8


    
> Update the dbpedia default data
> -------------------------------
>
>                 Key: STANBOL-1148
>                 URL: https://issues.apache.org/jira/browse/STANBOL-1148
>             Project: Stanbol
>          Issue Type: Improvement
>          Components: Enhancement Engines
>            Reporter: Rupert Westenthaler
>            Assignee: Rupert Westenthaler
>
> The dbpedia default data index will be updated to:
> * use the current dbpedia.org version. Currently dbpedia 3.6 is used by the 
> default index. The new one will be based on dbpedia 3.8.
> * do no longer index entities that are redirects. Rather generate a 
> 'dbp-ont:surfaceForm' field that indexes all labels of entities that redirect 
> to indexed one (e.g 'US', 'USA', 'U.S.A' … -> 'United States')
> * make the index compatible to the FST linking engine
> * include generated FST models
> As a lot of unit tests and integration test do depend on the data contained 
> in the index this will also require to adapt those test.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to