[ https://issues.apache.org/jira/browse/STANBOL-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15895656#comment-15895656 ]
Rafa Haro commented on STANBOL-1458: ------------------------------------ I don't have permissions enough to create versions, but Fix version must be entityhub-1.0.1 > Fields Language is being filtered while creating entities into Solr Yard > based Managed Sites > -------------------------------------------------------------------------------------------- > > Key: STANBOL-1458 > URL: https://issues.apache.org/jira/browse/STANBOL-1458 > Project: Stanbol > Issue Type: Bug > Components: Entityhub > Affects Versions: 1.0.0 > Reporter: Rafa Haro > Assignee: Rafa Haro > Labels: managed_site > > When entities are created through Managed Sites REST API, fields containing > xml:lang annotations are being stored into Solr (Yard) using only the field > value and not also the language. This is preventing, among other things, > Entity Linking engine to found the entities when the language is detected > first. Even if the Entity Linking engine is configured without any predefined > language, the entities are not found. > Taking a look into the code, The StringConverter within the IndexValueFactory > is, by purpose, ignoring the language for xsd:string based DataTypes. > TextConverter (which is bound to entityhub:text type) is indexing the > language along with the value. The problem is that, when uploading the > entities through the API, the Clerezza Serializer is of course not able to > understand entityhub:text data type, so it is always parsing the text fields > as xsd:string. > Proposed solution is to include the language, if exists, also for String > DataTypes as Text based are doing -- This message was sent by Atlassian JIRA (v6.3.15#6346)