David Riccitelli created STANBOL-953:
----------------------------------------

             Summary: Text Annotations: New Model engine
                 Key: STANBOL-953
                 URL: https://issues.apache.org/jira/browse/STANBOL-953
             Project: Stanbol
          Issue Type: Wish
          Components: Enhancer
            Reporter: David Riccitelli
            Priority: Minor


Stanbol defines new specifications for the Text Annotations definitions as part 
of the result of an enhancement analysis. These specifications are published on 
the official web site [1].

Their aim is to add the head/tail and prefix/suffix information to a Text 
Annotation. This would greatly benefit dependent services that somehow need to 
"clean-up" the textual contents before sending them for analysis, while 
receiving meaningful information about linking the identified entities with the 
related Text Annotations (without using the thus unreliable start/end 
information).

In order to jump-start support for the head/tail and prefix/suffix model, we 
create a TextAnnotations-NewModel engine which is converting start/end 
information to head/tail/prefix/suffix information before the analysis results 
are returned to the client. This engine was previously announced to the dev 
mailing list [2].

It would be nice to have the engine [3] merged in the trunk.

[1]
http://stanbol.apache.org/docs/trunk/components/enhancer/enhancementstructure.html#fisetextannotation

[2]
http://mail-archives.apache.org/mod_mbox/stanbol-dev/201211.mbox/%3ccag94hgi2miswgtvyu7-bnqgqvmgrc0w7vl1czezv-fc4xns...@mail.gmail.com%3E

[3]
https://github.com/insideout10/wordlift-stanbol/tree/master/textannotations-futuremodel

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to