David Riccitelli created STANBOL-953:
----------------------------------------
Summary: Text Annotations: New Model engine
Key: STANBOL-953
URL: https://issues.apache.org/jira/browse/STANBOL-953
Project: Stanbol
Issue Type: Wish
Components: Enhancer
Reporter: David Riccitelli
Priority: Minor
Stanbol defines new specifications for the Text Annotations definitions as part
of the result of an enhancement analysis. These specifications are published on
the official web site [1].
Their aim is to add the head/tail and prefix/suffix information to a Text
Annotation. This would greatly benefit dependent services that somehow need to
"clean-up" the textual contents before sending them for analysis, while
receiving meaningful information about linking the identified entities with the
related Text Annotations (without using the thus unreliable start/end
information).
In order to jump-start support for the head/tail and prefix/suffix model, we
create a TextAnnotations-NewModel engine which is converting start/end
information to head/tail/prefix/suffix information before the analysis results
are returned to the client. This engine was previously announced to the dev
mailing list [2].
It would be nice to have the engine [3] merged in the trunk.
[1]
http://stanbol.apache.org/docs/trunk/components/enhancer/enhancementstructure.html#fisetextannotation
[2]
http://mail-archives.apache.org/mod_mbox/stanbol-dev/201211.mbox/%3ccag94hgi2miswgtvyu7-bnqgqvmgrc0w7vl1czezv-fc4xns...@mail.gmail.com%3E
[3]
https://github.com/insideout10/wordlift-stanbol/tree/master/textannotations-futuremodel
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira