[
https://issues.apache.org/jira/browse/UIMA-958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12585940#action_12585940
]
Michael Baessler commented on UIMA-958:
---------------------------------------
Fine with me to release this with the next release. I will change the fix
version to 2.3
> Document Analyzer not showing PersonTitle when running with xml tagged source
> -----------------------------------------------------------------------------
>
> Key: UIMA-958
> URL: https://issues.apache.org/jira/browse/UIMA-958
> Project: UIMA
> Issue Type: Bug
> Components: Tools
> Reporter: Marshall Schor
> Assignee: Marshall Schor
> Priority: Minor
> Fix For: 2.3
>
>
> Running the document analyzer with an input source of xml docs, and
> specifying the xml tag (of TEXT) causes the personTitle annotator to not show
> results. This is traced to it being given a language of x-unspecified, even
> though the collection reader used set the document language to "en". This is
> traced to the insertion of the xmldetagger into the aggregate, which passed a
> cas view of "plain text" to the personTitle annotator. That component did
> not copy the language specifier from the input cas view. In fact, the input
> cas view was "xmlDocument" - and that view also didn't have the language.
> The collection reader put the language only into the "_InitialView". Fix by
> having the XmlDetagger copy the initial view's language into the resulting
> plain text view it creates for downstream annotators to work on.
> Note that there are two copies of this class - fix both of them (One in
> tools, other in examples).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.