[
https://issues.apache.org/jira/browse/TIKA-1815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15065533#comment-15065533
]
ASF GitHub Bot commented on TIKA-1815:
--------------------------------------
GitHub user thammegowda opened a pull request:
https://github.com/apache/tika/pull/67
FIX for TIKA-1815 contributed by Thamme Gowda
+ Writing the text content to XML Document
+ Added Regex recogniser to default NER chain
Closes #66 (this is a simpler version of the same). Fixes #TIKA-1815
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/thammegowda/tika TIKA-1815
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/tika/pull/67.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #67
----
commit a40a18e2f61f2152fa065bda193ceb74e7e60c97
Author: Thamme Gowda <[email protected]>
Date: 2015-12-19T20:56:21Z
FIX for TIKA-1815 contributed by Thamme Gowda
+ Writing the text content to XML Document
+ Added Regex recogniser to default NER chain
----
> Text content from parser is empty when NamedEntityParser is enabled
> -------------------------------------------------------------------
>
> Key: TIKA-1815
> URL: https://issues.apache.org/jira/browse/TIKA-1815
> Project: Tika
> Issue Type: Bug
> Components: parser
> Reporter: Thamme Gowda N
> Fix For: 1.12
>
> Original Estimate: 0.5h
> Remaining Estimate: 0.5h
>
> When the NamedEntityParser is enabled, the Tika#parseToString() and other
> parse() methods produces an empty string.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)