[ 
https://issues.apache.org/jira/browse/TIKA-2360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16010631#comment-16010631
 ] 

Tim Allison commented on TIKA-2360:
-----------------------------------

My preference would be not to include the SentimentParser by default:

1) network calls that are not currently robustly handled

2) .sent glob in mime detection which could cause problems for users who happen 
to have files with that suffix, and y, I can't imagine users have a bunch of 
Apple II files kicking around, but this is a mildly worrisome method of 
triggering the SentimentParser

3) while very cool, it is a fundamentally different thing than a parser.  It 
enriches already extracted UTF-8 text, kind of like the phone number handler, 
etc.  I realize NER does exactly the same thing...I know...



> Handle SentimentParser resource failure more robustly
> -----------------------------------------------------
>
>                 Key: TIKA-2360
>                 URL: https://issues.apache.org/jira/browse/TIKA-2360
>             Project: Tika
>          Issue Type: Improvement
>            Reporter: Tim Allison
>            Priority: Blocker
>
> The SentimentParser tests currently require a network call to github.  For 
> those working behind a proxy or would prefer Tika not to make unexpected 
> network calls, can we please turn this off by default?



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to