[
https://issues.apache.org/jira/browse/SOLR-4358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13633522#comment-13633522
]
Uwe Schindler commented on SOLR-4358:
-------------------------------------
I stopped Jenkins builds until this is fixed/reverted, because the Windows
Jenkins machine needed to be killed hard because ChaosMonkeySafeLeaderTest was
eating all virtual CPUs available, making it impossible to shut down the
virtual machine or stop tests other than hitting the virtual PowerOff button :(
Under Linux, only kill -9 stops ChaosMonkey.
> SolrJ, by preventing multi-part post, loses key information about file name
> that Tika needs
> -------------------------------------------------------------------------------------------
>
> Key: SOLR-4358
> URL: https://issues.apache.org/jira/browse/SOLR-4358
> Project: Solr
> Issue Type: Bug
> Components: clients - java
> Affects Versions: 4.0
> Reporter: Karl Wright
> Assignee: Ryan McKinley
> Attachments: SOLR-4358.patch, SOLR-4358.patch
>
>
> SolrJ accepts a ContentStream, which has a name field. Within
> HttpSolrServer.java, if SolrJ makes the decision to use multipart posts, this
> filename is transmitted as part of the form boundary information. However,
> if SolrJ chooses not to use multipart post, the filename information is lost.
> This information is used by SolrCell (Tika) to make decisions about content
> extraction, so it is very important that it makes it into Solr in one way or
> another. Either SolrJ should set appropriate equivalent headers to send the
> filename automatically, or it should force multipart posts when this
> information is present.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]