[
https://issues.apache.org/jira/browse/SOLR-4358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13635745#comment-13635745
]
Karl Wright commented on SOLR-4358:
-----------------------------------
Why would SolrCloud be affected at all by an HttpSolrServer.java change?
> SolrJ, by preventing multi-part post, loses key information about file name
> that Tika needs
> -------------------------------------------------------------------------------------------
>
> Key: SOLR-4358
> URL: https://issues.apache.org/jira/browse/SOLR-4358
> Project: Solr
> Issue Type: Bug
> Components: clients - java
> Affects Versions: 4.0
> Reporter: Karl Wright
> Assignee: Ryan McKinley
> Attachments: additional_changes.diff, SOLR-4358.patch,
> SOLR-4358.patch, SOLR-4358.patch
>
>
> SolrJ accepts a ContentStream, which has a name field. Within
> HttpSolrServer.java, if SolrJ makes the decision to use multipart posts, this
> filename is transmitted as part of the form boundary information. However,
> if SolrJ chooses not to use multipart post, the filename information is lost.
> This information is used by SolrCell (Tika) to make decisions about content
> extraction, so it is very important that it makes it into Solr in one way or
> another. Either SolrJ should set appropriate equivalent headers to send the
> filename automatically, or it should force multipart posts when this
> information is present.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]