[ https://issues.apache.org/jira/browse/SOLR-4358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13636572#comment-13636572 ]
Ryan McKinley commented on SOLR-4358:
-------------------------------------
This just passed on my local trunk checkout. I will commit and make sure
Jenkins is happy with trunk.
If it is, I will merge to 4.x.
> SolrJ, by preventing multi-part post, loses key information about file name
> that Tika needs
> -------------------------------------------------------------------------------------------
>
> Key: SOLR-4358
> URL: https://issues.apache.org/jira/browse/SOLR-4358
> Project: Solr
> Issue Type: Bug
> Components: clients - java
> Affects Versions: 4.0
> Reporter: Karl Wright
> Assignee: Ryan McKinley
> Attachments: additional_changes.diff, SOLR-4358.patch,
> SOLR-4358.patch, SOLR-4358.patch
>
>
> SolrJ accepts a ContentStream, which has a name field. Within
> HttpSolrServer.java, if SolrJ makes the decision to use multipart posts, this
> filename is transmitted as part of the form boundary information. However,
> if SolrJ chooses not to use multipart post, the filename information is lost.
> This information is used by SolrCell (Tika) to make decisions about content
> extraction, so it is very important that it makes it into Solr in one way or
> another. Either SolrJ should set appropriate equivalent headers to send the
> filename automatically, or it should force multipart posts when this
> information is present.
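
To make the description above concrete, here is a minimal sketch of where the stream name travels in each kind of request. It is not the SolrJ code or the attached patch: the class `FilenameTransport` and its helper methods are hypothetical and use only the JDK. It relies on one assumption about the server side, namely that Solr Cell accepts a `resource.name` request parameter as a filename hint for Tika, which is one possible way to carry the name when the post is not multipart.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;

// Hypothetical illustration only; none of these helpers exist in SolrJ.
public class FilenameTransport {

    /**
     * Headers of one multipart/form-data part. The stream name rides along
     * in the Content-Disposition header as the filename attribute.
     * Simplified: a real client must escape quotes in the name.
     */
    static String multipartPartHeaders(String boundary, String field,
                                       String streamName, String contentType) {
        return "--" + boundary + "\r\n"
             + "Content-Disposition: form-data; name=\"" + field
             + "\"; filename=\"" + streamName + "\"\r\n"
             + "Content-Type: " + contentType + "\r\n\r\n";
    }

    /**
     * Entity headers of a raw (non-multipart) post. Only the content type
     * is sent, so the stream name has nowhere to go.
     */
    static String rawPostHeaders(String contentType) {
        return "Content-Type: " + contentType + "\r\n\r\n";
    }

    /**
     * One possible workaround: append the name as a request parameter.
     * Assumes the handler reads resource.name as a filename hint.
     */
    static String withResourceName(String path, String streamName) {
        try {
            String sep = path.contains("?") ? "&" : "?";
            return path + sep + "resource.name="
                 + URLEncoder.encode(streamName, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            // UTF-8 is required to be supported by every JVM.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        String name = "my report.pdf";
        System.out.print(multipartPartHeaders("XyZ", "file", name, "application/pdf"));
        System.out.print(rawPostHeaders("application/pdf"));
        System.out.println(withResourceName("/update/extract", name));
    }
}
```

The multipart variant keeps the name next to the content it describes, while the parameter variant keeps the body a plain stream. Either matches one of the two remedies proposed in the description.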