[ 
https://issues.apache.org/jira/browse/SOLR-4358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13636290#comment-13636290
 ] 

Karl Wright commented on SOLR-4358:
-----------------------------------

Yup, test fails even with patch removed.  So it's not the patch.

                
> SolrJ, by preventing multi-part post, loses key information about file name 
> that Tika needs
> -------------------------------------------------------------------------------------------
>
>                 Key: SOLR-4358
>                 URL: https://issues.apache.org/jira/browse/SOLR-4358
>             Project: Solr
>          Issue Type: Bug
>          Components: clients - java
>    Affects Versions: 4.0
>            Reporter: Karl Wright
>            Assignee: Ryan McKinley
>         Attachments: additional_changes.diff, SOLR-4358.patch, 
> SOLR-4358.patch, SOLR-4358.patch
>
>
> SolrJ accepts a ContentStream, which has a name field.  Within 
> HttpSolrServer.java, if SolrJ makes the decision to use multipart posts, this 
> filename is transmitted as part of the form boundary information.  However, 
> if SolrJ chooses not to use multipart post, the filename information is lost.
> This information is used by SolrCell (Tika) to make decisions about content 
> extraction, so it is very important that it makes it into Solr in one way or 
> another.  Either SolrJ should set appropriate equivalent headers to send the 
> filename automatically, or it should force multipart posts when this 
> information is present.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to