[
https://issues.apache.org/jira/browse/SOLR-4358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13564225#comment-13564225
]
Jan Høydahl commented on SOLR-4358:
-----------------------------------
SolrCell will use resource.name request param as filename hint. The application
using SolrJ can set this. Not sure if SolrJ really should be closesly tied to
SolrCell params, SolrCell being a contrib module..
> SolrJ, by preventing multi-part post, loses key information about file name
> that Tika needs
> -------------------------------------------------------------------------------------------
>
> Key: SOLR-4358
> URL: https://issues.apache.org/jira/browse/SOLR-4358
> Project: Solr
> Issue Type: Bug
> Components: clients - java
> Affects Versions: 4.0
> Reporter: Karl Wright
>
> SolrJ accepts a ContentStream, which has a name field. Within
> HttpSolrServer.java, if SolrJ makes the decision to use multipart posts, this
> filename is transmitted as part of the form boundary information. However,
> if SolrJ chooses not to use multipart post, the filename information is lost.
> This information is used by SolrCell (Tika) to make decisions about content
> extraction, so it is very important that it makes it into Solr in one way or
> another. Either SolrJ should set appropriate equivalent headers to send the
> filename automatically, or it should force multipart posts when this
> information is present.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]