[ https://issues.apache.org/jira/browse/CONNECTORS-1204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14564508#comment-14564508 ]
Karl Wright commented on CONNECTORS-1204:
-----------------------------------------
Hi vigi,
The stream_size Tika parameter may work, but it is not necessarily the original
binary document size.
I don't know where you would find the Tika documentation. The Tika site should
have some, I presume.
At any rate, I'll merge the new branch back in and resolve the ticket.
> Import original document file size into Solr
> --------------------------------------------
>
> Key: CONNECTORS-1204
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1204
> Project: ManifoldCF
> Issue Type: New Feature
> Components: Framework agents process, JCIFS connector, Lucene/SOLR connector
> Affects Versions: ManifoldCF 2.0.2
> Reporter: vigi
> Assignee: Karl Wright
> Priority: Minor
> Labels: manifoldcf, outputconnector, solr
> Fix For: ManifoldCF 1.10, ManifoldCF 2.2
>
>
> When using the Solr output connection, I would like to be able to store the
> original file size (in bytes) of the indexed documents into Solr so that it
> could be displayed in the search results or it could even be used for
> searching later on.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)