[ 
https://issues.apache.org/jira/browse/CONNECTORS-1204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14564508#comment-14564508
 ] 

Karl Wright commented on CONNECTORS-1204:
-----------------------------------------

Hi vigi,

The stream_size Tika parameter may work, but it is not necessarily the size of 
the original binary document.

I don't know where you would find the Tika documentation.  I presume the Tika 
site has some.

At any rate, I'll merge the new branch back in and resolve the ticket.


> Import original document file size into Solr
> --------------------------------------------
>
>                 Key: CONNECTORS-1204
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1204
>             Project: ManifoldCF
>          Issue Type: New Feature
>          Components: Framework agents process, JCIFS connector, Lucene/SOLR 
> connector
>    Affects Versions: ManifoldCF 2.0.2
>            Reporter: vigi
>            Assignee: Karl Wright
>            Priority: Minor
>              Labels: manifoldcf, outputconnector, solr
>             Fix For: ManifoldCF 1.10, ManifoldCF 2.2
>
>
> When using the Solr output connection, I would like to be able to store the 
> original file size (in bytes) of the indexed documents into Solr so that it 
> could be displayed in the search results or it could even be used for 
> searching later on.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
