[ 
https://issues.apache.org/jira/browse/CONNECTORS-681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

konrad updated CONNECTORS-681:
------------------------------

    Description: 
When crawling a JCIFS repository to ElasticSearch, 
SharedDriveConnector.java#checkIngest() always returns false. As a result, 
nothing is indexed in ElasticSearch.

This seems to happen because the localFile's extension is 'tmp' rather than the 
original extension. The line
{noformat}
//SharedDriveConnector.java[1842]
isIndexable = activities.checkDocumentIndexable(localFile); 
{noformat}
always evaluates to 'false'. Adding 'tmp' to the ElasticSearch connector's 
list of 'allowed file extensions' changes that.
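
The suspected mechanism can be sketched in isolation. The following is only a 
minimal illustration, not ManifoldCF code: the class TmpExtensionSketch, the 
methods extensionOf() and isIndexable(), the allow-list, and the file names are 
all hypothetical. It assumes the output connector decides by file extension and 
that the local copy is a temp file created without an explicit suffix, in which 
case java.io.File.createTempFile() uses ".tmp".

```java
import java.io.File;
import java.io.IOException;
import java.util.Locale;
import java.util.Set;

public class TmpExtensionSketch {

    // Hypothetical helper: returns the lower-cased extension, or "" if there is none.
    public static String extensionOf(String fileName) {
        int dot = fileName.lastIndexOf('.');
        if (dot < 0 || dot == fileName.length() - 1) {
            return "";
        }
        return fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
    }

    // Hypothetical stand-in for an extension allow-list check; the real
    // checkDocumentIndexable() may work differently.
    public static boolean isIndexable(String fileName, Set<String> allowedExtensions) {
        return allowedExtensions.contains(extensionOf(fileName));
    }

    public static void main(String[] args) throws IOException {
        Set<String> allowed = Set.of("pdf", "doc", "txt"); // assumed allow-list
        String remoteName = "report.pdf";                  // hypothetical remote document

        // With a null suffix, File.createTempFile() names the file "<prefix>....tmp".
        File localFile = File.createTempFile("sketch", null);
        try {
            // Checking the original name passes the allow-list.
            System.out.println(remoteName + " -> " + isIndexable(remoteName, allowed));
            // Checking the local temp copy fails, because its extension is 'tmp'.
            System.out.println(localFile.getName() + " -> "
                + isIndexable(localFile.getName(), allowed));
        } finally {
            localFile.delete();
        }
    }
}
```

If this is indeed what happens, checking the original file name instead of the 
local temp file's name would avoid having to allow 'tmp' in the connector 
configuration.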


  was:
When crawling a JCIFS repository, SharedDriveConnector.java#checkIngest() 
always returns false. The result of that is, that nothing will be indexed in 
ElasticSearch.

This seems to happen, since the localFile's extension is 'tmp', rather than the 
original extension. The line
{noformat}
//SharedDriveConnector.java[1842]
isIndexable = activities.checkDocumentIndexable(localFile); 
{noformat}
always evaluates 'false'. Adding 'tmp' to the list of the ElasticSearch 
Connectors 'allowed file extensions' changes that.


    
> Crawling JCIFS to ElasticSearch does not index due to 'tmp' file extensions
> ---------------------------------------------------------------------------
>
>                 Key: CONNECTORS-681
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-681
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Elastic Search connector, JCIFS connector
>    Affects Versions: ManifoldCF 1.2
>            Reporter: konrad
