Matt Keranen created DRILL-6873:
-----------------------------------

             Summary: Cluster without dfs throws DATA_READ ERROR file does not exist
                 Key: DRILL-6873
                 URL: https://issues.apache.org/jira/browse/DRILL-6873
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - JSON
    Affects Versions: 1.14.0
         Environment: Drill v1.14.0, Zookeeper 3.4.13, CentOS 7.5
            Reporter: Matt Keranen


Drillbits are running on multiple servers with Zookeeper but without HDFS. 
When file storage is configured to a common path, but not every file is 
present on every node, queries fail with:

    Error: DATA_READ ERROR: Failure reading JSON file - File file:/localdata/logs/fileX.json.gz does not exist
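
To make the triggering condition concrete, here is a minimal sketch (not part 
of Drill) that reports which files are absent on some nodes, given a 
directory listing per node. The function name, the node names, and 
fileY.json.gz are hypothetical; only fileX.json.gz comes from the error above.

```python
from typing import Dict, List, Set


def files_missing_on_some_nodes(listings: Dict[str, Set[str]]) -> Dict[str, List[str]]:
    """Map each file name to the sorted list of nodes that do not have it.

    `listings` maps a node name to the set of file names found under the
    common storage path on that node. Files present everywhere are omitted.
    """
    all_files: Set[str] = set().union(*listings.values()) if listings else set()
    missing: Dict[str, List[str]] = {}
    for name in sorted(all_files):
        absent = sorted(node for node, files in listings.items() if name not in files)
        if absent:
            missing[name] = absent
    return missing


# Hypothetical listings of /localdata/logs on two nodes.
example = {
    "node1": {"fileX.json.gz", "fileY.json.gz"},
    "node2": {"fileY.json.gz"},
}
print(files_missing_on_some_nodes(example))  # {'fileX.json.gz': ['node2']}
```

Any file reported here is one that exists on only part of the cluster, which 
is the situation in which the error above is seen.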

Example use case: querying log files that live on multiple machines, joined as 
a ZK cluster, directly from their local filesystems, without first moving them 
to a distributed file system (which may not be in use).

Is there a (planned) configuration option to simply skip filenames that exist 
on some but not all nodes?



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
