Hi,
  I¹ve had pain in the same ares.

These tickets are relevant to querying json with other extensions:
https://issues.apache.org/jira/browse/DRILL-1871

https://issues.apache.org/jira/browse/DRILL-1545


There have also been other fixes around compressed JSON failures, e.g.:
https://issues.apache.org/jira/browse/DRILL-1960


All but DRILL-1545 are fixed on the master branch.

Best Regards,
Paul



On 3/2/15, 1:59 PM, "Daniil Osipov" <[email protected]> wrote:

>Thanks Sudhakar, I'll give this a try. Can you point me to some
>documentation about extension/type handling? The actual files I'm trying
>to
>query are compressed JSON, and have an extension .gz
>
>
>On Mon, Mar 2, 2015 at 10:49 AM, Sudhakar Thota <[email protected]>
>wrote:
>
>> Daniil,
>>
>> Please try doing these 2 things and check one more time again.
>>
>> 1. Name that file to clean.json.
>> 2. Issue this statement before you run your query.
>>
>> alter system set `store.json.all_text_mode` = true
>>
>>
>> Thanks
>> Sudhakar Thota
>> [email protected]
>> www.mapr.com
>> Now Available - Free Hadoop On-Demand Training
>>
>> On Mar 2, 2015, at 10:03 AM, Daniil Osipov <[email protected]>
>> wrote:
>>
>> > I'm continuing exploration of accessing files on S3, and running into
>> this
>> > issue:
>> > 0: jdbc:drill:> *SELECT COUNT(1) FROM
>> > s3n.root.`/dev/dan/cleaned_1210/clean.txt`;*
>> > *Query failed: Query failed: Failure validating SQL.
>> > org.eigenbase.util.EigenbaseContextException: From line 1, column 22
>>to
>> > line 1, column 24: Table 's3n.root./dev/dan/cleaned_1210/clean.txt'
>>not
>> > found*
>> >
>> > *Error: exception while executing query: Failure while executing
>>query.
>> > (state=,code=0)*
>> >
>> > At the same time:
>> > 0: jdbc:drill:> *show files from
>>s3n.`dev/dan/cleaned_1210/clean.txt`;*
>> >
>>
>>*+------------+-------------+------------+------------+------------+-----
>>-------+-------------+------------+------------------+*
>> > *|    name    | isDirectory |   isFile   |   length   |   owner    |
>> > group    | permissions | accessTime | modificationTime |*
>> >
>>
>>*+------------+-------------+------------+------------+------------+-----
>>-------+-------------+------------+------------------+*
>> > *| clean.txt  | false       | true       | 1313500    |            |
>> >     | rw-rw-rw-   | 1970-01-01 00:00:00.0 | 2014-12-10 23:51:59.0 |*
>> >
>>
>>*+------------+-------------+------------+------------+------------+-----
>>-------+-------------+------------+------------------+*
>> > *1 row selected (0.53 seconds)*
>> >
>> > My storage config is below. Any suggestions on what could be wrong, or
>> how
>> > to debug this error?
>> >
>> > {
>> >  "type": "file",
>> >  "enabled": true,
>> >  "connection": "s3n://bucket-name",
>> >  "workspaces": {
>> >    "root": {
>> >      "location": "/",
>> >      "writable": false,
>> >      "defaultInputFormat": null
>> >    },
>> >    "tmp": {
>> >      "location": "file:///tmp",
>> >      "writable": true,
>> >      "defaultInputFormat": null
>> >    }
>> >  },
>> >  "formats": {
>> >    "psv": {
>> >      "type": "text",
>> >      "extensions": [
>> >        "tbl"
>> >      ],
>> >      "delimiter": "|"
>> >    },
>> >    "csv": {
>> >      "type": "text",
>> >      "extensions": [
>> >        "csv"
>> >      ],
>> >      "delimiter": ","
>> >    },
>> >    "tsv": {
>> >      "type": "text",
>> >      "extensions": [
>> >        "tsv"
>> >      ],
>> >      "delimiter": "\t"
>> >    },
>> >    "parquet": {
>> >      "type": "parquet"
>> >    },
>> >    "json": {
>> >      "type": "json"
>> >    }
>> >  }
>> > }
>>
>>

This email and any attachments may contain confidential and proprietary 
information of Blackboard that is for the sole use of the intended recipient. 
If you are not the intended recipient, disclosure, copying, re-distribution or 
other use of any of this information is strictly prohibited. Please immediately 
notify the sender and delete this transmission if you received this email in 
error.

Reply via email to