Re: UnsupportedOperationException: Unsupported type LIST

2018-06-11 Thread Paul Rogers
Hi Divya, Drill will use a (non-repeated) List if it sees a 2D or higher array. Looks like the "Coordinates" field is a 2D array, so Drill will use a List. That the List type is unsupported means that, somewhere else in Drill, there is an operator that does not support the List type. That is

Re: explain - Polygon shapes and MultiPolygon shapes

2018-06-11 Thread Paul Rogers
Hi Divya, You mentioned combining Polygon shapes with Multipolygon shapes. JSON allows objects with distinct structures: a Polygon, say, in one record, a MultiPolygon in another. However, Drill has to convert the JSON to a flat SQL-like structure. Yes, Drill allows "Maps" (Structs). But all

Re: Hangout tomorrow (get your tickets now)

2018-06-11 Thread Timothy Farkas
Hi, I'd like to give the presentation for the resource management proposal. Thanks, Tim From: Parth Chandra Sent: Monday, June 11, 2018 5:02:51 PM To: dev; user@drill.apache.org Subject: Hangout tomorrow (get your tickets now) We'll have the Drill hangout

Hangout tomorrow (get your tickets now)

2018-06-11 Thread Parth Chandra
We'll have the Drill hangout tomorrow Jun12th, 2018 at 10:00 PDT. If you have any topics to discuss, send a reply to this post or just join the hangout. ( Drill hangout link ) Thanks Parth

Re: UnsupportedOperationException: Unsupported type LIST

2018-06-11 Thread Kunal Khatua
I'm wondering if this JIRA is related the issue you are facing: https://issues.apache.org/jira/browse/DRILL-2241 On 6/11/2018 9:44:32 AM, Kunal Khatua wrote: Can you share the stack trace as well? On 6/11/2018 3:12:07 AM, Divya Gehlot wrote: Hi , I am trying to convert the complex json format

Re: UnsupportedOperationException: Unsupported type LIST

2018-06-11 Thread Kunal Khatua
Can you share the stack trace as well? On 6/11/2018 3:12:07 AM, Divya Gehlot wrote: Hi , I am trying to convert the complex json format to Parquet and I am getting below erro : SYSTEM ERROR: UnsupportedOperationException: Unsupported type LIST Fragment 0:0 JSON File format as below : {

RE: Which perform better JSON or convert JSON to parquet format ?

2018-06-11 Thread Lee, David
Parquet is faster especially if you are only looking for a subset of json objects. Every JSON key / array is treated as a column. With that said creating parquet from JSON is not bullet proof if you have really complex json which may have NULL values or many optional keys (Drill can't figure

Re: Which perform better JSON or convert JSON to parquet format ?

2018-06-11 Thread Ted Dunning
Yes. Drill is good at JSON. But Parquet will be faster during a scan. Faster may be better. Or other things may be more important. You have to decide what is important to you. The great virtue of drill is that you have the choice. On Mon, Jun 11, 2018 at 11:06 AM Divya Gehlot wrote: >

UnsupportedOperationException: Unsupported type LIST

2018-06-11 Thread Divya Gehlot
Hi , I am trying to convert the complex json format to Parquet and I am getting below erro : SYSTEM ERROR: UnsupportedOperationException: Unsupported type LIST Fragment 0:0 JSON File format as below : { "results":[ { "_id":"5b1a3374fb4365076b4d48a8",

Re: Which perform better JSON or convert JSON to parquet format ?

2018-06-11 Thread Divya Gehlot
Thanks to all for your opinions ! As Drill has been popularised as complex JSON reader as compare to other tools in space . Was wondering does drill works better for JSON rather than parquet.

Re: Which perform better JSON or convert JSON to parquet format ?

2018-06-11 Thread Ted Dunning
I am going to play the contrarian here. Parquet is not *always* faster than JSON. The (almost unique) case where it is better to leave data as JSON (or whatever) is when the average number of times that a file is read is equal to or less than roughly 1. The point is that to convert read the

explain - Polygon shapes and MultiPolygon shapes

2018-06-11 Thread Divya Gehlot
Hi, Can anybody help me by explaining below : Drill cannot handle the intermixed Polygon shapes and MultiPolygon shapes. Workaround: None, per se, but if you avoid querying the multi-polygon lines (120 of them), Drill works fine on the entire remainder Thanks, Divya