Re: NullPointers in type conversions

2015-09-23 Thread Christopher Matta
Just bumping this, I’m sure other users are going to be running across this eventually and I wanted to see if what I’m experiencing is expected (even after setting ALTER SESSION SET `drill.exec.functions.cast_empty_string_to_null` = true;), or if it’s a bug. ​ Chris Matta cma...@mapr.com

Re: help with ApacheDrill and S3

2015-09-23 Thread Andries Engelbrecht
Scott, FYI: You may not want to publish your AWS key on a public list ;-) Anyway I grabbed you core-site.xml file and put it on my Mac. Started Drill $ ./drill-embedded Then created a SP (s3scott) that looked like this: { "type": "file", "enabled": true, "connection":

Re: ChannelclosedException

2015-09-23 Thread Abhishek Girish
For the said error ID, are there more messages in the log, which could give out more information? Could you also share the query & profile (full JSON) for the query? May be you put it up on gist and share a link. Someone here could take a look. Try and increase the value for

Re: repeated_contains - intended behaviour?

2015-09-23 Thread Jason Altekruse
I think it is reasonable to consider that a bug. We should implement the function both as it works today and as you were originally expecting it. Any ideas about about a good naming scheme for the two? Unfortunately the regular contains() method does substring matching, but I think the name

Drill custom aggregate functions

2015-09-23 Thread Neeraja Rentachintala
https://drill.apache.org/docs/developing-an-aggregate-function/ See the customer aggregate functions are marked as alpha and experimental usage only. What features or aspects are missing to make this a 'ready to deploy in production' capability. Appreciate response. thanks -Neeraja

Persistent Configuration Storage

2015-09-23 Thread Nathaniel Auvil
There are two options for configuring the Zookeeper pStore: HDFS and HBase. We had been using HBase as it allows us to set a TTL on the table so we can have some simple control over the size and age of historical data. Is this the only benefit to using HBase vs HDFS for this data?

Re: ChannelclosedException

2015-09-23 Thread Boris Chmiel
Hi Abhishek, It is probably due to OOM. Here is the log : [Error Id: b58b4918-3ed7-465b-8a16-1bdff8e0fc7f ] org.apache.drill.common.exceptions.UserException: SYSTEM ERROR:ChannelClosedException [Error Id: b58b4918-3ed7-465b-8a16-1bdff8e0fc7f ]    

Re: directory pruning and UDFs

2015-09-23 Thread Jacques Nadeau
Hey Stefan, Yes, this makes a lot of sense and seems reasonable. We've talked about providing the simple filename as a virtual attribute. It seems like we should also provide a full path attribute (from the root of the workspace). Can you open a JIRA for this? It isn't something that is supported

Re: Drill custom aggregate functions

2015-09-23 Thread Jacques Nadeau
Merging is the main missing thing. Drill supports building custom aggregate functions. However, those are currently run in a single thread per grouping. Generally, it is much better to do a two-phased aggregate for custom functions, however the interface doesn't yet support that functionality. --

Spotfire server doesn't play nice with Drill 1.1

2015-09-23 Thread Mike Beddo
Greetings, I downloaded apache-drill-1.1.0.tar.gz from drill.apache.org. When I run drill-embedded, the line just before the message of the day reads "apache drill 1.0.0", so I'm a little confused by the seeming mismatch. I'm running zookeeper and drillbit on a single machine (Pangolin)

Re: NullPointers in type conversions

2015-09-23 Thread Jason Altekruse
Sudheesh, What you said is true for all of the options besides this one. When it was added it was decided that users of Drill would likely want the results of their queries to be consistent for all users, so this option only takes effect at the system level. Unfortunately there is not an error

dir0 shows no result

2015-09-23 Thread Geercken, Uwe
I have used the queries below to create parquet files from 2 CSV files: create table dfs.datatransfer.`ct_fremde/2015/07` as select to_timestamp(columns[0],'dd.MM.') as Datum, columns[1] as Airline_In, columns[2] as Trip_In, columns[3] as Ac_Typ, columns[4] as Ordertype,

Re: NullPointers in type conversions

2015-09-23 Thread Jason Altekruse
I actually noticed while I was testing this out that it is not working with casts to date, I will be opening a JIRA for it. That being said, I did find a unit test for integer, so this should be working. It is possible that the data in the column is actually a series of spaces instead of the empty

Re: NullPointers in type conversions

2015-09-23 Thread Christopher Matta
This system setting was set, yet I still experience the NumberFormatException with MapR-DB when a column contains a NULL value: 0: jdbc:drill:> select cast(x.`row_key` as varchar(128)) as `row_key`, CAST(x.`a`.`c1` as varchar(64)) from maprfs.cmatta.`cmatta_test` . . . . . . . > x;

developer for help

2015-09-23 Thread peng891...@163.com
Hi: I am a developer, and i'm interested in calcite-core-1.1.0-drill-r14.jar of drill-1.1.0. But i don't know how to get the source code of this jar, can you give me some help to get the source code? Best wishes!

Re: ChannelclosedException

2015-09-23 Thread Carrot Hu -CIC
Hi everyone, I have seen the same exception. It happens when running concurrent queries on the same table with a relatively large LIMIT size (5,000,000 ~ 10,000,000), using JDBC. The exception was found in both sqline session and jdbc, and on different storage plugins (local

Re: Spotfire server doesn't play nice with Drill 1.1

2015-09-23 Thread Andries Engelbrecht
Mike, Which machine does TSS run on and which machine does Squirrel run on? Perhaps describe your environment a bit more, i.e. what packages are running on what machines. —Andries > On Sep 23, 2015, at 11:47 AM, Mike Beddo wrote: > > Greetings, > > I

Drill WITH clause syntax

2015-09-23 Thread Neeraja Rentachintala
Team Is this a valid Drill syntax (i.e 2 tables in with clause). WITH X1 AS (SELECT city, AVG(review_count) AS city_reviews_avg FROM `business.json` GROUP BY city), X2 AS (SELECT X1.city, X1.city_reviews_avg, MAX(X1.city_reviews_avg) OVER () AS city_reviews_avg_max FROM X1) SELECT X2.city,

Re: Drill custom aggregate functions

2015-09-23 Thread Ted Dunning
The other big missing bit is that the working variables can't be complex data. That means that I can't write: - my own form of count unique - approximate aggregates like hyper-log-log, tdigest - anything that constructs complex output like list_aggregate This is just as bad as the lack of

Re: Drill WITH clause syntax

2015-09-23 Thread Jinfeng Ni
Is it something related to the json file you used? I tried the following query, replacing "business.json' with cp.`employee.json`. It works fine. (no row returned, which is expected). 0: jdbc:drill:zk=local> WITH X1 . . . . . . . . . . . > AS . . . . . . . . . . . > (SELECT city, . . . . . . . .

Re: Drill WITH clause syntax

2015-09-23 Thread Abhishek Girish
Works for me. WITH X1 AS (SELECT city, AVG(review_count) AS city_reviews_avg FROM `yelp_academic_dataset_business.json` GROUP BY city), X2 AS (SELECT X1.city, X1.city_reviews_avg, MAX(X1.city_reviews_avg) OVER () AS city_reviews_avg_max FROM X1) SELECT X2.city, X2.city_reviews_avg FROM X2 WHERE

Re: Drill WITH clause syntax

2015-09-23 Thread Daniel Barclay
I am hitting this error. *Error: SYSTEM ERROR: NoSuchFieldError: constants* That looks like a low-level Java error caused by a version mismatch, as if one class is referring to a member field named "constants" in another class, which had the field when the first class was compiled, but field