Hangout Topics For 05/31/16

2016-05-27 Thread Sudheesh Katkam
Hey y’all, Let’s use this thread to pre-populate a list of topics to discuss on Tuesday’s hangout (05/31/16), so people can attend if they are interested in the mentioned topics. I will also collect topics at the beginning of the hangout. + DRILL-4280: Kerberos Authentication (Sudheesh) Thank

Re: Hash Aggregate Memory usage

2016-05-27 Thread Jacques Nadeau
There was a presentation a year or so ago I presented at the MapR sales kickoff that covers the memory characteristics of operators. Unfortunately, I don't have access to the content but hopefully someone internal to MapR should have it. (Maybe Ellen or Neeraja) Approximately (from memory):

Integration with Spark

2016-05-27 Thread Zhenrui(Jerry) Zhang
Hi, Does anyone has any updates on the integration with Spark. The feature mentioned in https://drill.apache.org/blog/2014/12/16/whats-coming-in-2015/ and http://www.slideshare.net/SparkSummit/adding-complex-data-to-spark-stackneeraja-rentachintala ? Also there is an issue opened in JIRA(

Re: Hash Aggregate Memory usage

2016-05-27 Thread Aman Sinha
Rahul, can you send me the query profile separately ? Also, can you try group-by on fixed-width columns instead of Varchar ? With single group, the hash table itself should be consuming relatively small amount of memory. On Fri, May 27, 2016 at 11:14 AM, Zelaine Fong wrote:

Re: Hash Aggregate Memory usage

2016-05-27 Thread Zelaine Fong
My guess would be that for hashing, a hash table is pre-allocated based on the number of keys in the hash. That would explain why with more keys, the memory usage grows. But that's just my guess. Someone who really understands how this works should chime in :). -- Zelaine On Fri, May 27, 2016

Re: Hash Aggregate Memory usage

2016-05-27 Thread rahul challapalli
Any inputs on this one? On Wed, May 25, 2016 at 7:51 PM, rahul challapalli < challapallira...@gmail.com> wrote: > Its using hash aggregation. > On May 25, 2016 7:48 PM, "Zelaine Fong" wrote: > >> What does the explain plan show? I.e., is the group by being done via a >>

[GitHub] drill pull request: DRILL-4618: Fix hive function loader not corre...

2016-05-27 Thread StevenMPhillips
Github user StevenMPhillips commented on a diff in the pull request: https://github.com/apache/drill/pull/509#discussion_r64940411 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/expr/EvaluationVisitor.java --- @@ -811,7 +811,7 @@ public HoldingContainer

[jira] [Created] (DRILL-4698) Cant read parquet partitioning from Azure Blob Storage

2016-05-27 Thread Roberto Grandi (JIRA)
Roberto Grandi created DRILL-4698: - Summary: Cant read parquet partitioning from Azure Blob Storage Key: DRILL-4698 URL: https://issues.apache.org/jira/browse/DRILL-4698 Project: Apache Drill

[GitHub] drill pull request: DRILL-4690: CORS in REST API

2016-05-27 Thread PythonicNinja
Github user PythonicNinja commented on the pull request: https://github.com/apache/drill/pull/507#issuecomment-222073117 I have updated PR according to your and laurentgo ideas. @hnfgns: Can you check second round of review? --- If your project is set up for it, you can reply to