Re: [ANNOUNCE] Apache Drill 1.17.0 Released

2019-12-30 Thread Aman Sinha
Congratulations on a great release ! Seems quite feature rich. Also thanks to Volodymyr for shepherding the release even during the holiday season. On Fri, Dec 27, 2019 at 2:34 AM Arina Yelchiyeva wrote: > Congrats everyone, great job! > > Kind regards, > Arina > > > On 26 Dec 2019, at 20:32,

Re: Drill Resources Information

2019-10-14 Thread Aman Sinha
Hi Charles, Resource provisioning is a broad area and workload specific but perhaps the following presentations and doc links might help: [1] https://www.slideshare.net/MapRTechnologies/putting-apache-drill-into-production [2] MapR specific but the concepts should be generally applicable :

[jira] [Created] (DRILL-7391) Wrong result when doing left outer join on CSV table

2019-09-30 Thread Aman Sinha (Jira)
Aman Sinha created DRILL-7391: - Summary: Wrong result when doing left outer join on CSV table Key: DRILL-7391 URL: https://issues.apache.org/jira/browse/DRILL-7391 Project: Apache Drill Issue

Re: Anybody at Apachecon Vegas

2019-09-11 Thread Aman Sinha
Yes, I am here today. See you guys soon. -Aman On Tue, Sep 10, 2019 at 9:59 PM Ted Dunning wrote: > I am here. So is Ellen. I think Aman as well. Come to the Drill track > tomorrow. Both Ellen and I have talks. > > > > On Tue, Sep 10, 2019 at 2:16 PM Naresh Bhat > wrote: > > > Hi Guys, > >

Re: [ANNOUNCE] New PMC Chair of Apache Drill

2019-08-23 Thread Aman Sinha
Congratulations Charles ! And thank you Arina ! -Aman On Thu, Aug 22, 2019 at 9:11 PM Divya Gehlot wrote: > Congratulations Charles ! > Looking forward for much better Drill and more addition in your book as > well :) > > Thanks , > Divya > > On Fri, 23 Aug 2019 at 12:07 PM, Bhargava

Re: August Apache Drill board report

2019-08-08 Thread Aman Sinha
Thanks for putting this together Arina. One minor comment is that for a future release do we need to mention the feature set ? Typically we would enumerate those in the next board report after the release has happened. Aman On Thu, Aug 8, 2019 at 10:00 AM Sorabh Hamirwasia wrote: > Hi Arina,

Re: Apache Drill Hangout July 23rd

2019-07-23 Thread Aman Sinha
arm and > overslept. Now I can't join the meeting, maybe it has finished. I will > issue the ParallelHashJoin PR recently. > > On Tue, Jul 23, 2019 at 10:14 AM Aman Sinha wrote: > > > Hi Drillers, > > > > We will have our bi-weekly hangout tomorrow, July 23rd, at 10 AM PST >

Re: [ANNOUNCE] New Committer: Bohdan Kazydub

2019-07-23 Thread Aman Sinha
Congratulations Bohdan and thanks much for your contributions ! On Tue, Jul 23, 2019 at 3:13 AM Igor Guzenko wrote: > Congratulations, Bohdan! Great job !!! > > On Tue, Jul 23, 2019 at 12:33 PM Volodymyr Vysotskyi > > wrote: > > > Congratulations, Bohdan! Thanks for your contributions! > > > >

Re: [ANNOUNCE] New Committer: Igor Guzenko

2019-07-23 Thread Aman Sinha
Congratulations Igor and thanks for your contributions to Drill ! On Tue, Jul 23, 2019 at 3:33 AM Anton Gozhiy wrote: > Congratulations Igor, well deserved! > > On Tue, Jul 23, 2019, 12:31 Volodymyr Vysotskyi > wrote: > > > Congratulations, Ihor! Thanks for your contributions! > > > > Kind

Apache Drill Hangout July 23rd

2019-07-22 Thread Aman Sinha
Hi Drillers, We will have our bi-weekly hangout tomorrow, July 23rd, at 10 AM PST (link: https://meet.google.com/yki-iqdf-tai ). If there are any topics you would like to discuss during the hangout please respond to this email. I believe last time Weijie mentioned he could talk about the hash

Re: testing maprdb pluging

2019-05-30 Thread Aman Sinha
Regarding the unit tests, for some reason maven requires full package name for the plugin test suite. You can do this: (assuming you are running on Linux) 1. do the clean build 2. cd contrib/format-maprdb 3. mvn test -Dtest=com.mapr.drill.maprdb.tests.MaprDBTestsSuite -Pmapr

Re: adding insert

2019-05-28 Thread Aman Sinha
would want to range-partition the rows based on the tablet rowid ranges such that rows belonging to the same tablet are somewhat 'grouped together' and 2 minor fragments in Drill don't try to write to the same tablet. Aman On Tue, May 28, 2019 at 12:50 PM Aman Sinha wrote: > Yes, Calc
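The range-partitioning idea in this message can be illustrated with a minimal sketch. It is not Drill or Kudu code: the split points, the `tablet_index` and `range_partition` helpers, and the sample rows are all hypothetical, and the sketch only assumes that each tablet owns a contiguous range of row keys.

```python
from bisect import bisect_right

def tablet_index(row_key, split_points):
    """Return the index of the tablet whose key range contains row_key."""
    # bisect_right sends a key equal to a split point to the tablet on its right.
    return bisect_right(split_points, row_key)

def range_partition(rows, split_points):
    """Group (key, value) rows by tablet so each group can go to a single writer."""
    groups = {}
    for key, value in rows:
        groups.setdefault(tablet_index(key, split_points), []).append((key, value))
    return groups

# Hypothetical layout: tablet 0 holds keys < 100, tablet 1 holds 100..199,
# tablet 2 holds keys >= 200.
split_points = [100, 200]
rows = [(5, "a"), (150, "b"), (99, "c"), (200, "d"), (101, "e")]
partitions = range_partition(rows, split_points)
```

If each group is then handed to exactly one writer, no two writers touch the same tablet, which is the property the message asks for.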

Re: adding insert

2019-05-28 Thread Aman Sinha
Yes, Calcite already supports the INSERT/UPSERT syntax. Within Drill, you would need to 'unblock' this syntax (not all of it but whatever variation we may want to support). You can take a look at DrillParserImpl.java (SqlInsert() method) which is actually a generated file from JavaCC. We would

Re: Questions about bushy join

2019-05-27 Thread Aman Sinha
Hi Weijie, As you might imagine Bushy joins have pros and cons compared to Left-deep only plans: The main pro is that they enumerate a lot more plan choices such that the planner is likely to find the optimal join order. On the other hand, there are significant cons: (a) by enumerating more join
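To give a rough sense of how much larger the search space gets, the following sketch uses the standard textbook counts for join orders over n tables: n! for left-deep trees and (2(n-1))!/(n-1)! for all ordered binary (bushy) trees, left-deep ones included. These are combinatorial upper bounds, not what Drill's planner actually enumerates; the function names are made up for illustration.

```python
from math import factorial

def left_deep_plans(n):
    """Number of left-deep join orders over n distinct tables."""
    return factorial(n)

def bushy_plans(n):
    """Number of ordered binary join trees (bushy, including left-deep) over n tables."""
    return factorial(2 * (n - 1)) // factorial(n - 1)

# Compare the two search spaces for a few join sizes.
for n in (3, 4, 5):
    print(n, left_deep_plans(n), bushy_plans(n))
```

A real planner prunes heavily (for example by avoiding cross products), so the practical gap may be smaller, but the ratio still grows quickly with the number of tables.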

Re: encouraging and cultivating new committers

2019-05-10 Thread Aman Sinha
Beam has a good set of guidelines and certainly some thought has gone into articulating these to the community. I generally agree with these and in particular the use of the word 'earnestly' in the following: - *They earnestly try to make Beam better with their contributions* - *In

[ANNOUNCE] New Committer: Jyothsna Donapati

2019-05-09 Thread Aman Sinha
The Project Management Committee (PMC) for Apache Drill has invited Jyothsna Donapati to become a committer, and we are pleased to announce that she has accepted. Jyothsna has been contributing to Drill for about 1 1/2 years. She initially contributed the graceful shutdown capability and more

[jira] [Created] (DRILL-7242) Query with range predicate hits IOBE when accessing histogram buckets

2019-05-06 Thread Aman Sinha (JIRA)
Aman Sinha created DRILL-7242: - Summary: Query with range predicate hits IOBE when accessing histogram buckets Key: DRILL-7242 URL: https://issues.apache.org/jira/browse/DRILL-7242 Project: Apache Drill

Re: May Apache Drill board report

2019-05-03 Thread Aman Sinha
+1 On Fri, May 3, 2019 at 1:40 PM Volodymyr Vysotskyi wrote: > Looks good, +1 > > > On Fri, May 3, 2019 at 23:32, Arina Ielchiieva > wrote: > > > Hi all, > > > > please take a look at the draft board report for the last quarter and let > > me know if you have any comments. > > > > Thanks, >

Re: [RESULT] [VOTE] Apache Drill Release 1.16.0 - RC2

2019-05-01 Thread Aman Sinha
Great ! Thanks for managing this release Sorabh ! On Wed, May 1, 2019 at 9:22 AM SorabhApache wrote: > Hi All, > RC2 candidate for 1.16.0 passes the voting criteria. Thanks to everyone who > has tested and voted for release candidate. The summary of voting is: > > Total Votes: 8 > 5x +1

[jira] [Created] (DRILL-7228) Histogram end points show high deviation for a sample data set

2019-04-30 Thread Aman Sinha (JIRA)
Aman Sinha created DRILL-7228: - Summary: Histogram end points show high deviation for a sample data set Key: DRILL-7228 URL: https://issues.apache.org/jira/browse/DRILL-7228 Project: Apache Drill

Re: [VOTE] Apache Drill Release 1.16.0 - RC2

2019-04-29 Thread Aman Sinha
Downloaded binary tarball on my Mac and ran in embedded mode. Verified Sorabh's release signature and the tar file's checksum Did a quick glance through maven artifacts Did some manual tests with TPC-DS Web_Sales table and ran REFRESH METADATA command against the same table Checked runtime query

[jira] [Created] (DRILL-7223) Make the timeout in TimedCallable a configurable boot time parameter

2019-04-27 Thread Aman Sinha (JIRA)
Aman Sinha created DRILL-7223: - Summary: Make the timeout in TimedCallable a configurable boot time parameter Key: DRILL-7223 URL: https://issues.apache.org/jira/browse/DRILL-7223 Project: Apache Drill

Re: [VOTE] Apache Drill Release 1.16.0 - RC1

2019-04-26 Thread Aman Sinha
>> > >>>>> On Wed, Apr 24, 2019 at 9:52 AM SorabhApache wrote: > >>>>> > >>>>>> Hi Volodymyr/Anton, > >>>>>> I can verify that I am seeing both the below issues as reported by > >>> Anton > >>>>>

Re: [VOTE] Apache Drill Release 1.16.0 - RC1

2019-04-24 Thread Aman Sinha
. > > > >> > > Tested new features of metadata caching by creating v4 cache > files > > > >> using > > > >> > > new Refresh Metadata commands and manually verified the cache > > files. > > > >> > Tried > > >

Re: [VOTE] Apache Drill Release 1.16.0 - RC1

2019-04-23 Thread Aman Sinha
On Tue, Apr 23, 2019 at 9:18 AM Volodymyr Vysotskyi > wrote: > > > Discussed with Aman and concluded that this issue is not a blocker for > the > > release. > > > > Kind regards, > > Volodymyr Vysotskyi > > > > > > On Tue, Apr 23, 2019 at 6

[jira] [Created] (DRILL-7198) Issuing a control-C in Sqlline exits the session (it does cancel the query)

2019-04-23 Thread Aman Sinha (JIRA)
Aman Sinha created DRILL-7198: - Summary: Issuing a control-C in Sqlline exits the session (it does cancel the query) Key: DRILL-7198 URL: https://issues.apache.org/jira/browse/DRILL-7198 Project: Apache

Re: [VOTE] Apache Drill Release 1.16.0 - RC1

2019-04-23 Thread Aman Sinha
Hi Vova, I added some thoughts in the DRILL-7195 JIRA. Aman On Tue, Apr 23, 2019 at 6:06 AM Volodymyr Vysotskyi wrote: > Hi all, > > I did some checks and found the following issues: > - DRILL-7195 > - DRILL-7194

[jira] [Created] (DRILL-7187) Improve selectivity estimates for range predicates when using histogram

2019-04-19 Thread Aman Sinha (JIRA)
Aman Sinha created DRILL-7187: - Summary: Improve selectivity estimates for range predicates when using histogram Key: DRILL-7187 URL: https://issues.apache.org/jira/browse/DRILL-7187 Project: Apache

[jira] [Resolved] (DRILL-3929) Support the ability to query database tables using external indices

2019-04-19 Thread Aman Sinha (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-3929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aman Sinha resolved DRILL-3929. --- Resolution: Fixed Fix Version/s: 1.15.0 This feature was done in the scope of DRILL-6381

Re: Drill Profile Management

2019-04-16 Thread Aman Sinha
This would be a great improvement (and long overdue). Thanks for working on it. I would be inclined to option #2 and perhaps add an option to drillbit startup that allows partitioning all existing profiles in a forced manner (default can be the 1000 profiles that you proposed). The option makes

Re: support Apache Drill

2019-04-14 Thread Aman Sinha
I am tagging @AnilKumar B who previously worked on the MongoDB plugin to see if he can answer your question. The issue is how 'SELECT *' is processed by the plugin. Also, in future, it would be better to name the subject of the email you post here more specifically... such as by including

Re: Query Question

2019-04-11 Thread Aman Sinha
> I thought flatten() would be the answer, however, if I flatten the columns, I get the following result: Regarding the flatten() output, this is expected because doing a 'SELECT flatten(a), flatten(b) FROM T' is equivalent to doing a cross-product of the 2 arrays. In your example, both arrays
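The cross-product behavior described here can be mimicked outside Drill with a small sketch. The row and its array contents are invented for illustration; only the statement that two flatten() calls in one SELECT behave like a cross-product comes from the message.

```python
from itertools import product

# One input row with two array columns (hypothetical data).
row = {"a": [1, 2], "b": ["x", "y", "z"]}

# Flattening both columns in the same SELECT pairs every element of a
# with every element of b, i.e. a cross-product.
flattened = [{"a": a, "b": b} for a, b in product(row["a"], row["b"])]
```

With arrays of length 2 and 3, the single input row turns into 6 output rows, which is why the result can look larger than expected.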

Re: [DISCUSS]: Hadoop 3

2019-04-03 Thread Aman Sinha
Hi Vitali, so if the *commons-logging* is removed from the banned dependency, do we expect developers to use it or will we enforce through checkstyle to use the current logging library ? What are the pros/cons of using one vs the other ? Any idea why it was in the banned dependency earlier ?

[jira] [Resolved] (DRILL-7152) Histogram creation throws exception for all nulls column

2019-04-02 Thread Aman Sinha (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-7152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aman Sinha resolved DRILL-7152. --- Resolution: Fixed Fixed in 54384a9. > Histogram creation throws exception for all nulls col

[jira] [Created] (DRILL-7152) Histogram creation throws exception for all nulls column

2019-04-02 Thread Aman Sinha (JIRA)
Aman Sinha created DRILL-7152: - Summary: Histogram creation throws exception for all nulls column Key: DRILL-7152 URL: https://issues.apache.org/jira/browse/DRILL-7152 Project: Apache Drill

Re: Parquet Metadata Caching design doc

2019-04-01 Thread Aman Sinha
For DRILL-7064 [1] I needed to add a sub-section to the design doc. I have added Section 3.6 (* 'Enhancements to ConvertCountToDirectScan rule'*) to the document. Feedback is welcome. [1] https://issues.apache.org/jira/browse/DRILL-7064 [2]

Re: [DISCUSS] 1.16.0 release

2019-03-22 Thread Aman Sinha
Hi Sorabh, for the Parquet Metadata caching improvements, we are estimating 2 more weeks for the feature development. This does not include bug fixing if any blocker bugs are discovered during functional testing. Hope that helps with setting the release cut-off date. Aman On Tue, Mar 19, 2019

[jira] [Created] (DRILL-7119) Modify selectivity calculations to use histograms

2019-03-19 Thread Aman Sinha (JIRA)
Aman Sinha created DRILL-7119: - Summary: Modify selectivity calculations to use histograms Key: DRILL-7119 URL: https://issues.apache.org/jira/browse/DRILL-7119 Project: Apache Drill Issue Type

[jira] [Created] (DRILL-7117) Support creation of histograms for numeric data types (except Decimal)

2019-03-19 Thread Aman Sinha (JIRA)
Aman Sinha created DRILL-7117: - Summary: Support creation of histograms for numeric data types (except Decimal) Key: DRILL-7117 URL: https://issues.apache.org/jira/browse/DRILL-7117 Project: Apache Drill

Re: Roadmap for Drill 2.0 and beyond?

2019-03-19 Thread Aman Sinha
Please see the discussions under the thread for Drill Developer Day 2018 .. this was held in November. Presentations for various planned projects for 2.0 and beyond were also posted on google drive. For 2.0, the Resource Manager and Drill Metastore are actively being worked on. On Tue, Mar 19,

[jira] [Created] (DRILL-7114) ANALYZE command generates warnings for stats file and materialization

2019-03-18 Thread Aman Sinha (JIRA)
Aman Sinha created DRILL-7114: - Summary: ANALYZE command generates warnings for stats file and materialization Key: DRILL-7114 URL: https://issues.apache.org/jira/browse/DRILL-7114 Project: Apache Drill

Re: [DISCUSS] Whether to create separate 2.0 branch

2019-03-03 Thread Aman Sinha
h > > > > On Wed, Feb 27, 2019 at 4:23 PM Abhishek Girish > > wrote: > > > > > My opinion would be option (a) as well. It's easier to maintain a > single > > > master branch. With a separate v2 branch, it's twice the effort to test > > > common c

Re: [DISCUSS] Whether to create separate 2.0 branch

2019-02-27 Thread Aman Sinha
My personal preference would be option (a) as much as possible until we get to a situation where it is getting too unwieldy at which point we re-evaluate. Aman On Wed, Feb 27, 2019 at 12:35 PM Aman Sinha wrote: > Hi Drill devs, > There are couple of ongoing projects - Resource M

[DISCUSS] Whether to create separate 2.0 branch

2019-02-27 Thread Aman Sinha
Hi Drill devs, There are a couple of ongoing projects - Resource Manager and the Drill Metastore - that are relatively large in scope. Intermediate PRs will be created for these (for example, there's one open for the metastore [1] and another one for the RM [2]). These don't currently break existing

Re: Interesting result for JSON parsing

2019-02-21 Thread Aman Sinha
Almost 20x faster parsing speed than the next fastest parser. Would be good to explore for Drill (needs the transition from Java to C++ parsing). On Wed, Feb 20, 2019 at 10:14 PM Ted Dunning wrote: > This is interesting. > > https://twitter.com/kellabyte/status/1098447972809900037 >

Re: BI Tool Demo on Drill

2019-02-11 Thread Aman Sinha
+1 On Mon, Feb 11, 2019 at 12:08 PM Abhishek Ravi wrote: > +1 > > On Mon, Feb 11, 2019 at 11:54 AM Pritesh Maker wrote: > > > +1 > > > > On Mon, Feb 11, 2019 at 4:40 AM Charles Givre wrote: > > > > > +1 ;-) > > > > > > > > > > On Feb 11, 2019, at 02:02, Kunal Khatua wrote: > > > > > > > > Hi

Histogram design doc

2019-01-31 Thread Aman Sinha
Hi devs, I have updated DRILL-6992 (histogram support) [1] with a design proposal. Please take a look and provide feedback either in the JIRA or in the doc (make sure to use your email login such that the comments don't show as anonymous). [1] https://issues.apache.org/jira/browse/DRILL-6992

Re: January Apache Drill board report

2019-01-31 Thread Aman Sinha
Thanks for putting this together, Arina. The Drill Developer Day and Meetup were separate events, so you can split them up. - A half day Drill Developer Day was held on Nov 14. A variety of technical design issues were discussed. - A Drill user meetup was held on the same evening. 2

Re: "Crude-but-effective" Arrow integration

2019-01-29 Thread Aman Sinha
Hi Charles, You may have seen the talk that was given on the Drill Developer Day [1] by Karthik and me ... look for the slides on 'Drill-Arrow Integration' which describes 2 high level options and what the integration might entail. Option 1 corresponds to what you and Paul are discussing in this

[jira] [Created] (DRILL-6992) Support column histogram statistics

2019-01-22 Thread Aman Sinha (JIRA)
Aman Sinha created DRILL-6992: - Summary: Support column histogram statistics Key: DRILL-6992 URL: https://issues.apache.org/jira/browse/DRILL-6992 Project: Apache Drill Issue Type: New Feature

[jira] [Resolved] (DRILL-6897) TPCH 13 has regressed

2019-01-11 Thread Aman Sinha (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-6897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aman Sinha resolved DRILL-6897. --- Resolution: Duplicate Duplicate of DRILL-6896. > TPCH 13 has regres

Re: [VOTE] Apache Drill release 1.15.0 - RC2

2018-12-27 Thread Aman Sinha
- Downloaded source from [3] onto my Linux VM, built and ran unit tests. I had to run some test suites individually but got a clean run. - Verified extraneous directory issue (DRILL-6916) is resolved - Built the source using MapR profile and ran the secondary indexing tests within mapr format

Re: [VOTE] Apache Drill release 1.15.0 - RC0

2018-12-20 Thread Aman Sinha
] https://github.com/apache/drill/pull/1579 > > Kind regards > Vitalii > > > On Wed, Dec 19, 2018 at 2:18 PM Vitalii Diravka > wrote: > > > @Aman Sinha I am investigating, which maven plugin > > causes creating this dir. > > > > I guess this sinks rc

Re: [VOTE] Apache Drill release 1.15.0 - RC0

2018-12-18 Thread Aman Sinha
@vita...@apache.org any idea why there's an extraneous directory in the source ? drwxrwxr-x vitalii/vitalii 0 2018-12-18 03:48 apache-drill-1.15.0-src/${project.*basedir*}/ drwxrwxr-x vitalii/vitalii 0 2018-12-18 03:48 apache-drill-1.15.0-src/${project.*basedir*}/src/ drwxrwxr-x

Re: [ANNOUNCE] New Committer: Salim Achouche

2018-12-17 Thread Aman Sinha
Congratulations Salim ! Thanks for your contributions ! Aman On Mon, Dec 17, 2018 at 3:20 AM Vitalii Diravka wrote: > Congratulations Salim! > Well deserved! > > Kind regards > Vitalii > > > On Mon, Dec 17, 2018 at 12:40 PM Arina Ielchiieva > wrote: > > > The Project Management Committee

Re: [ANNOUNCE] New Committer: Karthikeyan Manivannan

2018-12-07 Thread Aman Sinha
Congratulations Karthik ! Thanks for your contributions ! Aman On Fri, Dec 7, 2018 at 11:53 AM salim achouche wrote: > Congrats Karthik! > > On Fri, Dec 7, 2018 at 11:11 AM Arina Ielchiieva wrote: > > > The Project Management Committee (PMC) for Apache Drill has invited > > Karthikeyan > >

Re: Apache Drill Meetup on Nov 14th!

2018-11-20 Thread Aman Sinha
che Drill! The next meet up will be on Nov > 14th at 6:30 PM at the MapR Headquarters. > > We will have two speakers for the meetup > - Nitin Sharma @ Netflix who will talk about Netflix's Personalization > Infrastructure > - Aman Sinha @ MapR who will talk about a brand

Re: Hangout Discussion Topics

2018-11-12 Thread Aman Sinha
Since we are having the Drill Developer day on Wednesday, perhaps we can skip the hangout tomorrow ? Aman On Mon, Nov 12, 2018 at 10:13 AM Timothy Farkas wrote: > Hi All, > > Does anyone have any topics to discuss during the hangout tomorrow? > > Thanks, > Tim >

Re: Handling schema change in blocking operators

2018-11-06 Thread Aman Sinha
ode for more. Work out how > they could be resolved. You may see something I've missed, or you may > realize that the problem is just not solvable in general without an > up-front schema. > > More comments in the JIRA ticket. > > Thanks, > - Paul > > > > On Mon

Handling schema change in blocking operators

2018-11-05 Thread Aman Sinha
Hi all, While we continue to enhance the schema provision and metastore aspects in Drill, we also should explore what it means to be truly schema-less such that we can better handle {semi, un}structured data, data sitting in DBs that store JSON documents (e.g Mongo, MapR-DB). The blocking

[jira] [Created] (DRILL-6829) Handle schema change in ExternalSort

2018-11-05 Thread Aman Sinha (JIRA)
Aman Sinha created DRILL-6829: - Summary: Handle schema change in ExternalSort Key: DRILL-6829 URL: https://issues.apache.org/jira/browse/DRILL-6829 Project: Apache Drill Issue Type: New Feature

Re: [ANNOUNCE] New Committer: Hanumath Rao Maduri

2018-11-01 Thread Aman Sinha
Congratulations Hanumath ! Aman On Thu, Nov 1, 2018 at 11:39 AM Paul Rogers wrote: > Congratulations Hanu! > > - Paul > > Sent from my iPhone > > > On Nov 1, 2018, at 11:09 AM, Kunal Khatua wrote: > > > > Congratulations, Hanu! > > On 11/1/2018 11:04:58 AM, Abhishek Girish wrote: > >

Re: November Apache Drill board report

2018-11-01 Thread Aman Sinha
Docket container ==> 'Docker' November 14, 2019 ==> 2018 :) (this is wrong in email that was sent out) Rest LGTM. On Thu, Nov 1, 2018 at 6:42 AM Arina Ielchiieva wrote: > Hi all, > > please take a look at the draft board report for the last quarter and let > me know if you have any

Re: [ANNOUNCE] New Committer: Gautam Parai

2018-10-22 Thread Aman Sinha
Congratulations Gautam ! On Mon, Oct 22, 2018 at 3:00 PM Jyothsna Reddy wrote: > Congrats Gautam!! > > > > On Mon, Oct 22, 2018 at 2:01 PM Vitalii Diravka > wrote: > > > Congratulations! > > > > On Mon, Oct 22, 2018 at 10:54 PM Khurram Faraaz > wrote: > > > > > Congrats Gautam! > > > > > > On

Re: [HANGOUT] [new link] Topics for October 02 2018

2018-10-13 Thread Aman Sinha
Please use this link: https://www.slideshare.net/secret/zMZIrpM5qKV5pI I forgot the apache mailing lists block the attachments. Aman On Sat, Oct 13, 2018 at 5:20 PM Aman Sinha wrote: > On my gmail account it shows the attachment was sent. I am re-attaching > and sending. > > Ama

Re: [HANGOUT] [new link] Topics for October 02 2018

2018-10-13 Thread Aman Sinha
On my gmail account it shows the attachment was sent. I am re-attaching and sending. Aman On Sat, Oct 13, 2018 at 3:38 PM Chunhui Shi wrote: > Hi Aman, are you going to send out the slides in another email? > > Regards, > Chunhui >

Re: [HANGOUT] [new link] Topics for October 02 2018

2018-10-12 Thread Aman Sinha
Attached is a PDF version of the slides. Unfortunately, I don't have a recording. thanks, Aman On Thu, Oct 11, 2018 at 9:39 AM Pritesh Maker wrote: > Divya - anyone is welcome to join the hangout! Aman will be sharing the > slides shortly. We use Google Hangouts which doesn't have the

Re: [HANGOUT] [new link] Topics for October 02 2018

2018-09-30 Thread Aman Sinha
I can talk about the index planning and execution feature [1] that is currently in review [2]. [1] https://issues.apache.org/jira/browse/DRILL-6381 [2] https://github.com/apache/drill/pull/1466 On Fri, Sep 28, 2018 at 2:13 PM Karthikeyan Manivannan wrote: > Hi, > > We will have a Drill Hangout

Re: [ANNOUNCE] New Committer: Chunhui Shi

2018-09-28 Thread Aman Sinha
Congratulations Chunhui ! On Fri, Sep 28, 2018 at 10:46 AM Karthikeyan Manivannan < kmanivan...@mapr.com> wrote: > Congrats Chunhui! > > On Fri, Sep 28, 2018 at 10:04 AM Hanumath Rao Maduri > wrote: > > > Congratulations Chunhui. > > > > On Fri, Sep 28, 2018 at 9:26 AM Padma Penumarthy < > >

Re: [ANNOUNCE] New PMC member: Volodymyr Vysotskyi

2018-08-28 Thread Aman Sinha
Congrats Vova ! On Tue, Aug 28, 2018 at 6:43 AM Vitalii Diravka wrote: > Congrats Vova! Well deserved! > > On Tue, Aug 28, 2018, 16:04 Volodymyr Vysotskyi > wrote: > > > Thank everyone for nice words, it's a great honor for me to become a > Drill > > PMC member! > > > > Kind regards, > >

Re: [VOTE] Apache Drill release 1.14.0 - RC3

2018-08-03 Thread Aman Sinha
- Downloaded the source tarball from [2] on my Linux VM, built and ran the unit tests. 2 tests in 'TestUtf8SupportInQueryString' had errors but passed when run independently. - Downloaded the binary tarball from [2] onto my Macbook, untarred and ran Drill in embedded mode - Ran a few queries

[jira] [Created] (DRILL-6651) Compilation error in Eclipse IDE due to missing package name

2018-07-31 Thread Aman Sinha (JIRA)
Aman Sinha created DRILL-6651: - Summary: Compilation error in Eclipse IDE due to missing package name Key: DRILL-6651 URL: https://issues.apache.org/jira/browse/DRILL-6651 Project: Apache Drill

Re: [DISCUSS] 1.14.0 release

2018-07-13 Thread Aman Sinha
I would say we have to take a measured approach to this and decide on a case-by-case which issue is a show stopper. While of course we have to make every effort to avoid regression, we cannot claim that a particular release will not cause any regression. I believe there are 1+ passing tests,

[jira] [Created] (DRILL-6588) System table columns incorrectly marked as non-nullable

2018-07-09 Thread Aman Sinha (JIRA)
Aman Sinha created DRILL-6588: - Summary: System table columns incorrectly marked as non-nullable Key: DRILL-6588 URL: https://issues.apache.org/jira/browse/DRILL-6588 Project: Apache Drill

Re: Actual vectorization execution

2018-06-29 Thread Aman Sinha
Hi Weijie, the Panama project is an OpenJDK initiative, right [1] ? not Intel specific. It would be quite a bit of work to test and certify with Intel's JVM which may be still in the experimental stage. Also, you may have seen the Gandiva project for Apache Arrow which aims to improve

Re: [DISCUSSION] Travis build failures

2018-06-27 Thread Aman Sinha
Sounds good but why exclude the planning tests ? Only the TPC-H runtime tests should be excluded I think. On Wed, Jun 27, 2018 at 12:16 PM Timothy Farkas wrote: > +1 > > On Wed, Jun 27, 2018 at 10:00 AM, Vitalii Diravka < > vitalii.dira...@gmail.com > > wrote: > > > This is a topic from last

Re: Drill Hangout tomorrow 06/26

2018-06-26 Thread Aman Sinha
lanning, TestTpchExplain) to the > SlowTest category. > > Is there other solution for this issue? What are other tests are executed > very slowly? > > Kind regards > Vitalii > > > On Tue, Jun 26, 2018 at 3:34 AM Aman Sinha wrote: > > > We'll have the Drill hangout tomo

[ANNOUNCE] New PMC member: Vitalii Diravka

2018-06-26 Thread Aman Sinha
I am pleased to announce that Drill PMC invited Vitalii Diravka to the PMC and he has accepted the invitation. Congratulations Vitalii and thanks for your contributions ! -Aman (on behalf of Drill PMC)

Drill Hangout tomorrow 06/26

2018-06-25 Thread Aman Sinha
We'll have the Drill hangout tomorrow Jun 26th, 2018 at 10:00 PDT. If you have any topics to discuss, send a reply to this post or just join the hangout. ( Drill hangout link )

Re: [jira] [Created] (DRILL-6514) Document pcap format plug-in options

2018-06-20 Thread Aman Sinha
On the master branch I am not seeing the 'extensions' class variable in the reference chain shown in the Jackson error message either. There is an extensions parameter in the constructor of the EasyFormatPlugin [1] which is the base class of PcapFormatPlugin but I don't think Jackson is

[ANNOUNCE] New Committer: Padma Penumarthy

2018-06-15 Thread Aman Sinha
The Project Management Committee (PMC) for Apache Drill has invited Padma Penumarthy to become a committer, and we are pleased to announce that she has accepted. Padma has been contributing to Drill for about 1 1/2 years. She has made improvements for work-unit assignment in the parallelizer,

Re: [DISCUSS] case insensitive storage plugin and workspaces names

2018-06-12 Thread Aman Sinha
plugins table names must be case > sensitive, since under table name we imply directory / file name and their > case sensitivity depends on file system. > > Kind regards, > Arina > > On Tue, Jun 12, 2018 at 6:13 PM Aman Sinha wrote: > > > Drill is dependent on

Re: [DISCUSS] case insensitive storage plugin and workspaces names

2018-06-12 Thread Aman Sinha
Drill is dependent on the underlying file system's case sensitivity. On HDFS one can create 'hadoop fs -mkdir /tmp/TPCH' and /tmp/tpch which are separate directories. These could be set as workspace in Drill's storage plugin configuration and we would want the ability to query both. If we

Re: [Vote] Cleaning Up Old PRs

2018-06-07 Thread Aman Sinha
I haven't looked at Tim's survey yet but just wanted to respond to Dave about his experience. I can assure you that Drill committers would welcome quality contributions from you or anyone else. In the case of your PR, the reason for it getting stuck was basically what you already mentioned:

Re: [Discuss] Cleanup Old PRs

2018-05-31 Thread Aman Sinha
Sounds good Tim. At least we can clean up some of the obvious ones. On Thu, May 31, 2018 at 2:35 PM, Timothy Farkas wrote: > Hi All, > > There are a lot of open PRs. I think it would be good to close some of > them in order to identify the remaining PRs that require action to be > taken.

Re: How to generate hash code for each build side one of the hash join columns

2018-05-30 Thread Aman Sinha
rent choices to achieve the target. > >>> > >>> To make discussion more accurate, I put the generated codes of the > >>> previous > >>> setupGetBuild64Hash method here: > >>> > >>> public long getBuild64HashCodeInner(int

Re: Why remaining.isAlwaysTrue() is necessary in JoinUtils.getJoinCategory in Drill

2018-05-26 Thread Aman Sinha
> > Jianqing Fu > > ------ > From: Aman Sinha <amansi...@apache.org> > Sent: Friday, May 25, 2018, 23:16 > To: dev <dev@drill.apache.org>; 傅建庆(天池) <jianqing.f...@alibaba-inc.com> > Subject: Re: Why remaining.i

[ANNOUNCE] New Committer: Timothy Farkas

2018-05-25 Thread Aman Sinha
The Project Management Committee (PMC) for Apache Drill has invited Timothy Farkas to become a committer, and we are pleased to announce that he has accepted. Tim has become an active contributor to Drill in less than a year. During this time he has contributed to addressing flaky unit tests,

Re: Why remaining.isAlwaysTrue() is necessary in JoinUtils.getJoinCategory in Drill

2018-05-25 Thread Aman Sinha
Hi Jianqing, This happens because the ON clause of the join has a single column predicate (in addition to the join predicate). Currently, Drill does not support that regardless of equality or in-equality. Here's a simplified query's Calcite logical plan. Note that the local predicate l_suppkey =

Apache Drill board report (draft) for May 2018

2018-05-09 Thread Aman Sinha
Hi Drill Devs, the Apache board report for Drill for this quarter is due soon. Here's a draft. If you have any comments, let me know. I plan to submit by tomorrow morning. Thanks, Aman === ## Description: - Drill is a Schema-free SQL Query Engine for Hadoop, NoSQL and

[jira] [Created] (DRILL-6381) Add capability to do index based planning and execution

2018-05-02 Thread Aman Sinha (JIRA)
Aman Sinha created DRILL-6381: - Summary: Add capability to do index based planning and execution Key: DRILL-6381 URL: https://issues.apache.org/jira/browse/DRILL-6381 Project: Apache Drill Issue

[ANNOUNCE] New Committer: Sorabh Hamirwasia

2018-04-30 Thread Aman Sinha
The Project Management Committee (PMC) for Apache Drill has invited Sorabh Hamirwasia to become a committer, and we are pleased to announce that he has accepted. Over the last 1 1/2 years Sorabh's contributions have been in a few different areas. He took the lead in designing and implementing

Re: Display column data type without code

2018-04-25 Thread Aman Sinha
You can do it through SQL using typeof() function. Since there is no global schema, Drill evaluates this for each row. 0: jdbc:drill:drillbit=10.10.101.41> select n_name, typeof(n_name) as name_type, n_nationkey, typeof(n_nationkey) as nationkey_type from cp.`tpch/nation.parquet` limit 2;

Re: gitbox?

2018-04-18 Thread Aman Sinha
Yeah, let's go ahead with this. I don't quite recall Vlad's explanations in the hangout back in October, but I think we were all convinced, so I am +1. On Wed, Apr 18, 2018 at 3:42 AM, Arina Yelchiyeva < arina.yelchiy...@gmail.com> wrote: > Thanks, Parth, that would be really helpful. > > On

Re: [DISCUSS] Regarding mutator interface

2018-04-15 Thread Aman Sinha
a sequential write to the output batch as needed for the ANY_VALUE. The expectation is that case 2 is very rare because the functionality of doing the DISTINCTing is essentially satisfied by case 1. -Aman On Fri, Apr 13, 2018 at 6:34 PM, Aman Sinha <amansi...@apache.org> wrote: > Hi Pa

Re: [DISCUSS] Regarding mutator interface

2018-04-13 Thread Aman Sinha
ble-width values are stored in an Object Vector, but > those values won't survive serialization. As a result, only fixed-width > types can be updated in random order. DRILL-6087 describes this issue. > > Thanks, > - Paul > > [1] https://github.com/paul-rogers/drill/wiki/UDFs-Bac

Re: [DISCUSS] Regarding mutator interface

2018-04-11 Thread Aman Sinha
Here's some background on what Gautam is trying to do: Currently, SQL does not have a standard way to do a DISTINCT on a subset of the columns in the SELECT list. Suppose there are 2 columns: a: INTEGER b: MAP Suppose I want to only do DISTINCT on 'a' and I don't really care about the
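As a rough illustration of the semantics being discussed (DISTINCT on column 'a' while keeping some arbitrary value of 'b', which a later message in this thread refers to as ANY_VALUE), here is a minimal sketch. It is not how Drill implements it: the `distinct_on` helper and the sample rows are hypothetical, and "first value seen" is just one valid choice of "any value".

```python
def distinct_on(rows, key):
    """Keep one row per distinct value of `key`; other columns come from the first row seen."""
    first_seen = {}
    for row in rows:
        # setdefault only stores the row if this key value has not appeared yet.
        first_seen.setdefault(row[key], row)
    return list(first_seen.values())

# Column a is an INTEGER, column b is a MAP (modeled here as a dict).
rows = [
    {"a": 1, "b": {"x": 10}},
    {"a": 1, "b": {"x": 11}},
    {"a": 2, "b": {"x": 20}},
]
result = distinct_on(rows, "a")
```

Only 'a' is compared, so the MAP column never has to be hashed or checked for equality.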

Re: "Death of Schema-on-Read"

2018-04-08 Thread Aman Sinha
On Sun, Apr 8, 2018 at 10:57 AM, Ted Dunning <ted.dunn...@gmail.com> wrote: > I have been thinking about this email and I still don't understand some of > the comments. > > On Fri, Apr 6, 2018 at 5:13 PM, Aman Sinha <amansi...@apache.org> wrote: > > > On the

Re: Non-column filters in Drill

2018-04-07 Thread Aman Sinha
A better option would be to have a user-defined function that takes 2 parameters and evaluates to a boolean value. e.g select * from myTable where MyUDF(notColumn, 'value') IS TRUE; The Storage Plugin that you are developing would need to implement a pushdown rule that looks at the filter

Re: "Death of Schema-on-Read"

2018-04-06 Thread Aman Sinha
On the subject of CAST pushdown to Scans, there are potential drawbacks ... - In general, the planner will see a Scan-Project where the Project has CAST functions. But the Project can have arbitrary expressions, e.g CAST(a as INT) * 5 or a combination of 2 CAST functions or non-CAST
