Regression? Drill Truncating Varchars

2019-01-22 Thread Charles Givre
Hello all, I’m working on a format plugin to read syslog data, and have encountered what seems to be a bit of a regression (maybe). The code below is a helper function which writes strings from the data. As of Drill 1.16, the varchar holder seems to throw an error if the string you are

[GitHub] weijietong commented on a change in pull request #1600: DRILL-6947: fix RuntimeFilter memory leak

2019-01-22 Thread GitBox
weijietong commented on a change in pull request #1600: DRILL-6947: fix RuntimeFilter memory leak URL: https://github.com/apache/drill/pull/1600#discussion_r250042986 ## File path:

[GitHub] weijietong commented on a change in pull request #1600: DRILL-6947: fix RuntimeFilter memory leak

2019-01-22 Thread GitBox
weijietong commented on a change in pull request #1600: DRILL-6947: fix RuntimeFilter memory leak URL: https://github.com/apache/drill/pull/1600#discussion_r250042732 ## File path: exec/java-exec/src/main/java/org/apache/drill/exec/ops/FragmentContextImpl.java ## @@

[GitHub] vvysotskyi opened a new pull request #1619: DRILL-6910: Allow applying DrillPushProjectIntoScanRule at physical phase

2019-01-22 Thread GitBox
vvysotskyi opened a new pull request #1619: DRILL-6910: Allow applying DrillPushProjectIntoScanRule at physical phase URL: https://github.com/apache/drill/pull/1619 - Enhance `DrillPushProjectIntoScanRule` to be applied at the physical stage; - Avoid recreating `TableScan` rel nodes

[jira] [Created] (DRILL-6994) TIMESTAMP type DOB column in Spark parquet is treated as VARBINARY in Drill

2019-01-22 Thread Khurram Faraaz (JIRA)
Khurram Faraaz created DRILL-6994: - Summary: TIMESTAMP type DOB column in Spark parquet is treated as VARBINARY in Drill Key: DRILL-6994 URL: https://issues.apache.org/jira/browse/DRILL-6994 Project:

[GitHub] sohami commented on a change in pull request #1600: DRILL-6947: fix RuntimeFilter memory leak

2019-01-22 Thread GitBox
sohami commented on a change in pull request #1600: DRILL-6947: fix RuntimeFilter memory leak URL: https://github.com/apache/drill/pull/1600#discussion_r249937518 ## File path: exec/java-exec/src/main/java/org/apache/drill/exec/physical/impl/join/HashJoinBatch.java ## @@

[GitHub] sohami commented on a change in pull request #1600: DRILL-6947: fix RuntimeFilter memory leak

2019-01-22 Thread GitBox
sohami commented on a change in pull request #1600: DRILL-6947: fix RuntimeFilter memory leak URL: https://github.com/apache/drill/pull/1600#discussion_r249888648 ## File path: exec/java-exec/src/main/java/org/apache/drill/exec/ops/FragmentContextImpl.java ## @@ -530,11

[GitHub] sohami commented on a change in pull request #1600: DRILL-6947: fix RuntimeFilter memory leak

2019-01-22 Thread GitBox
sohami commented on a change in pull request #1600: DRILL-6947: fix RuntimeFilter memory leak URL: https://github.com/apache/drill/pull/1600#discussion_r249937947 ## File path: exec/java-exec/src/main/java/org/apache/drill/exec/physical/impl/filter/RuntimeFilterRecordBatch.java

[jira] [Created] (DRILL-6993) VARBINARY length is ignored on cast

2019-01-22 Thread Bohdan Kazydub (JIRA)
Bohdan Kazydub created DRILL-6993: - Summary: VARBINARY length is ignored on cast Key: DRILL-6993 URL: https://issues.apache.org/jira/browse/DRILL-6993 Project: Apache Drill Issue Type: Bug

[jira] [Created] (DRILL-6992) Support column histogram statistics

2019-01-22 Thread Aman Sinha (JIRA)
Aman Sinha created DRILL-6992: - Summary: Support column histogram statistics Key: DRILL-6992 URL: https://issues.apache.org/jira/browse/DRILL-6992 Project: Apache Drill Issue Type: New Feature

Re: Good DB theory references

2019-01-22 Thread rahul challapalli
The redbook [1] deserves a mention. It also has a chapter (collection of papers) dedicated to query optimization [2]. [1] http://www.redbook.io/ [2] http://www.redbook.io/ch7-queryoptimization.html On Tue, Jan 22, 2019 at 4:16 AM Joel Pfaff wrote: > Hello, > > Thanks for this initiative. > I

[GitHub] arina-ielchiieva commented on issue #1618: DRILL-6950: Row set-based scan framework

2019-01-22 Thread GitBox
arina-ielchiieva commented on issue #1618: DRILL-6950: Row set-based scan framework URL: https://github.com/apache/drill/pull/1618#issuecomment-456389823 @paul-rogers thanks for making the PR. I'll start review shortly. This

Re: Good DB theory references

2019-01-22 Thread Joel Pfaff
Hello, Thanks for this initiative. I have found a couple of years ago this page of link from Reynold Xin: https://github.com/rxin/db-readings And it is full of nice things. Regards, Joel On Tue, Jan 22, 2019 at 9:01 AM weijie tong wrote: > Hi Paul: > Thanks for the sharing. I would like to

[GitHub] ihuzenko commented on issue #1613: DRILL-6977: Improve Hive tests configuration

2019-01-22 Thread GitBox
ihuzenko commented on issue #1613: DRILL-6977: Improve Hive tests configuration URL: https://github.com/apache/drill/pull/1613#issuecomment-456377236 @arina-ielchiieva thanks for detailed review, the final changes is much better now.

[GitHub] arina-ielchiieva commented on issue #1613: DRILL-6977: Improve Hive tests configuration

2019-01-22 Thread GitBox
arina-ielchiieva commented on issue #1613: DRILL-6977: Improve Hive tests configuration URL: https://github.com/apache/drill/pull/1613#issuecomment-456376028 +1, thanks for making the changes. This is an automated message

[GitHub] ihuzenko commented on issue #1613: DRILL-6977: Improve Hive tests configuration

2019-01-22 Thread GitBox
ihuzenko commented on issue #1613: DRILL-6977: Improve Hive tests configuration URL: https://github.com/apache/drill/pull/1613#issuecomment-456367440 @arina-ielchiieva I've updated the PR since last comment, could you review changes again please ?

[jira] [Created] (DRILL-6991) Kerberos ticket is being dumped in the log if log level is "debug" for stdout

2019-01-22 Thread Anton Gozhiy (JIRA)
Anton Gozhiy created DRILL-6991: --- Summary: Kerberos ticket is being dumped in the log if log level is "debug" for stdout Key: DRILL-6991 URL: https://issues.apache.org/jira/browse/DRILL-6991 Project:

[GitHub] ihuzenko commented on issue #1613: DRILL-6977: Improve Hive tests configuration

2019-01-22 Thread GitBox
ihuzenko commented on issue #1613: DRILL-6977: Improve Hive tests configuration URL: https://github.com/apache/drill/pull/1613#issuecomment-456327285 Running eight derivative tests from HiveTestBase in IDE from master and this branch I figured out that execution time in this branch was

[GitHub] ihuzenko commented on a change in pull request #1613: DRILL-6977: Improve Hive tests configuration

2019-01-22 Thread GitBox
ihuzenko commented on a change in pull request #1613: DRILL-6977: Improve Hive tests configuration URL: https://github.com/apache/drill/pull/1613#discussion_r249695262 ## File path: contrib/storage-hive/core/src/test/java/org/apache/drill/exec/hive/HiveTestFixture.java ##

Re: Good DB theory references

2019-01-22 Thread weijie tong
Hi Paul: Thanks for the sharing. I would like to share another good latest paper here "Everything you always wanted to know about compiled and vectorized queries but were afraid to ask" : http://www.vldb.org/pvldb/vol11/p2209-kersten.pdf It explains the two kind of database execution