date:20150406

[jira] [Updated] (DRILL-2419) UDF that returns string representation of expression type

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2419:
-
Fix Version/s: (was: 1.0.0)
   1.1.0

 UDF that returns string representation of expression type
 -

 Key: DRILL-2419
 URL: https://issues.apache.org/jira/browse/DRILL-2419
 Project: Apache Drill
  Issue Type: Improvement
  Components: Functions - Drill
Reporter: Victoria Markman
Assignee: Mehant Baid
 Fix For: 1.1.0


 Suggested name: typeof (credit goes to Aman)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (DRILL-2487) Schema is ignored when using : between schema and zk on sqlline connection string

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2487:
-
Fix Version/s: (was: 1.0.0)
   1.1.0

 Schema is ignored when using : between schema and zk on sqlline connection 
 string 
 

 Key: DRILL-2487
 URL: https://issues.apache.org/jira/browse/DRILL-2487
 Project: Apache Drill
  Issue Type: Bug
  Components: Client - CLI
Affects Versions: 0.8.0
Reporter: Krystal
Assignee: Daniel Barclay (Drill)
Priority: Minor
 Fix For: 1.1.0


 git.commit.id=ae2053d2a078a40033a140f2dfaeef802a5e8254
 Invoking sqlline using a : between the schema and zk causes sqlline not to 
 connect to the specified schema.  For example:
 root@qa-node113:~# /opt/drill/bin/sqlline -u 
 'jdbc:drill:schema=hive:zk=10.10.100.113:5181'
 touch: cannot touch `/var/log/drill/sqlline.log': No such file or directory
 Drill log directory /var/log/drill does not exist or is not writable, 
 defaulting to /opt/drill/log
 sqlline version 1.1.6
 0: jdbc:drill:schema=hive:zk=10.10.100.113:51> show tables;
 Query failed: RelConversionException: No schema selected. Select a schema 
 using 'USE schema' command
 If I put a ; between schema and zk, then sqlline connects to the specified 
 schema:
 root@qa-node113:~# /opt/drill/bin/sqlline -u 
 'jdbc:drill:schema=hive;zk=10.10.100.113:5181'
 touch: cannot touch `/var/log/drill/sqlline.log': No such file or directory
 Drill log directory /var/log/drill does not exist or is not writable, 
 defaulting to /opt/drill/log
 sqlline version 1.1.6
 0: jdbc:drill:schema=hive> show tables;
 +--------------+----------------------+
 | TABLE_SCHEMA | TABLE_NAME           |
 +--------------+----------------------+
 | hive.default | t2                   |
 | hive.default | episodes_partitioned |
 | hive.default | store                |
 | hive.default | store_sales          |
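 As an illustration only, here is a minimal Python sketch of a property parser for connection strings of this shape. It is not Drill's or sqlline's actual parsing code: parse_drill_url is a hypothetical helper, and the assumption that properties are separated by ; is inferred from the behavior reported above. Under that assumption, a : between schema and zk leaves a single schema property whose value swallows the zk setting, which is one plausible (unconfirmed) reason the schema is not selected.

```python
def parse_drill_url(url, prefix="jdbc:drill:"):
    """Split a connection string into key/value properties.

    Hypothetical helper for illustration; assumes ';' separates properties.
    """
    if not url.startswith(prefix):
        raise ValueError("not a jdbc:drill: URL: " + url)
    props = {}
    for part in url[len(prefix):].split(";"):
        if not part:
            continue  # tolerate a trailing ';'
        key, sep, value = part.partition("=")
        if not sep:
            raise ValueError("property without '=': " + part)
        props[key] = value
    return props


# ';' yields two separate properties.
good = parse_drill_url("jdbc:drill:schema=hive;zk=10.10.100.113:5181")
# ':' yields one property; the zk setting ends up inside the schema value.
bad = parse_drill_url("jdbc:drill:schema=hive:zk=10.10.100.113:5181")
```

 In this sketch, good contains both schema and zk, while bad contains only a schema key with the value hive:zk=10.10.100.113:5181.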
  




[jira] [Updated] (DRILL-2478) Validating values assigned to SYSTEM/SESSION configuration parameters

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2478:
-
Assignee: Chris Westin  (was: Parth Chandra)

 Validating values assigned to SYSTEM/SESSION configuration parameters
 -

 Key: DRILL-2478
 URL: https://issues.apache.org/jira/browse/DRILL-2478
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Flow
Affects Versions: 0.8.0
 Environment: {code}
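 0: jdbc:drill:> select * from sys.version;
 +-----------+----------------+-------------+-------------+------------+
 | commit_id | commit_message | commit_time | build_email | build_time |
 +-----------+----------------+-------------+-------------+------------+
 | f658a3c513ddf7f2d1b0ad7aa1f3f65049a594fe | DRILL-2209 Insert ProjectOperator with MuxExchange | 09.03.2015 @ 01:49:18 EDT | Unknown | 09.03.2015 @ 04:50:05 EDT |
 +-----------+----------------+-------------+-------------+------------+
 1 row selected (0.046 seconds)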
 {code}
Reporter: Khurram Faraaz
Assignee: Chris Westin
 Fix For: 1.0.0


 Values that are assigned to configuration parameters of type SYSTEM and 
 SESSION must be validated. Currently any value can be assigned to some of the 
 SYSTEM/SESSION type parameters.
 Here are two examples where assignment of invalid values to store.format does 
 not result in any error.
 {code}
 0: jdbc:drill:> alter session set `store.format`='1';
 +-------+-----------------------+
 | ok    | summary               |
 +-------+-----------------------+
 | true  | store.format updated. |
 +-------+-----------------------+
 1 row selected (0.02 seconds)
 {code}
 {code}
 0: jdbc:drill:> alter session set `store.format`='foo';
 +-------+-----------------------+
 | ok    | summary               |
 +-------+-----------------------+
 | true  | store.format updated. |
 +-------+-----------------------+
 1 row selected (0.039 seconds)
 {code}
 Some configuration parameters are already validated. In the example below, 
 assigning an invalid value to store.parquet.compression results in an error, 
 which is correct. However, this kind of validation is not performed for every 
 SYSTEM/SESSION configuration parameter. Values assigned to these parameters 
 must be validated, and an error must be reported when a user assigns an 
 incorrect value.
 {code}
 0: jdbc:drill:> alter session set `store.parquet.compression`='anything';
 Query failed: ExpressionParsingException: Option store.parquet.compression 
 must be one of: [snappy, gzip, none]
 Error: exception while executing query: Failure while executing query. 
 (state=,code=0)
 {code}
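 The requested behavior can be sketched in Python as a per-option table of accepted values that is checked before an assignment takes effect. This is a sketch under stated assumptions, not Drill's option-validation code: ALLOWED_VALUES and set_option are hypothetical names, the values for store.parquet.compression are taken from the error message above, and the values listed for store.format are assumed purely for illustration.

```python
# Hypothetical registry: option name -> set of accepted values.
ALLOWED_VALUES = {
    # Taken from the error message quoted in this report.
    "store.parquet.compression": {"snappy", "gzip", "none"},
    # Assumed for illustration; the report does not list the valid formats.
    "store.format": {"parquet", "json", "csv"},
}


def set_option(options, name, value):
    """Validate value against the registry before storing it in options."""
    allowed = ALLOWED_VALUES.get(name)
    if allowed is None:
        raise KeyError("Unknown option: " + name)
    if value not in allowed:
        raise ValueError(
            "Option %s must be one of: %s" % (name, sorted(allowed))
        )
    options[name] = value
    return options


session = set_option({}, "store.parquet.compression", "gzip")
```

 With such a check in place, assigning '1' or 'foo' to store.format would raise an error instead of reporting "store.format updated."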




[jira] [Updated] (DRILL-2241) CTAS fails when writing a repeated list

2015-04-06 Thread Jacques Nadeau (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jacques Nadeau updated DRILL-2241:
--
Fix Version/s: (was: Future)
   0.9.0

 CTAS fails when writing a repeated list
 ---

 Key: DRILL-2241
 URL: https://issues.apache.org/jira/browse/DRILL-2241
 Project: Apache Drill
  Issue Type: New Feature
  Components: Storage - Parquet
Affects Versions: 0.8.0
Reporter: Abhishek Girish
Assignee: Deneche A. Hakim
Priority: Blocker
 Fix For: 0.9.0

 Attachments: drillbit_replist.log


 Drill can read the following JSON file with a repeated list:
 {
   "a" : null,
   "b" : [ ["B1", "B2"] ]
 }
 Writing this to Parquet via a simple CTAS fails. 
  create table temp as select * from `replist.json`;
 Log indicates this to be unsupported (UnsupportedOperationException: 
 Unsupported type LIST)
 Log attached. 




[jira] [Updated] (DRILL-2435) Query SCHEMATA using querySubmitter against C++ client return no data

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2435:
-
Fix Version/s: (was: 1.0.0)
   1.1.0

 Query SCHEMATA using querySubmitter against C++ client return no data
 -

 Key: DRILL-2435
 URL: https://issues.apache.org/jira/browse/DRILL-2435
 Project: Apache Drill
  Issue Type: Bug
  Components: Client - C++
Affects Versions: 0.8.0
Reporter: Krystal
Assignee: Parth Chandra
Priority: Minor
 Fix For: 1.1.0


 The following query from querySubmitter against C++ client returns rows with 
 empty column values:
 CATALOG_NAME  SCHEMA_NAME  SCHEMA_OWNER  TYPE  IS_MUTABLE
 ROW: 1
 ROW: 2
 ROW: 3
 ROW: 4
 ROW: 5
 ROW: 6
 Running the same query from Drill returns all data.




[jira] [Updated] (DRILL-2416) Zookeeper in sqlline connection string does not override the entry from drill-override.conf

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2416:
-
Fix Version/s: (was: 1.0.0)
   1.1.0

 Zookeeper in sqlline connection string does not override the entry from 
 drill-override.conf 
 

 Key: DRILL-2416
 URL: https://issues.apache.org/jira/browse/DRILL-2416
 Project: Apache Drill
  Issue Type: Bug
  Components: Client - CLI
Affects Versions: 0.8.0
Reporter: Krystal
Assignee: Parth Chandra
 Fix For: 1.1.0


 git.commit.id=f658a3c513ddf7f2d1b0ad7aa1f3f65049a594fe
 On the sqlline jdbc connection string, I changed the zookeeper ip to point to 
 another cluster; however, sqlline kept connecting to the drillbits specified 
 in drill-override.conf.  After I updated drill-override.conf with the other 
 zookeeper information, I was able to successfully connect to the 
 drillbits on a remote cluster.




[jira] [Updated] (DRILL-2496) Add SSL support to C++ client

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2496:
-
Fix Version/s: (was: 1.0.0)
   1.1.0

 Add SSL support to C++ client
 -

 Key: DRILL-2496
 URL: https://issues.apache.org/jira/browse/DRILL-2496
 Project: Apache Drill
  Issue Type: Improvement
  Components: Client - C++
Reporter: Parth Chandra
Assignee: Parth Chandra
 Fix For: 1.1.0


 Needed for impersonation where username and password are sent over the wire 
 to the user.




[jira] [Updated] (DRILL-2273) Move new JDBC test support classes to org.apache.drill.jdbc.test/etc.

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2273:
-
Fix Version/s: (was: 1.0.0)
   1.1.0

 Move new JDBC test support classes to org.apache.drill.jdbc.test/etc.
 -

 Key: DRILL-2273
 URL: https://issues.apache.org/jira/browse/DRILL-2273
 Project: Apache Drill
  Issue Type: Bug
  Components: Client - JDBC
Reporter: Daniel Barclay (Drill)
Assignee: Daniel Barclay (Drill)
Priority: Minor
 Fix For: 1.1.0


 JDBC test support classes in package org.apache.drill.jdbc in the 
 src/test/java tree should be elsewhere, presumably in package 
 org.apache.drill.jdbc.test:
 - CachingConnectionFactory
 - ConnectionFactory
 - ConnectionInfo
 - JdbcTest
 - MultiConnectionCachingFactory
 - NonClosableConnectionCachingFactory
 - SingleConnectionCachingFactory
 (In the src/test/java tree, package org.apache.drill.jdbc should be reserved 
 for tests of types in package org.apache.drill.jdbc in src/main/java.)




[jira] [Updated] (DRILL-2594) Web UI visualized plan may show incorrect plan due to rearranging

2015-04-06 Thread Chris Westin (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Westin updated DRILL-2594:

Fix Version/s: 1.0.0

 Web UI visualized plan may show incorrect plan due to rearranging 
 --

 Key: DRILL-2594
 URL: https://issues.apache.org/jira/browse/DRILL-2594
 Project: Apache Drill
  Issue Type: Improvement
  Components: Tools, Build & Test
Affects Versions: 0.8.0
Reporter: Aman Sinha
Assignee: Sudheesh Katkam
 Fix For: 1.0.0


 For large enough query plans (such as TPCH query 8), the visualized plan in 
 the Web UI shows the inputs of a HashJoin as swapped, whereas in the actual 
 plan they are in the correct order.




[jira] [Updated] (DRILL-1724) Can't use ~ to specify home directory in table path

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-1724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-1724:
-
Assignee: Steven Phillips  (was: Mehant Baid)

 Can't use ~ to specify home directory in table path
 ---

 Key: DRILL-1724
 URL: https://issues.apache.org/jira/browse/DRILL-1724
 Project: Apache Drill
  Issue Type: New Feature
  Components: Storage - Text & CSV
Reporter: Tomer Shiran
Assignee: Steven Phillips
Priority: Minor
 Fix For: 1.0.0


 Drill throws an error when there's a tilde (~) in the path (which was 
 intended to refer to the user's home directory). This should work (and even 
 if not, the error message should explain what the issue is).
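 One way the requested expansion could behave is sketched below in Python. This is only an illustration of the idea, not Drill code: expand_home is a hypothetical helper, the home directory is passed in explicitly, and only a leading ~ (alone or followed by /) is expanded, which is an assumption about the intended semantics.

```python
def expand_home(path, home):
    """Replace a leading '~' with the given home directory.

    Hypothetical helper; '~user' forms and a '~' elsewhere in the path
    are deliberately left untouched.
    """
    if path == "~":
        return home
    if path.startswith("~/"):
        return home.rstrip("/") + path[1:]
    return path


expanded = expand_home("~/data/table.csv", "/home/alice")
```

 A path such as /tmp/~backup/t.csv is returned unchanged, since the tilde is not in the leading position.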




[jira] [Assigned] (DRILL-2064) Internal Float4 and Float8 names are confusing; rename using Float and Double

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra reassigned DRILL-2064:


Assignee: Parth Chandra

 Internal Float4 and Float8 names are confusing; rename using Float and 
 Double
 -

 Key: DRILL-2064
 URL: https://issues.apache.org/jira/browse/DRILL-2064
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Reporter: Daniel Barclay (Drill)
Assignee: Parth Chandra
Priority: Minor
 Fix For: 1.1.0


 [For/from Jacques:]
 The internal names (e.g., enumerations,  vector subtypes, function names) 
 containing Float4 and Float8 are confusing since they don't correspond to 
 type names float and double in Java, etc.
 Things currently named using Float4 should be renamed to names using 
 Float; things currently named using Float8 should be renamed using 
 Double.




[jira] [Updated] (DRILL-2064) Internal Float4 and Float8 names are confusing; rename using Float and Double

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2064:
-
Fix Version/s: (was: 1.0.0)
   1.1.0

 Internal Float4 and Float8 names are confusing; rename using Float and 
 Double
 -

 Key: DRILL-2064
 URL: https://issues.apache.org/jira/browse/DRILL-2064
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Reporter: Daniel Barclay (Drill)
Priority: Minor
 Fix For: 1.1.0


 [For/from Jacques:]
 The internal names (e.g., enumerations,  vector subtypes, function names) 
 containing Float4 and Float8 are confusing since they don't correspond to 
 type names float and double in Java, etc.
 Things currently named using Float4 should be renamed to names using 
 Float; things currently named using Float8 should be renamed using 
 Double.




[jira] [Updated] (DRILL-1966) Fix mock data generation for date, time and decimal types

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-1966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-1966:
-
Fix Version/s: (was: 1.0.0)
   1.1.0

 Fix mock data generation for date, time and decimal types
 -

 Key: DRILL-1966
 URL: https://issues.apache.org/jira/browse/DRILL-1966
 Project: Apache Drill
  Issue Type: Test
  Components: Execution - Data Types
Reporter: Jason Altekruse
Assignee: Daniel Barclay (Drill)
Priority: Minor
 Fix For: 1.1.0


 Currently the mock data generation code is shared among all of the vectors 
 created by a single freemarker template. Thus all of the types that are 
 generated by the FixedValueVectors.java template all use the same method that 
 fills values with alternating min/max byte values. This produces invalid data 
 for the decimal, time and date types.




[jira] [Commented] (DRILL-2457) aggregation function on complex type should give appropriate error

2015-04-06 Thread Parth Chandra (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-2457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14481480#comment-14481480
 ] 

Parth Chandra commented on DRILL-2457:
--

[~cch...@maprtech.com] Can you verify that the error message is displayed now? If so, 
please lower the priority. We can mark this fixed only after DRILL-2430 is done.


 aggregation function on complex type should give appropriate error 
 ---

 Key: DRILL-2457
 URL: https://issues.apache.org/jira/browse/DRILL-2457
 Project: Apache Drill
  Issue Type: Bug
  Components: Functions - Drill
Affects Versions: 0.8.0
Reporter: Chun Chang
Assignee: Parth Chandra
Priority: Critical
 Fix For: 0.9.0


 #Fri Mar 06 16:51:10 EST 2015
 git.commit.id.abbrev=fb293ba
 Applying an aggregation function to complex data should error out with an 
 appropriate error message instead of returning null.
 {code}
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ select t.soa[1] from 
 `complex.json` t limit 1;
 ++
 |   EXPR$0   |
 ++
 | {in:1,fl:1.12345} |
 ++
 1 row selected (0.169 seconds)
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ select avg(t.soa[1]) from 
 `complex.json` t;
 +--+
 |  |
 +--+
 +--+
 No rows selected (0.164 seconds)
 {code}




[jira] [Updated] (DRILL-2457) aggregation function on complex type should give appropriate error

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2457:
-
Assignee: Chun Chang  (was: Parth Chandra)

 aggregation function on complex type should give appropriate error 
 ---

 Key: DRILL-2457
 URL: https://issues.apache.org/jira/browse/DRILL-2457
 Project: Apache Drill
  Issue Type: Bug
  Components: Functions - Drill
Affects Versions: 0.8.0
Reporter: Chun Chang
Assignee: Chun Chang
Priority: Critical
 Fix For: 0.9.0


 #Fri Mar 06 16:51:10 EST 2015
 git.commit.id.abbrev=fb293ba
 Applying an aggregation function to complex data should error out with an 
 appropriate error message instead of returning null.
 {code}
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ select t.soa[1] from 
 `complex.json` t limit 1;
 ++
 |   EXPR$0   |
 ++
 | {in:1,fl:1.12345} |
 ++
 1 row selected (0.169 seconds)
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ select avg(t.soa[1]) from 
 `complex.json` t;
 +--+
 |  |
 +--+
 +--+
 No rows selected (0.164 seconds)
 {code}




[jira] [Updated] (DRILL-1446) C++ Client. Make Drill Client Logger thread safe

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-1446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-1446:
-
Fix Version/s: (was: 1.0.0)
   1.1.0

 C++ Client. Make Drill Client Logger thread safe
 

 Key: DRILL-1446
 URL: https://issues.apache.org/jira/browse/DRILL-1446
 Project: Apache Drill
  Issue Type: Bug
  Components: Client - C++
Reporter: Xiao Meng
Assignee: Parth Chandra
Priority: Minor
 Fix For: 1.1.0


 The C++ client logs through ostream/ofstream, which is not thread-safe. If 
 multiple threads write to the log file, the behavior is undefined.




[jira] [Commented] (DRILL-2192) DrillScanRel should differentiate skip-all scan-all scan-some semantics while creating a GroupScan [umbrella]

2015-04-06 Thread Parth Chandra (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-2192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14481493#comment-14481493
 ] 

Parth Chandra commented on DRILL-2192:
--

Reassigning to Steven for the two remaining subtasks.

 DrillScanRel should differentiate skip-all  scan-all  scan-some semantics 
 while creating a GroupScan [umbrella]
 -

 Key: DRILL-2192
 URL: https://issues.apache.org/jira/browse/DRILL-2192
 Project: Apache Drill
  Issue Type: Improvement
  Components: Query Planning & Optimization
Reporter: Hanifi Gunes
Assignee: Steven Phillips
 Fix For: 1.0.0


 DrillScanRel passes a list of columns to be read into GroupScan. Currently 
 the logic here is to scan all of the columns even if planner asks to skip 
 them all. Skipping all of the columns is particularly beneficial for the case 
 of count(star) that is translated to count(constant) where we just need row 
 count but not the actual data.
 The idea is to distinguish three separate states depending on the output 
 coming from planner as follows:
 | list of columns from planner | scan semantics |
 | null | scan-all |
 | empty list of columns | skip-all |
 | non-empty list of columns w/o star | scan-some |
 | list of columns with star | scan-all |
 As part of this umbrella, we should make readers understand skip-all semantics.
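 The mapping in the table above can be written down as a small Python function. This is a sketch, not the DrillScanRel or GroupScan code: scan_semantics is a hypothetical name, and representing the star column as the string "*" is an assumption made for the example.

```python
from typing import Optional, Sequence

STAR = "*"  # assumed representation of the star column


def scan_semantics(columns: Optional[Sequence[str]]) -> str:
    """Map the planner's column list to the scan semantics in the table."""
    if columns is None:
        return "scan-all"   # planner supplied no list at all
    if len(columns) == 0:
        return "skip-all"   # empty list: only the row count is needed
    if STAR in columns:
        return "scan-all"   # a star anywhere in the list means every column
    return "scan-some"      # explicit, star-free list of columns


count_star_case = scan_semantics([])
```

 The count(star) case described above corresponds to the empty list, so count_star_case is "skip-all".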




[jira] [Updated] (DRILL-2192) DrillScanRel should differentiate skip-all scan-all scan-some semantics while creating a GroupScan [umbrella]

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2192:
-
Assignee: Steven Phillips  (was: Hanifi Gunes)

 DrillScanRel should differentiate skip-all  scan-all  scan-some semantics 
 while creating a GroupScan [umbrella]
 -

 Key: DRILL-2192
 URL: https://issues.apache.org/jira/browse/DRILL-2192
 Project: Apache Drill
  Issue Type: Improvement
  Components: Query Planning & Optimization
Reporter: Hanifi Gunes
Assignee: Steven Phillips
 Fix For: 1.0.0


 DrillScanRel passes a list of columns to be read into GroupScan. Currently 
 the logic here is to scan all of the columns even if planner asks to skip 
 them all. Skipping all of the columns is particularly beneficial for the case 
 of count(star) that is translated to count(constant) where we just need row 
 count but not the actual data.
 The idea is to distinguish three separate states depending on the output 
 coming from planner as follows:
 | list of columns from planner | scan semantics |
 | null | scan-all |
 | empty list of columns | skip-all |
 | non-empty list of columns w/o star | scan-some |
 | list of columns with star | scan-all |
 As part of this umbrella, we should make readers understand skip-all semantics.




[jira] [Updated] (DRILL-1890) Parquet reader throws a cryptic exception when reading a file with colliding column names according to Drill's case-insensitive field matching

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-1890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-1890:
-
Assignee: Steven Phillips  (was: Parth Chandra)

 Parquet reader throws a cryptic exception when reading a file with colliding 
 column names according to Drill's case-insensitive field matching
 --

 Key: DRILL-1890
 URL: https://issues.apache.org/jira/browse/DRILL-1890
 Project: Apache Drill
  Issue Type: Bug
  Components: Storage - Parquet
Reporter: Jason Altekruse
Assignee: Steven Phillips
Priority: Minor
 Fix For: 1.0.0







[jira] [Commented] (DRILL-2457) aggregation function on complex type should give appropriate error

2015-04-06 Thread Chun Chang (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-2457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14481590#comment-14481590
 ] 

Chun Chang commented on DRILL-2457:
---

[~parthc] I will lower the priority to minor.

{code}
0: jdbc:drill:schema=dfs.drillTestDirComplexJ select avg(t.soa[1]) from 
`complex.json` t;
Query failed: RemoteRpcException: Failure while running fragment., Failure 
while materializing expression.
Error in expression at index -1.  Error: Missing function implementation: 
[$sum0(MAP-REQUIRED)].  Full expression: --UNKNOWN EXPRESSION--.
Error in expression at index -1.  Error: Missing function implementation: 
[count(MAP-REQUIRED)].  Full expression: --UNKNOWN EXPRESSION--. [ 
0a36eadd-802a-49df-a2b8-6718e4f2b2c4 on qa-node118.qa.lab:31010 ]
[ 0a36eadd-802a-49df-a2b8-6718e4f2b2c4 on qa-node118.qa.lab:31010 ]


Error: exception while executing query: Failure while executing query. 
(state=,code=0)
{code}

 aggregation function on complex type should give appropriate error 
 ---

 Key: DRILL-2457
 URL: https://issues.apache.org/jira/browse/DRILL-2457
 Project: Apache Drill
  Issue Type: Bug
  Components: Functions - Drill
Affects Versions: 0.8.0
Reporter: Chun Chang
Assignee: Chun Chang
Priority: Critical
 Fix For: 0.9.0


 #Fri Mar 06 16:51:10 EST 2015
 git.commit.id.abbrev=fb293ba
 Applying an aggregation function to complex data should error out with an 
 appropriate error message instead of returning null.
 {code}
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ select t.soa[1] from 
 `complex.json` t limit 1;
 ++
 |   EXPR$0   |
 ++
 | {in:1,fl:1.12345} |
 ++
 1 row selected (0.169 seconds)
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ select avg(t.soa[1]) from 
 `complex.json` t;
 +--+
 |  |
 +--+
 +--+
 No rows selected (0.164 seconds)
 {code}




[jira] [Updated] (DRILL-2409) Drill profile page mishandles statistics from long running queries

2015-04-06 Thread Chris Westin (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Westin updated DRILL-2409:

Fix Version/s: (was: 0.9.0)
   1.0.0

 Drill profile page mishandles statistics from long running queries
 --

 Key: DRILL-2409
 URL: https://issues.apache.org/jira/browse/DRILL-2409
 Project: Apache Drill
  Issue Type: Bug
  Components: Client - HTTP
Reporter: Jacques Nadeau
Assignee: Sudheesh Katkam
 Fix For: 1.0.0


 We recently ran a 72 hour query that joined several trillion records for a 
 customer.  While the query completed successfully, the presentation on the 
 profile page had a number of problems.  This included times not being 
 correctly reported (they were truncated) and the Gantt timeline being 
 unreadable (since it doesn't scale the axes from seconds). We should correct 
 these.  (Durations specifically should be presented as 4m 2s, 7h 4m or 
 7d 4h 4m instead of 07:04:02, since we're talking about durations and 
 not times.)
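 A duration formatter along the lines suggested above might look like the following Python sketch. It is not the profile page's code: format_duration is a hypothetical name, and the rule for which units to show (drop seconds once hours appear, show at most three units) is inferred from the three examples in the description rather than stated there.

```python
def format_duration(total_seconds):
    """Render a duration in whole seconds using d/h/m/s units."""
    if total_seconds < 0:
        raise ValueError("duration must be non-negative")
    minutes, seconds = divmod(total_seconds, 60)
    hours, minutes = divmod(minutes, 60)
    days, hours = divmod(hours, 24)
    if days:
        return "%dd %dh %dm" % (days, hours, minutes)
    if hours:
        return "%dh %dm" % (hours, minutes)
    if minutes:
        return "%dm %ds" % (minutes, seconds)
    return "%ds" % seconds


# A 72 hour query, as in the report, renders without truncation.
long_query = format_duration(72 * 3600)
```

 Here long_query is "3d 0h 0m", whereas a clock-style HH:MM:SS field would have to wrap or truncate the hours.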



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (DRILL-2697) Pause injections should pause indefinitely until signalled

2015-04-06 Thread Sudheesh Katkam (JIRA)

Sudheesh Katkam created DRILL-2697:
--

 Summary: Pause injections should pause indefinitely until signalled
 Key: DRILL-2697
 URL: https://issues.apache.org/jira/browse/DRILL-2697
 Project: Apache Drill
  Issue Type: Improvement
  Components: Execution - Flow
Affects Versions: 0.9.0
Reporter: Sudheesh Katkam
 Fix For: 1.1.0


Currently, injected pauses make threads sleep for a specified time. This can be 
enhanced to stop the thread indefinitely using a CountDownLatch. It is quite 
similar to how cancellation works. 

Tasks: 
(a) Add another message to the RPC layer to signal paused remote threads to resume 
(through ControlHandler) by counting down. Complications arise if the thread has not 
reached the pause site yet.
(b) Add resume signal (like ctrl-c) to sqlline 
(further enhancement: another signal to trigger pause from sqlline)
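
The pause-until-signalled idea can be sketched in Python, with threading.Event standing in for a CountDownLatch with a count of one. This is an illustration under assumptions, not the proposed Drill implementation: PauseInjection and run_demo are hypothetical names, and the RPC message and ControlHandler are not modelled. In this sketch, a resume signal that arrives before the thread reaches the pause site is simply remembered, so the later pause returns immediately.

```python
import threading


class PauseInjection:
    """Blocks a thread at a pause site until resume() is called."""

    def __init__(self):
        self._resumed = threading.Event()

    def pause(self, timeout=None):
        # Returns True once resumed; False only if a timeout expires first.
        return self._resumed.wait(timeout)

    def resume(self):
        self._resumed.set()


def run_demo():
    injection = PauseInjection()
    reached_site = threading.Event()
    events = []

    def worker():
        events.append("before pause")
        reached_site.set()
        injection.pause()          # blocks indefinitely until signalled
        events.append("after pause")

    thread = threading.Thread(target=worker)
    thread.start()
    reached_site.wait()
    still_paused = thread.is_alive()  # worker cannot finish before resume()
    injection.resume()
    thread.join()
    return events, still_paused


events, still_paused = run_demo()
```

Using a sticky flag (or a latch) rather than a one-shot notification is what keeps an early resume from being lost.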





[jira] [Commented] (DRILL-2409) Drill profile page mishandles statistics from long running queries

2015-04-06 Thread Chris Westin (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-2409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14481594#comment-14481594
 ] 

Chris Westin commented on DRILL-2409:
-

This sounds like a superset of DRILL-2533.

 Drill profile page mishandles statistics from long running queries
 --

 Key: DRILL-2409
 URL: https://issues.apache.org/jira/browse/DRILL-2409
 Project: Apache Drill
  Issue Type: Bug
  Components: Client - HTTP
Reporter: Jacques Nadeau
Assignee: Sudheesh Katkam
 Fix For: 1.0.0


 We recently ran a 72 hour query that joined several trillion records for a 
 customer.  While the query completed successfully, the presentation on the 
 profile page had a number of problems.  This included times not being 
 correctly reported (they were truncated) and the Gantt timeline being 
 unreadable (since it doesn't scale the axes from seconds). We should correct 
 these.  (Durations specifically should be presented as 4m 2s, 7h 4m or 
 7d 4h 4m instead of 07:04:02, since we're talking about durations and 
 not times.)




[jira] [Commented] (DRILL-2688) Use of ORDER BY on right side of Union All results in SqlValidatorException

2015-04-06 Thread Sean Hsuan-Yi Chu (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14481864#comment-14481864
 ] 

Sean Hsuan-Yi Chu commented on DRILL-2688:
--

Indeed, it is a bug. Calcite believes the ORDER BY is associated with the outer 
query (which violates the SQL standard).

A simple workaround is to use parentheses to help Calcite. Given the simplicity 
of the workaround, this issue is downgraded to minor. 

Thanks Khurram and Abhishek!

 Use of ORDER BY on right side of Union All results in SqlValidatorException
 ---

 Key: DRILL-2688
 URL: https://issues.apache.org/jira/browse/DRILL-2688
 Project: Apache Drill
  Issue Type: Bug
  Components: Query Planning & Optimization
Affects Versions: 0.9.0
 Environment:  9d92b8e319f2d46e8659d903d355450e15946533 | DRILL-2580: 
 Exit early from HashJoinBatch if build side is empty | 26.03.2015 @ 16:13:53 
 EDT
Reporter: Khurram Faraaz
Assignee: Sean Hsuan-Yi Chu

 Order by in the right leg of Union All results in SqlValidatorException.
 Executing the query individually (the query that is on right side of Union 
 All) gives expected results and no Exceptions are seen. However, executing 
 the query as part of the right leg of Union All results in Exception.
 Tests were executed on a 4-node cluster on CentOS. Both sides of Union All got 
 input from CSV files.
 The query below returns correct results when executed individually. (this is 
 the same query used on right side of Union All query)
 {code}
 0: jdbc:drill: select columns[0] from `testWindow.csv` t2 where 
 t2.columns[0] is not null group by columns[0] order by columns[0];
 ++
 |   EXPR$0   |
 ++
 | 10 |
 | 100|
 | 113|
 | 119|
 | 2  |
 | 50 |
 | 55 |
 | 57 |
 | 61 |
 | 67 |
 | 89 |
 ++
 11 rows selected (0.265 seconds)
 {code}
 Note that the Union All query works when there is no order by in the right 
 leg.
 {code}
 0: jdbc:drill: select columns[0] from `employee.csv` t1 where 
 t1.columns[0]=1 union all select columns[0] from `testWindow.csv` t2 where 
 t2.columns[0] is not null group by columns[0];
 ++
 |   EXPR$0   |
 ++
 | 1  |
 | 100|
 | 10 |
 | 2  |
 | 50 |
 | 55 |
 | 67 |
 | 113|
 | 119|
 | 89 |
 | 57 |
 | 61 |
 ++
 12 rows selected (0.175 seconds)
 {code}
 In this case we do use an order by on the right side of Union All, which 
 results in SqlValidatorException. Same query as above, except that we have an 
 order by clause in the query on the right side of Union All.
 {code}
 0: jdbc:drill: select columns[0] from `employee.csv` t1 where 
 t1.columns[0]=1 union all select columns[0] from `testWindow.csv` t2 where 
 t2.columns[0] is not null group by columns[0] order by columns[0];
 Query failed: SqlValidatorException: Column 'columns' not found in any table
 Error: exception while executing query: Failure while executing query. 
 (state=,code=0)
 {code}
 Stack trace from drillbit.log 
 {code}
 2015-04-04 04:31:22,882 [2ae096e5-4085-de95-5642-ae10d320a8f7:foreman] ERROR 
 o.a.drill.exec.work.foreman.Foreman - Error 
 c6a60f71-c959-4ae5-ba8c-23b9c7b6bb7e: SqlValidatorException: Column 'columns' 
 not found in any table
 org.apache.drill.exec.planner.sql.QueryInputException: Failure validating 
 SQL. org.eigenbase.util.EigenbaseContextException: From line 1, column 178 to 
 line 1, column 184: Column 'columns' not found in any table
 at 
 org.apache.drill.exec.planner.sql.DrillSqlWorker.getPlan(DrillSqlWorker.java:147)
  ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.foreman.Foreman.runSQL(Foreman.java:773) 
 ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at org.apache.drill.exec.work.foreman.Foreman.run(Foreman.java:204) 
 ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
  [na:1.7.0_75]
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
  [na:1.7.0_75]
 at java.lang.Thread.run(Thread.java:745) [na:1.7.0_75]
 Caused by: net.hydromatic.optiq.tools.ValidationException: 
 org.eigenbase.util.EigenbaseContextException: From line 1, column 178 to line 
 1, column 184: Column 'columns' not found in any table
 at 
 net.hydromatic.optiq.prepare.PlannerImpl.validate(PlannerImpl.java:176) 
 ~[optiq-core-0.9-drill-r20.jar:na]
 at 
 org.apache.drill.exec.planner.sql.handlers.DefaultSqlHandler.validateNode(DefaultSqlHandler.java:157)

[jira] [Commented] (DRILL-2591) Aggregate in left input to Union All does not work

2015-04-06 Thread Sean Hsuan-Yi Chu (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-2591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481875#comment-14481875
 ] 

Sean Hsuan-Yi Chu commented on DRILL-2591:
--

Review done at:
https://reviews.apache.org/r/32590/

Commit #: a53e12336c29b421f1df51da480af9a65d70bb72

 Aggregate in left input to Union All does not work
 --

 Key: DRILL-2591
 URL: https://issues.apache.org/jira/browse/DRILL-2591
 Project: Apache Drill
  Issue Type: Bug
  Components: Query Planning & Optimization
Affects Versions: 0.9.0
 Environment: {code}
 0: jdbc:drill:> select * from sys.version;
 +++-+-++
 | commit_id  | commit_message | commit_time | build_email | build_time |
 +++-+-++
 | 9d92b8e319f2d46e8659d903d355450e15946533 | DRILL-2580: Exit early from 
 HashJoinBatch if build side is empty | 26.03.2015 @ 16:13:53 EDT | Unknown
  | 26.03.2015 @ 16:53:21 EDT |
 +++-+-++
 1 row selected (0.104 seconds)
 {code}
Reporter: Khurram Faraaz
Assignee: Sean Hsuan-Yi Chu
 Attachments: DRILL-2591.1.patch


 If the left input to Union All contains an aggregate function, the query fails 
 with a SQLException. This was seen on a 4-node cluster.
 {code}
 0: jdbc:drill:> select max(key) from `dateData.json` union all select key 
 from `timeStmpData.json`;
 +------------+
 |   EXPR$0   |
 +------------+
 Query failed: Query stopped., Schema change detected in the left input of 
 Union-All. This is not currently supported [ 
 441285d7-e4a5-46c8-ab11-a0332945e3fc on centos-04.qa.lab:31010 ]
 java.lang.RuntimeException: java.sql.SQLException: Failure while executing 
 query.
   at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
   at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
   at sqlline.SqlLine.print(SqlLine.java:1809)
   at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
   at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
   at sqlline.SqlLine.dispatch(SqlLine.java:889)
   at sqlline.SqlLine.begin(SqlLine.java:763)
   at sqlline.SqlLine.start(SqlLine.java:498)
   at sqlline.SqlLine.main(SqlLine.java:460)
 {code}
 Stack trace from drillbit.log
 {code}
 2015-03-27 00:29:09,795 [2aeb5baa-5af0-ac70-b49a-53e61c92be51:frag:0:0] ERROR 
 o.a.drill.exec.work.foreman.Foreman - Error 
 e3ad43f5-fda6-48e5-9e74-779c69bb3cb2: RemoteRpcException: Failure while 
 running fragment., Schema change detected in the left input of Union-All. 
 This is not currently supported [ c2c7add0-651b-44d8-9a7c-3218761098e4 on 
 centos-04.qa.lab:31010 ]
 [ c2c7add0-651b-44d8-9a7c-3218761098e4 on centos-04.qa.lab:31010 ]
 org.apache.drill.exec.rpc.RemoteRpcException: Failure while running 
 fragment., Schema change detected in the left input of Union-All. This is not 
 currently supported [ c2c7add0-651b-44d8-9a7c-3218761098e4 on 
 centos-04.qa.lab:31010 ]
 [ c2c7add0-651b-44d8-9a7c-3218761098e4 on centos-04.qa.lab:31010 ]
 at 
 org.apache.drill.exec.work.foreman.QueryManager.statusUpdate(QueryManager.java:163)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.foreman.QueryManager$RootStatusReporter.statusChange(QueryManager.java:281)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:114)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:110)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.fragment.FragmentExecutor.internalFail(FragmentExecutor.java:230)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:165)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38)
  [drill-common-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
  [na:1.7.0_75]
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
  [na:1.7.0_75]
 at java.lang.Thread.run(Thread.java:745) [na:1.7.0_75]
 2015-03-27 00:29:09,796 [2aeb5baa-5af0-ac70-b49a-53e61c92be51:frag:0:0] WARN  
 o.a.d.e.w.fragment.FragmentExecutor - Error while initializing or executing 
 fragment
 java.lang.RuntimeException: Error closing fragment context.
 at

[jira] [Resolved] (DRILL-2591) Aggregate in left input to Union All does not work

2015-04-06 Thread Sean Hsuan-Yi Chu (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Hsuan-Yi Chu resolved DRILL-2591.
--
Resolution: Fixed

 Aggregate in left input to Union All does not work
 --

 Key: DRILL-2591
 URL: https://issues.apache.org/jira/browse/DRILL-2591
 Project: Apache Drill
  Issue Type: Bug
  Components: Query Planning & Optimization
Affects Versions: 0.9.0
 Environment: {code}
 0: jdbc:drill:> select * from sys.version;
 +++-+-++
 | commit_id  | commit_message | commit_time | build_email | build_time |
 +++-+-++
 | 9d92b8e319f2d46e8659d903d355450e15946533 | DRILL-2580: Exit early from 
 HashJoinBatch if build side is empty | 26.03.2015 @ 16:13:53 EDT | Unknown
  | 26.03.2015 @ 16:53:21 EDT |
 +++-+-++
 1 row selected (0.104 seconds)
 {code}
Reporter: Khurram Faraaz
Assignee: Sean Hsuan-Yi Chu
 Attachments: DRILL-2591.1.patch


 If the left input to Union All contains an aggregate function, the query fails 
 with a SQLException. This was seen on a 4-node cluster.
 {code}
 0: jdbc:drill:> select max(key) from `dateData.json` union all select key 
 from `timeStmpData.json`;
 +------------+
 |   EXPR$0   |
 +------------+
 Query failed: Query stopped., Schema change detected in the left input of 
 Union-All. This is not currently supported [ 
 441285d7-e4a5-46c8-ab11-a0332945e3fc on centos-04.qa.lab:31010 ]
 java.lang.RuntimeException: java.sql.SQLException: Failure while executing 
 query.
   at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
   at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
   at sqlline.SqlLine.print(SqlLine.java:1809)
   at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
   at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
   at sqlline.SqlLine.dispatch(SqlLine.java:889)
   at sqlline.SqlLine.begin(SqlLine.java:763)
   at sqlline.SqlLine.start(SqlLine.java:498)
   at sqlline.SqlLine.main(SqlLine.java:460)
 {code}
 Stack trace from drillbit.log
 {code}
 2015-03-27 00:29:09,795 [2aeb5baa-5af0-ac70-b49a-53e61c92be51:frag:0:0] ERROR 
 o.a.drill.exec.work.foreman.Foreman - Error 
 e3ad43f5-fda6-48e5-9e74-779c69bb3cb2: RemoteRpcException: Failure while 
 running fragment., Schema change detected in the left input of Union-All. 
 This is not currently supported [ c2c7add0-651b-44d8-9a7c-3218761098e4 on 
 centos-04.qa.lab:31010 ]
 [ c2c7add0-651b-44d8-9a7c-3218761098e4 on centos-04.qa.lab:31010 ]
 org.apache.drill.exec.rpc.RemoteRpcException: Failure while running 
 fragment., Schema change detected in the left input of Union-All. This is not 
 currently supported [ c2c7add0-651b-44d8-9a7c-3218761098e4 on 
 centos-04.qa.lab:31010 ]
 [ c2c7add0-651b-44d8-9a7c-3218761098e4 on centos-04.qa.lab:31010 ]
 at 
 org.apache.drill.exec.work.foreman.QueryManager.statusUpdate(QueryManager.java:163)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.foreman.QueryManager$RootStatusReporter.statusChange(QueryManager.java:281)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:114)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:110)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.fragment.FragmentExecutor.internalFail(FragmentExecutor.java:230)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:165)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38)
  [drill-common-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
  [na:1.7.0_75]
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
  [na:1.7.0_75]
 at java.lang.Thread.run(Thread.java:745) [na:1.7.0_75]
 2015-03-27 00:29:09,796 [2aeb5baa-5af0-ac70-b49a-53e61c92be51:frag:0:0] WARN  
 o.a.d.e.w.fragment.FragmentExecutor - Error while initializing or executing 
 fragment
 java.lang.RuntimeException: Error closing fragment context.
 at 
 org.apache.drill.exec.work.fragment.FragmentExecutor.closeOutResources(FragmentExecutor.java:224)

[jira] [Updated] (DRILL-1924) Unexpected exception in TRUNC function when unsupported data type is passed in

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-1924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-1924:
-
Assignee: Mehant Baid

 Unexpected exception in TRUNC function when unsupported data type is passed in
 --

 Key: DRILL-1924
 URL: https://issues.apache.org/jira/browse/DRILL-1924
 Project: Apache Drill
  Issue Type: Bug
  Components: Functions - Drill
Affects Versions: 0.8.0
Reporter: Victoria Markman
Assignee: Mehant Baid
Priority: Minor
 Fix For: 1.0.0


 t.json
 {code}
 { a1: 0, b1: 0, c1: true,   d1 : 2015-01-02}
 { a1: 0, b1: 0, c1: false , d1 : 2015-01-03}
 { a1: 0, b1: 0, c1: false , d1 : 2015-01-04}
 { a1: 1, b1: 1, c1: true ,  d1 : 2015-01-05}
 { a1: 1, b1: 1, c1: true ,  d1 : 2015-01-06}
 {code}
 {code}
 0: jdbc:drill:schema=dfs> select trunc(c1) from `t.json`;
 Query failed: Query stopped., Failure while trying to materialize incoming 
 schema.  Errors:
 Error in expression at index -1.  Error: Missing function implementation: 
 [castTINYINT(VARCHAR-OPTIONAL)].  Full expression: --UNKNOWN EXPRESSION--.. [ 
 8fa0b28d-da4a-4c73-81df-823033dedc35 on atsqa4-133.qa.lab:31010 ]
 Error: exception while executing query: Failure while executing query. 
 (state=,code=0)
 0: jdbc:drill:schema=dfs> select trunc(cast(d1 as timestamp)) from `t.json`;
 Query failed: Query stopped., Failure while trying to materialize incoming 
 schema.  Errors:
 Error in expression at index -1.  Error: Missing function implementation: 
 [trunc(TIMESTAMP-OPTIONAL)].  Full expression: --UNKNOWN EXPRESSION--.. [ 
 2a6783c4-e111-47dc-ba67-a767a111e3a9 on atsqa4-133.qa.lab:31010 ]
 Error: exception while executing query: Failure while executing query. 
 (state=,code=0)
 0: jdbc:drill:schema=dfs> select trunc(cast(d1 as date)) from `t.json`;
 Query failed: Query stopped., Failure while trying to materialize incoming 
 schema.  Errors:
 Error in expression at index -1.  Error: Missing function implementation: 
 [trunc(DATE-OPTIONAL)].  Full expression: --UNKNOWN EXPRESSION--.. [ 
 65de17df-5c57-44e0-b5ea-653d0ad6afb9 on atsqa4-133.qa.lab:31010 ]
 Error: exception while executing query: Failure while executing query. 
 (state=,code=0)
 {code}
 It also works on boolean data type, whatever it means ... 
 {code}
 0: jdbc:drill:schema=dfs> select cast(c1 as boolean) from `t.json`;
 +------------+
 |   EXPR$0   |
 +------------+
 | true       |
 | false      |
 | false      |
 | true       |
 | true       |
 +------------+
 5 rows selected (0.066 seconds)
 {code}
 The TRUNC function is listed in the external documentation as a math function.
 Please throw a clear error message when any non-numeric data type is passed in.
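The requested behavior can be sketched outside Drill. The following is a minimal Python illustration, not Drill's implementation: the function name `trunc`, its `scale` parameter, and the error wording are assumptions made for this example. It checks the argument type up front and fails with a message that names the offending type, instead of surfacing a missing cast function.

```python
from datetime import date
from numbers import Real


def trunc(value, scale=0):
    """Truncate a numeric value toward zero, rejecting other types up front."""
    # bool is a subclass of int in Python, so it has to be excluded explicitly.
    if isinstance(value, bool) or not isinstance(value, Real):
        raise TypeError(
            f"TRUNC expects a numeric argument, got {type(value).__name__}"
        )
    factor = 10 ** scale
    # int() drops the fractional part, i.e. truncates toward zero.
    return int(value * factor) / factor


if __name__ == "__main__":
    for bad in (True, date(2015, 1, 2), "2015-01-02"):
        try:
            trunc(bad)
        except TypeError as err:
            print(err)
```

The point of the sketch is only the ordering: validate the input type first, then compute, so that the user sees the unsupported type in the message.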




[jira] [Updated] (DRILL-1706) date_sub function does not accept string as input in Drill

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-1706:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 date_sub function does not accept string as input in Drill
 --

 Key: DRILL-1706
 URL: https://issues.apache.org/jira/browse/DRILL-1706
 Project: Apache Drill
  Issue Type: New Feature
  Components: Functions - Drill, Functions - Hive
Affects Versions: 0.7.0
Reporter: Hao Zhu
Assignee: Mehant Baid
 Fix For: 1.0.0


 The date_sub function does not accept a string as input in Drill, although it 
 does in Hive.
 This difference in behavior forces customers to rewrite their queries to 
 cast the argument to date.
 Minimal reproduction:
 {code}
 0: jdbc:drill:zk=local> select date_sub('2014-11-12 16:45:22',15) from 
 dfs.tmp.`drilltest/test.csv` ;
 Query failed: Failure while running fragment., Invalid format: 2014-11-12 
 16:45:22 is malformed at 14-11-12 16:45:22 [ 
 9a6f18da-eb1e-4d91-879a-8d9d528efd59 on 10.250.0.115:31010 ]
   (java.lang.IllegalArgumentException) Invalid format: 2014-11-12 16:45:22 
 is malformed at 14-11-12 16:45:22
 org.joda.time.format.DateTimeFormatter.parseDateTime():873
 org.apache.drill.exec.test.generated.ProjectorGen23.doSetup():63
 org.apache.drill.exec.test.generated.ProjectorGen23.setup():97
 
 org.apache.drill.exec.physical.impl.project.ProjectRecordBatch.setupNewSchema():427
 
 org.apache.drill.exec.physical.impl.project.ProjectRecordBatch.buildSchema():270
 
 org.apache.drill.exec.physical.impl.validate.IteratorValidatorBatchIterator.buildSchema():80
 
 org.apache.drill.exec.physical.impl.ScreenCreator$ScreenRoot.buildSchema():95
 org.apache.drill.exec.work.fragment.FragmentExecutor.run():111
 org.apache.drill.exec.work.WorkManager$RunnableWrapper.run():249
 ...():0
 Error: exception while executing query: Failure while executing query. 
 (state=,code=0)
 {code}
 Compare with Hive, which handles the same call correctly:
 {code}
 0: jdbc:hive2://n1a:1/default> select date_sub('2014-11-12 16:45:22',15) 
 from passwords limit 1 ; 
 +-------------+
 |     _c0     |
 +-------------+
 | 2014-10-28  |
 +-------------+
 1 row selected (6.568 seconds)
 {code}
 Workaround in Drill:
 {code}
 0: jdbc:drill:zk=local> select date_sub(cast('2014-11-12 16:45:22' as 
 date),15) from dfs.tmp.`drilltest/test.csv` ;
 +------------+
 |   EXPR$0   |
 +------------+
 | 2014-10-28 |
 +------------+
 1 row selected (0.082 seconds)
 {code}
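The lenient behavior requested here (accept either a date or a timestamp string) can be illustrated with a small Python sketch. This is not Drill or Hive code; the helper name `date_sub` merely mirrors the SQL function, and the assumed string layout is `YYYY-MM-DD HH:MM:SS`, as in the reproduction above.

```python
from datetime import date, datetime, timedelta


def date_sub(value, days):
    """Subtract `days` from a date, accepting a date or a timestamp string."""
    if isinstance(value, str):
        # Assumed input layout: 'YYYY-MM-DD HH:MM:SS'; the time part is dropped.
        value = datetime.strptime(value, "%Y-%m-%d %H:%M:%S").date()
    return value - timedelta(days=days)


if __name__ == "__main__":
    print(date_sub("2014-11-12 16:45:22", 15))  # 2014-10-28
    print(date_sub(date(2014, 11, 12), 15))     # 2014-10-28
```

Both calls give the same date as the Hive query and the Drill workaround shown above.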




[jira] [Updated] (DRILL-2455) Cluster utility

2015-04-06 Thread Chris Westin (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Westin updated DRILL-2455:

Fix Version/s: Future

 Cluster utility
 ---

 Key: DRILL-2455
 URL: https://issues.apache.org/jira/browse/DRILL-2455
 Project: Apache Drill
  Issue Type: New Feature
  Components: Tools, Build & Test
Reporter: Sudheesh Katkam
Assignee: Sudheesh Katkam
 Fix For: Future


 A bash utility to -download and deploy- *manage* a Drill cluster would be useful 
 for users and developers. 




[jira] [Updated] (DRILL-2691) Source files with Windows line endings

2015-04-06 Thread Deneche A. Hakim (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Deneche A. Hakim updated DRILL-2691:

Assignee: Jacques Nadeau  (was: Deneche A. Hakim)

 Source files with Windows line endings
 --

 Key: DRILL-2691
 URL: https://issues.apache.org/jira/browse/DRILL-2691
 Project: Apache Drill
  Issue Type: Bug
Affects Versions: 0.6.0
Reporter: Deneche A. Hakim
Assignee: Jacques Nadeau
 Fix For: 0.9.0

 Attachments: DRILL-2691.1.patch.txt


 The following files:
 {noformat}
 common/src/main/java/org/apache/drill/common/util/DrillStringUtils.java
 contrib/storage-hbase/src/test/java/org/apache/drill/hbase/TestHBaseCFAsJSONString.java
 {noformat}
 Have Windows line endings in them. Trying to apply a patch that contains 
 changes in one of those files will fail.
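A quick way to find and normalize such files is sketched below in Python. The helper names `has_crlf` and `to_lf` are made up for this illustration and are not part of the Drill build.

```python
from pathlib import Path


def has_crlf(path):
    """Return True if the file contains at least one Windows (CRLF) line ending."""
    return b"\r\n" in Path(path).read_bytes()


def to_lf(path):
    """Rewrite the file in place with Unix (LF) line endings.

    Only CRLF pairs are replaced; a lone carriage return is left untouched.
    """
    p = Path(path)
    p.write_bytes(p.read_bytes().replace(b"\r\n", b"\n"))
```

Working on bytes rather than text avoids any implicit newline translation by the runtime, so the check reports what is actually stored in the file.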




[jira] [Commented] (DRILL-2318) Query fails when an ORDER BY clause is used with WITH-CLAUSE

2015-04-06 Thread Sean Hsuan-Yi Chu (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-2318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481856#comment-14481856
 ] 

Sean Hsuan-Yi Chu commented on DRILL-2318:
--

Resolved on the Calcite side:
 https://issues.apache.org/jira/browse/CALCITE-662

This issue will go away once the rebase is done.

 Query fails when an ORDER BY clause is used with WITH-CLAUSE
 

 Key: DRILL-2318
 URL: https://issues.apache.org/jira/browse/DRILL-2318
 Project: Apache Drill
  Issue Type: Bug
  Components: Query Planning & Optimization
Affects Versions: 0.8.0
Reporter: Abhishek Girish
Assignee: Sean Hsuan-Yi Chu
 Fix For: 1.0.0

 Attachments: drillbit.log


 Adding a WITH clause with a simple CTE causes a query with an ORDER BY to 
 fail. This happens even when the CTE is unrelated to the main query. 
 *The following query fails to execute:*
 {code:sql}
 WITH 
  x
  AS (SELECT  ss_sold_date_sk a1
  FROM  store_sales) 
 SELECT  x.a1
 FROM   x
 ORDER  BY 
   x.a1;
 {code}
 Error:
 Query failed: SqlValidatorException: Table 'x' not found
 Log attached. 
 *The following query executes fine:*
 {code:sql}
 WITH 
  x
  AS (SELECT  ss_sold_date_sk a1
  FROM  store_sales) 
 SELECT  x.a1
 FROM   x
 {code}
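As a point of comparison, the failing query shape is accepted by other SQL engines. The sketch below runs it against an in-memory SQLite database through Python's standard `sqlite3` module. SQLite is used here only as a stand-in engine, and the three sample rows are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE store_sales (ss_sold_date_sk INTEGER)")
# Sample rows are made up; only the query shape matters here.
conn.executemany("INSERT INTO store_sales VALUES (?)", [(3,), (1,), (2,)])

query = """
WITH x AS (SELECT ss_sold_date_sk a1 FROM store_sales)
SELECT x.a1
FROM x
ORDER BY x.a1
"""
rows = [r[0] for r in conn.execute(query)]
print(rows)  # [1, 2, 3]
```

The CTE name `x` stays visible in the ORDER BY clause, which is the behavior expected from Drill once the Calcite fix is picked up.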




[jira] [Resolved] (DRILL-2637) Schema change reported incorrectly although both the input columns are of same datatype

2015-04-06 Thread Sean Hsuan-Yi Chu (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Hsuan-Yi Chu resolved DRILL-2637.
--
Resolution: Fixed

 Schema change reported incorrectly although both the input columns are of 
 same datatype
 ---

 Key: DRILL-2637
 URL: https://issues.apache.org/jira/browse/DRILL-2637
 Project: Apache Drill
  Issue Type: Bug
Affects Versions: 0.9.0
Reporter: Khurram Faraaz
Assignee: Sean Hsuan-Yi Chu

 A schema change is reported incorrectly even though the two columns hold data of 
 the same datatype. Those two columns are the inputs to Union All, and an aggregate 
 count is performed on the results returned by Union All.
 Tests were run on a 4-node cluster.
 {code}
 0: jdbc:drill:> select c1 from (select columns[0] c1 from `testWindow.csv`) 
 union all (select columns[0] c2 from `testWindow.csv`);
 +------------+
 |     c1     |
 +------------+
 | 100        |
 | 10         |
 | 2          |
 | 50         |
 | 55         |
 | 67         |
 | 113        |
 | 119        |
 | 89         |
 | 57         |
 | 61         |
 | 100        |
 | 10         |
 | 2          |
 | 50         |
 | 55         |
 | 67         |
 | 113        |
 | 119        |
 | 89         |
 | 57         |
 | 61         |
 +------------+
 22 rows selected (0.121 seconds)
 {code}
 {code}
 0: jdbc:drill:> select count(c1) from (select columns[0] c1 from 
 `testWindow.csv`) union all (select columns[0] c2 from `testWindow.csv`);
 +------------+
 |   EXPR$0   |
 +------------+
 Query failed: Query stopped., Schema change detected in the left input of 
 Union-All. This is not currently supported [ 
 57dd6384-fb23-4ab0-aee9-fb7def390788 on centos-04.qa.lab:31010 ]
 java.lang.RuntimeException: java.sql.SQLException: Failure while executing 
 query.
   at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
   at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
   at sqlline.SqlLine.print(SqlLine.java:1809)
   at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
   at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
   at sqlline.SqlLine.dispatch(SqlLine.java:889)
   at sqlline.SqlLine.begin(SqlLine.java:763)
   at sqlline.SqlLine.start(SqlLine.java:498)
   at sqlline.SqlLine.main(SqlLine.java:460)
 {code}
 Stack trace from drillbit.log
 {code}
 2015-03-31 20:10:07,825 [2ae500df-db85-2583-fa7f-b89beb7e5ac0:frag:0:0] ERROR 
 o.a.drill.exec.work.foreman.Foreman - Error 
 0b4d9b3a-d8af-4dc9-be47-46c4547a793a: RemoteRpcException: Failure while 
 running fragment., Schema change detected in the left input of Union-All. 
 This is not currently supported [ b9555eb8-c009-4e9c-b058-ffae3f015df7 on 
 centos-04.qa.lab:31010 ]
 [ b9555eb8-c009-4e9c-b058-ffae3f015df7 on centos-04.qa.lab:31010 ]
 org.apache.drill.exec.rpc.RemoteRpcException: Failure while running 
 fragment., Schema change detected in the left input of Union-All. This is not 
 currently supported [ b9555eb8-c009-4e9c-b058-ffae3f015df7 on 
 centos-04.qa.lab:31010 ]
 [ b9555eb8-c009-4e9c-b058-ffae3f015df7 on centos-04.qa.lab:31010 ]
 at 
 org.apache.drill.exec.work.foreman.QueryManager.statusUpdate(QueryManager.java:163)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.foreman.QueryManager$RootStatusReporter.statusChange(QueryManager.java:281)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:114)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:110)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.fragment.FragmentExecutor.internalFail(FragmentExecutor.java:230)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:165)
  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38)
  [drill-common-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
  [na:1.7.0_75]
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
  [na:1.7.0_75]
 at java.lang.Thread.run(Thread.java:745) [na:1.7.0_75]
 2015-03-31 20:10:07,825 [2ae500df-db85-2583-fa7f-b89beb7e5ac0:frag:0:0] WARN  
 o.a.d.e.w.fragment.FragmentExecutor - Error while initializing or executing 
 fragment
 java.lang.RuntimeException: Error closing fragment

[jira] [Updated] (DRILL-2200) Obscure error message from kvgen when the field does not exist in a parquet file

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2200:
-
Fix Version/s: (was: 1.0.0)
   1.1.0

 Obscure error message from kvgen when the field does not exist in a parquet 
 file
 

 Key: DRILL-2200
 URL: https://issues.apache.org/jira/browse/DRILL-2200
 Project: Apache Drill
  Issue Type: Bug
  Components: Functions - Drill
Reporter: Rahul Challapalli
Assignee: Mehant Baid
Priority: Minor
 Fix For: 1.1.0

 Attachments: data.parquet


 git.commit.id.abbrev=3d863b5
 JSON data set:
 {code}
 {
   id:1,
   map: {
   map1:{col1:1, col2:2}
   }
 }
 {code}
 Equivalent Parquet File is attached.
 The response below properly indicates the reason for the failure:
 {code}
 select kvgen(t.map.abcd) from dfs.flatten_operators.`data.json` t;
 Query failed: RemoteRpcException: Failure while running fragment., kvgen 
 function only supports Simple maps as input [ 
 9b441359-9f04-4199-8cca-cdc23aaab517 on qa-node191.qa.lab:31010 ]
 [ 9b441359-9f04-4199-8cca-cdc23aaab517 on qa-node191.qa.lab:31010 ]
 {code}
 However, the same query on an equivalent Parquet file returns an obscure error 
 message:
 {code}
 select kvgen(t.map.abcd) from `data.parquet` t;
 Query failed: RemoteRpcException: Failure while running fragment., Unable to 
 find holder type for minorType: LATE [ 9697f4aa-1e8d-4221-8e7f-4d374ffbd916 
 on qa-node191.qa.lab:31010 ]
 [ 9697f4aa-1e8d-4221-8e7f-4d374ffbd916 on qa-node191.qa.lab:31010 ]
 {code}
 However, if we apply kvgen to a first-level non-existent field, we get back a 
 proper error message even for a Parquet file:
 {code}
 select kvgen(t.map1) from `data.parquet` t;
 Query failed: RemoteRpcException: Failure while running fragment., kvgen 
 function only supports Simple maps as input [ 
 1d1c4ca3-ceda-4320-8ff8-fe08afe32090 on qa-node191.qa.lab:31010 ]
 [ 1d1c4ca3-ceda-4320-8ff8-fe08afe32090 on qa-node191.qa.lab:31010 ]
 {code}




[jira] [Resolved] (DRILL-2226) Create test utilities for checking plans for patterns

2015-04-06 Thread Jason Altekruse (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Altekruse resolved DRILL-2226.

   Resolution: Fixed
Fix Version/s: (was: 1.0.0)
   0.8.0

Fixed in ed397862eb9584572aa0fcb684dfc9554b00cf60

 Create test utilities for checking plans for patterns
 -

 Key: DRILL-2226
 URL: https://issues.apache.org/jira/browse/DRILL-2226
 Project: Apache Drill
  Issue Type: Improvement
  Components: Tools, Build & Test
Reporter: Jason Altekruse
Assignee: Jason Altekruse
 Fix For: 0.8.0

 Attachments: DRILL-2226.patch


 Regex matching for Calcite text-format plans, including expected and excluded 
 pattern matching.
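The idea of expected and excluded patterns can be sketched as follows. This is a standalone Python illustration, not the utility added by the patch; the function name `check_plan` and the sample plan text are hypothetical.

```python
import re


def check_plan(plan_text, expected=(), excluded=()):
    """Return a list of failure messages; an empty list means the plan passes."""
    failures = []
    for pattern in expected:
        # DOTALL lets '.' span line breaks, since plans are multi-line text.
        if not re.search(pattern, plan_text, re.DOTALL):
            failures.append(f"expected pattern not found: {pattern}")
    for pattern in excluded:
        if re.search(pattern, plan_text, re.DOTALL):
            failures.append(f"excluded pattern found: {pattern}")
    return failures


if __name__ == "__main__":
    # Hypothetical plan text, made up for this example.
    plan = "Project(c1=[$0])\n  UnionAll(all=[true])\n    Scan(table=[[dfs, tmp, t]])"
    print(check_plan(plan, expected=[r"UnionAll.*Scan"], excluded=[r"Sort"]))
```

Collecting every failure rather than stopping at the first one makes a failing test report all mismatched patterns at once.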




[jira] [Updated] (DRILL-1914) Commands given in SQLLine !help don't work as expected

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-1914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-1914:
-
Fix Version/s: (was: 1.0.0)
   1.1.0

 Commands given in SQLLine !help don't work as expected
 --

 Key: DRILL-1914
 URL: https://issues.apache.org/jira/browse/DRILL-1914
 Project: Apache Drill
  Issue Type: Wish
  Components: Client - CLI
Affects Versions: 0.6.0
Reporter: irfan
Assignee: Daniel Barclay (Drill)
Priority: Minor
 Fix For: 1.1.0


 How do we use the help utility in Apache Drill? We tried !help, but it only gives 
 a list of commands such as !sql, !tables, and !run. We want to know the usage of reserved 
 keywords like RANK and ROW_NUM. Please respond if you have any ideas.




[jira] [Comment Edited] (DRILL-2639) Planner bug - RelOptPlanner.CannotPlanException

2015-04-06 Thread Sean Hsuan-Yi Chu (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-2639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481872#comment-14481872
 ] 

Sean Hsuan-Yi Chu edited comment on DRILL-2639 at 4/6/15 8:27 PM:
--

DRILL-600 was based on the previous implementation of Union-All. Basically, we 
rewrote it from scratch. Thanks!


was (Author: seanhychu):
DRILL-600 was based on the previous implementation of Union-All. Basically, we 
re-did a brand new one.

 Planner bug - RelOptPlanner.CannotPlanException
 ---

 Key: DRILL-2639
 URL: https://issues.apache.org/jira/browse/DRILL-2639
 Project: Apache Drill
  Issue Type: Bug
  Components: Query Planning & Optimization
Affects Versions: 0.9.0
 Environment: | 9d92b8e319f2d46e8659d903d355450e15946533 | DRILL-2580: 
 Exit early from HashJoinBatch if build side is empty | 26.03.2015 @ 16:13:53 
 EDT | Unknown | 26.03.2015 @ 16:53:21 EDT |
Reporter: Khurram Faraaz
Assignee: Sean Hsuan-Yi Chu
Priority: Critical
 Fix For: 0.9.0


 Reporting this as a separate JIRA because this issue relates to a bug in the 
 planner. Performing an aggregate on the output returned by Union All results in a 
 CannotPlanException. Note that the two inputs to Union All are cast to 
 integer, so the inputs from both legs are of the same datatype. 
 {code}
 0: jdbc:drill:> select count(c1) from (select cast(columns[0] as int) c1 from 
 `testWindow.csv`) union all (select cast(columns[0] as int) c2 from 
 `testWindow.csv`);
 Query failed: RelOptPlanner.CannotPlanException: Node 
 [rel#59393:Subset#4.LOGICAL.ANY([]).[]] could not be implemented; planner 
 state:
 Root: rel#59393:Subset#4.LOGICAL.ANY([]).[]
 Original rel:
 AbstractConverter(subset=[rel#59393:Subset#4.LOGICAL.ANY([]).[]], 
 convention=[LOGICAL], DrillDistributionTraitDef=[ANY([])], sort=[[]]): 
 rowcount = 1.7976931348623157E308, cumulative cost = {inf}, id = 59394
   UnionRel(subset=[rel#59392:Subset#4.NONE.ANY([]).[]], all=[true]): rowcount 
 = 1.7976931348623157E308, cumulative cost = {1.7976931348623157E308 rows, 
 1.7976931348623157E308 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 59391
 AggregateRel(subset=[rel#59388:Subset#2.NONE.ANY([]).[]], group=[{}], 
 EXPR$0=[COUNT($0)]): rowcount = 1.7976931348623158E307, cumulative cost = 
 {1.7976931348623158E307 rows, 0.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 
 59387
   ProjectRel(subset=[rel#59386:Subset#1.NONE.ANY([]).[]], 
 c1=[CAST(ITEM($1, 0)):INTEGER]): rowcount = 100.0, cumulative cost = {100.0 
 rows, 100.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 59385
 
 EnumerableTableAccessRel(subset=[rel#59384:Subset#0.ENUMERABLE.ANY([]).[]], 
 table=[[dfs, tmp, testWindow.csv]]): rowcount = 100.0, cumulative cost = 
 {100.0 rows, 101.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 59368
 ProjectRel(subset=[rel#59390:Subset#3.NONE.ANY([]).[]], c2=[CAST(ITEM($1, 
 0)):INTEGER]): rowcount = 100.0, cumulative cost = {100.0 rows, 100.0 cpu, 
 0.0 io, 0.0 network, 0.0 memory}, id = 59389
   
 EnumerableTableAccessRel(subset=[rel#59384:Subset#0.ENUMERABLE.ANY([]).[]], 
 table=[[dfs, tmp, testWindow.csv]]): rowcount = 100.0, cumulative cost = 
 {100.0 rows, 101.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 59368
 Sets:
 Set#0, type: (DrillRecordRow[*, columns])
   rel#59384:Subset#0.ENUMERABLE.ANY([]).[], best=rel#59368, 
 importance=0.6561
   
 rel#59368:EnumerableTableAccessRel.ENUMERABLE.ANY([]).[](table=[dfs, tmp, 
 testWindow.csv]), rowcount=100.0, cumulative cost={100.0 rows, 101.0 cpu, 0.0 
 io, 0.0 network, 0.0 memory}
   
 rel#59408:AbstractConverter.ENUMERABLE.ANY([]).[](child=rel#59407:Subset#0.LOGICAL.ANY([]).[],convention=ENUMERABLE,DrillDistributionTraitDef=ANY([]),sort=[]),
  rowcount=1.0, cumulative cost={inf}
   rel#59407:Subset#0.LOGICAL.ANY([]).[], best=rel#59415, 
 importance=0.59049001
   
 rel#59409:AbstractConverter.LOGICAL.ANY([]).[](child=rel#59384:Subset#0.ENUMERABLE.ANY([]).[],convention=LOGICAL,DrillDistributionTraitDef=ANY([]),sort=[]),
  rowcount=100.0, cumulative cost={inf}
   rel#59415:DrillScanRel.LOGICAL.ANY([]).[](table=[dfs, tmp, 
 testWindow.csv],groupscan=EasyGroupScan [selectionRoot=/tmp/testWindow.csv, 
 numFiles=1, columns=[`*`], files=[maprfs:/tmp/testWindow.csv]]), 
 rowcount=1.0, cumulative cost={1.0 rows, 1.0 cpu, 0.0 io, 0.0 network, 
 0.0 memory}
 Set#1, type: RecordType(INTEGER c1)
   rel#59386:Subset#1.NONE.ANY([]).[], best=null, 
 importance=0.7291
   
 rel#59385:ProjectRel.NONE.ANY([]).[](child=rel#59384:Subset#0.ENUMERABLE.ANY([]).[],c1=CAST(ITEM($1,
  0)):INTEGER), rowcount=100.0, cumulative cost={inf}
   rel#59404:Subset#1.LOGICAL.ANY([]).[],

[jira] [Updated] (DRILL-2108) Running SQLLine with bad command syntax yields index exception (not usage message)

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2108:
-
Fix Version/s: (was: 1.0.0)
   Future

 Running SQLLine with bad command syntax yields index exception (not usage 
 message)
 --

 Key: DRILL-2108
 URL: https://issues.apache.org/jira/browse/DRILL-2108
 Project: Apache Drill
  Issue Type: Bug
  Components: Client - CLI
Reporter: Daniel Barclay (Drill)
Assignee: Daniel Barclay (Drill)
Priority: Minor
 Fix For: Future


 Running .../sqlline -u yields an ArrayIndexOutOfBoundsException and a Java 
 stack trace (rather than reporting the command syntax error and giving a 
 usage message).
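The behaviour the report asks for can be sketched as a bounds check before reading the value that follows an option. This is a minimal illustration only: `UrlArgCheck` and `urlArgument` are hypothetical names, not sqlline's actual parsing code, and the error text is a placeholder.

```java
import java.util.Optional;

/** Hypothetical argument check; not sqlline's actual parsing code. */
final class UrlArgCheck {
    /** Returns the value following "-u", or empty if "-u" is absent or has no value. */
    static Optional<String> urlArgument(String[] args) {
        for (int i = 0; i < args.length; i++) {
            if ("-u".equals(args[i])) {
                // Check the bound first: an unchecked read of args[i + 1] is the
                // kind of access that can raise ArrayIndexOutOfBoundsException
                // when "-u" is the last token on the command line.
                return (i + 1 < args.length) ? Optional.of(args[i + 1]) : Optional.empty();
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        Optional<String> url = urlArgument(args);
        if (url.isPresent()) {
            System.out.println("would connect to " + url.get());
        } else {
            // Placeholder text; the real usage message is not given in the report.
            System.err.println("error: -u requires a value (a usage message would go here)");
        }
    }
}
```

With this shape, a missing value becomes an ordinary "no URL" result that the caller can turn into a usage message instead of a stack trace.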




[jira] [Updated] (DRILL-2259) Projecting a nested repeated map results in IOOBE when the data has schema changes

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2259:
-
Fix Version/s: (was: 1.0.0)
   1.1.0

 Projecting a nested repeated map results in IOOBE when the data has schema 
 changes
 --

 Key: DRILL-2259
 URL: https://issues.apache.org/jira/browse/DRILL-2259
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Reporter: Rahul Challapalli
Assignee: Hanifi Gunes
 Fix For: 1.1.0

 Attachments: error.log, file1.json, file2.json, file3.json


 git.commit.id.abbrev=6676f2d
 The data set contains 3 files with schema changes among them. By schema 
 changes I mean fields being added or removed between records/files.
 The query below does a simple projection of a field that has no schema 
 change and is present in all the records, yet it fails:
 {code}
 select d.type type, d.map.rm[1].rptd  from `schema_changes` d;
 +------------+------------+
 |    type    |   EXPR$1   |
 +------------+------------+
 | web        | [{a:bar},{c:1},{d:4.5}] |
 | store      | [{a:bar},{c:-1},{d:4.5}] |
 Query failed: RemoteRpcException: Failure while running fragment., index: -4, 
 length: 4 (expected: range(0, 16384)) [ 289ca604-0ece-45ff-94ec-35c11264d7a8 
 on qa-node190.qa.lab:31010 ]
 [ 289ca604-0ece-45ff-94ec-35c11264d7a8 on qa-node190.qa.lab:31010 ]
 java.lang.RuntimeException: java.sql.SQLException: Failure while executing 
 query.
   at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
   at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
   at sqlline.SqlLine.print(SqlLine.java:1809)
   at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
   at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
   at sqlline.SqlLine.dispatch(SqlLine.java:889)
   at sqlline.SqlLine.begin(SqlLine.java:763)
   at sqlline.SqlLine.start(SqlLine.java:498)
   at sqlline.SqlLine.main(SqlLine.java:460)
 {code}
 The query below succeeds when we explicitly read from any single file:
 {code}
 select d.type type, d.map.rm[1].rptd  from `schema_changes/file3.json` d;
 +------------+------------+
 |    type    |   EXPR$1   |
 +------------+------------+
 | web        | [{a:bar},{c:1},{d:4.5}] |
 | store      | [{a:bar},{c:-1},{d:4.5}] |
 +------------+------------+
 2 rows selected (0.064 seconds)
 {code}
 I attached the data set and the error from the logs




[jira] [Updated] (DRILL-2456) regexp_replace using hex codes fails on larger JSON data sets

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2456:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 regexp_replace using hex codes fails on larger JSON data sets
 -

 Key: DRILL-2456
 URL: https://issues.apache.org/jira/browse/DRILL-2456
 Project: Apache Drill
  Issue Type: Bug
  Components: Functions - Drill
Affects Versions: 0.7.0
 Environment: Drill 0.7
 MapR 4.0.1
 CentOS
Reporter: Andries Engelbrecht
Assignee: Mehant Baid
 Fix For: 1.0.0

 Attachments: drillbit.log


 This query works when it reads only 1 file:
 select regexp_replace(`text`, '[^\x20-\xad]', '°'), count(id)  from 
 dfs.twitter.`/feed/2015/03/13/17/FlumeData.1426267859699.json` group by 
 `text` order by count(id) desc limit 10;
 This one fails when it reads multiple files:
 select regexp_replace(`text`, '[^\x20-\xad]', '°'), count(id)  from 
 dfs.twitter.`/feed/2015/03/13` group by `text` order by count(id) desc limit 
 10;
 Query failed: Query failed: Failure while trying to start remote fragment, 
 Encountered an illegal char on line 1, column 31: '' [ 
 43ff1aa4-4a71-455d-b817-ec5eb8d179bb on twitternode:31010 ]
 Using literal characters instead of hex codes in regexp_replace does work 
 for the same data set. This query works fine on the full data set:
 select regexp_replace(`text`, '[^ -~¡-ÿ]', '°'), count(id)  from 
 dfs.twitter.`/feed/2015/03/13` group by `text` order by count(id) desc limit 
 10;
 Attached snippet drillbit.log for error
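For reference, the two character classes quoted above can be compared in isolation. The sketch below uses `java.util.regex`, which is an assumption (the report does not say which regex engine evaluates `regexp_replace`), and `HexClassDemo` is a hypothetical name. Note that the two classes do not cover the same range: the hex form keeps U+0020 through U+00AD, while the literal form keeps U+0020 through U+007E plus U+00A1 through U+00FF.

```java
import java.util.regex.Pattern;

/** Illustrative only: compares the two character classes from the report. */
final class HexClassDemo {
    // Hex-escape form: keeps U+0020..U+00AD, replaces everything else.
    static final Pattern HEX = Pattern.compile("[^\\x20-\\xad]");
    // Literal form: keeps U+0020..U+007E and U+00A1..U+00FF.
    static final Pattern LITERAL = Pattern.compile("[^ -~\u00a1-\u00ff]");
    // The degree sign used as the replacement in the report.
    static final String DEGREE = "\u00b0";

    static String replaceHex(String text) {
        return HEX.matcher(text).replaceAll(DEGREE);
    }

    static String replaceLiteral(String text) {
        return LITERAL.matcher(text).replaceAll(DEGREE);
    }

    public static void main(String[] args) {
        // An accented letter (U+00E9) and a snowman (U+2603) as test input.
        String sample = "caf\u00e9 \u2603";
        System.out.println(replaceHex(sample));
        System.out.println(replaceLiteral(sample));
    }
}
```

Both patterns compile and run in a standalone JVM, which is consistent with the report's observation that the failure shows up only when the query is shipped to remote fragments.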




[jira] [Updated] (DRILL-2282) Eliminate spaces, special characters from names in function templates

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2282:
-
Fix Version/s: (was: 0.9.0)
   1.1.0

 Eliminate spaces, special characters from names in function templates
 -

 Key: DRILL-2282
 URL: https://issues.apache.org/jira/browse/DRILL-2282
 Project: Apache Drill
  Issue Type: Bug
  Components: Functions - Drill
Reporter: Mehant Baid
Assignee: Mehant Baid
 Fix For: 1.1.0

 Attachments: DRILL-2282.patch


 Having spaces in the names of functions causes issues while deserializing 
 such expressions when we try to read the plan fragment. As part of this JIRA, 
 I would like to clean up all the templates so that their names do not include 
 special characters.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (DRILL-2457) aggregation function on complex type should give appropriate error

2015-04-06 Thread Chun Chang (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chun Chang updated DRILL-2457:
--
Priority: Minor  (was: Critical)

 aggregation function on complex type should give appropriate error 
 ---

 Key: DRILL-2457
 URL: https://issues.apache.org/jira/browse/DRILL-2457
 Project: Apache Drill
  Issue Type: Bug
  Components: Functions - Drill
Affects Versions: 0.8.0
Reporter: Chun Chang
Assignee: Chun Chang
Priority: Minor
 Fix For: 0.9.0


 #Fri Mar 06 16:51:10 EST 2015
 git.commit.id.abbrev=fb293ba
 Applying an aggregation function to complex data should fail with an 
 appropriate error message instead of returning null.
 {code}
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ> select t.soa[1] from 
 `complex.json` t limit 1;
 +------------+
 |   EXPR$0   |
 +------------+
 | {in:1,fl:1.12345} |
 +------------+
 1 row selected (0.169 seconds)
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ> select avg(t.soa[1]) from 
 `complex.json` t;
 +--+
 |  |
 +--+
 +--+
 No rows selected (0.164 seconds)
 {code}




[jira] [Commented] (DRILL-2697) Pause injections should pause indefinitely until signalled

2015-04-06 Thread Chris Westin (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-2697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481630#comment-14481630
 ] 

Chris Westin commented on DRILL-2697:
-

Note this should speed up any tests that require pauses for synchronizing 
execution lifecycle with test steps, because they wouldn't have to wait for an 
arbitrary period of time, but can instead progress when ready. If we find such 
tests are common, and are starting to take a lot of time, we may want to 
prioritize this.

 Pause injections should pause indefinitely until signalled
 --

 Key: DRILL-2697
 URL: https://issues.apache.org/jira/browse/DRILL-2697
 Project: Apache Drill
  Issue Type: Improvement
  Components: Execution - Flow
Affects Versions: 0.9.0
Reporter: Sudheesh Katkam
 Fix For: 1.1.0


 Currently, injected pauses make threads sleep for a specified time. This 
 can be enhanced to pause the thread indefinitely using a CountDownLatch. It 
 is quite similar to how cancellation works. 
 Tasks: 
 (a) Add another message to the RPC layer to signal paused remote threads to 
 resume (through ControlHandler) by counting down. This gets complicated if 
 the thread has not reached the pause site yet.
 (b) Add resume signal (like ctrl-c) to sqlline 
 (further enhancement: another signal to trigger pause from sqlline)
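The latch-based pause described above can be sketched roughly as follows. This is a minimal local illustration under stated assumptions: `PauseSite`, `pause` and `resume` are hypothetical names, and the RPC/ControlHandler signalling from task (a) is not modelled. One property worth noting is that a latch created ahead of time remembers an early `countDown()`, so a resume that arrives before the thread reaches the pause site is not lost.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

/** Hypothetical pause point backed by a single-use latch; not Drill's injection code. */
final class PauseSite {
    private final CountDownLatch latch = new CountDownLatch(1);

    /** Blocks the calling thread until resume() has been called (before or after). */
    void pause() throws InterruptedException {
        latch.await();
    }

    /** Releases any paused thread; later pause() calls return immediately. */
    void resume() {
        latch.countDown();
    }

    /** Starts a worker that pauses, then checks it only proceeds after resume(). */
    static boolean pausesUntilResumed() {
        PauseSite site = new PauseSite();
        CountDownLatch passed = new CountDownLatch(1);
        Thread worker = new Thread(() -> {
            try {
                site.pause();
                passed.countDown();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        worker.start();
        try {
            boolean passedEarly = passed.await(100, TimeUnit.MILLISECONDS);
            site.resume();
            boolean passedAfter = passed.await(5, TimeUnit.SECONDS);
            worker.join();
            return !passedEarly && passedAfter;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }

    /** Resume arrives first; the later pause() must not block. */
    static boolean earlyResumeIsRemembered() {
        PauseSite site = new PauseSite();
        site.resume();
        try {
            site.pause();
            return true;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println("pauses until resumed: " + pausesUntilResumed());
        System.out.println("early resume remembered: " + earlyResumeIsRemembered());
    }
}
```

Unlike a timed sleep, the test driver decides when the paused thread continues, so no arbitrary wait period is needed.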




[jira] [Updated] (DRILL-2688) Use of ORDER BY on right side of Union All results in SqlValidatorException

2015-04-06 Thread Sean Hsuan-Yi Chu (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Hsuan-Yi Chu updated DRILL-2688:
-
Priority: Minor  (was: Major)

 Use of ORDER BY on right side of Union All results in SqlValidatorException
 ---

 Key: DRILL-2688
 URL: https://issues.apache.org/jira/browse/DRILL-2688
 Project: Apache Drill
  Issue Type: Bug
  Components: Query Planning & Optimization
Affects Versions: 0.9.0
 Environment:  9d92b8e319f2d46e8659d903d355450e15946533 | DRILL-2580: 
 Exit early from HashJoinBatch if build side is empty | 26.03.2015 @ 16:13:53 
 EDT
Reporter: Khurram Faraaz
Assignee: Sean Hsuan-Yi Chu
Priority: Minor
 Fix For: 1.0.0


 Order by in the right leg of Union All results in SqlValidatorException.
 Executing the query individually (the query that is on right side of Union 
 All) gives expected results and no Exceptions are seen. However, executing 
 the query as part of the right leg of Union All results in Exception.
 Tests were executed on 4 node cluster on CentOS. Both sides of Union All got 
 input from CSV files.
 The query below returns correct results when executed individually. (this is 
 the same query used on right side of Union All query)
 {code}
 0: jdbc:drill:> select columns[0] from `testWindow.csv` t2 where 
 t2.columns[0] is not null group by columns[0] order by columns[0];
 +------------+
 |   EXPR$0   |
 +------------+
 | 10         |
 | 100        |
 | 113        |
 | 119        |
 | 2          |
 | 50         |
 | 55         |
 | 57         |
 | 61         |
 | 67         |
 | 89         |
 +------------+
 11 rows selected (0.265 seconds)
 {code}
 Note that the Union All query works when there is no order by in the right 
 leg.
 {code}
 0: jdbc:drill:> select columns[0] from `employee.csv` t1 where 
 t1.columns[0]=1 union all select columns[0] from `testWindow.csv` t2 where 
 t2.columns[0] is not null group by columns[0];
 +------------+
 |   EXPR$0   |
 +------------+
 | 1          |
 | 100        |
 | 10         |
 | 2          |
 | 50         |
 | 55         |
 | 67         |
 | 113        |
 | 119        |
 | 89         |
 | 57         |
 | 61         |
 +------------+
 12 rows selected (0.175 seconds)
 {code}
 In this case we do use an order by on the right side of Union All, which 
 results in SqlValidatorException. Same query as above, except that we have an 
 order by clause in the query on the right side of Union All.
 {code}
 0: jdbc:drill:> select columns[0] from `employee.csv` t1 where 
 t1.columns[0]=1 union all select columns[0] from `testWindow.csv` t2 where 
 t2.columns[0] is not null group by columns[0] order by columns[0];
 Query failed: SqlValidatorException: Column 'columns' not found in any table
 Error: exception while executing query: Failure while executing query. 
 (state=,code=0)
 {code}
 Stack trace from drillbit.log 
 {code}
 2015-04-04 04:31:22,882 [2ae096e5-4085-de95-5642-ae10d320a8f7:foreman] ERROR 
 o.a.drill.exec.work.foreman.Foreman - Error 
 c6a60f71-c959-4ae5-ba8c-23b9c7b6bb7e: SqlValidatorException: Column 'columns' 
 not found in any table
 org.apache.drill.exec.planner.sql.QueryInputException: Failure validating 
 SQL. org.eigenbase.util.EigenbaseContextException: From line 1, column 178 to 
 line 1, column 184: Column 'columns' not found in any table
 at 
 org.apache.drill.exec.planner.sql.DrillSqlWorker.getPlan(DrillSqlWorker.java:147)
  ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.foreman.Foreman.runSQL(Foreman.java:773) 
 ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at org.apache.drill.exec.work.foreman.Foreman.run(Foreman.java:204) 
 ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
  [na:1.7.0_75]
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
  [na:1.7.0_75]
 at java.lang.Thread.run(Thread.java:745) [na:1.7.0_75]
 Caused by: net.hydromatic.optiq.tools.ValidationException: 
 org.eigenbase.util.EigenbaseContextException: From line 1, column 178 to line 
 1, column 184: Column 'columns' not found in any table
 at 
 net.hydromatic.optiq.prepare.PlannerImpl.validate(PlannerImpl.java:176) 
 ~[optiq-core-0.9-drill-r20.jar:na]
 at 
 org.apache.drill.exec.planner.sql.handlers.DefaultSqlHandler.validateNode(DefaultSqlHandler.java:157)
  ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.planner.sql.handlers.DefaultSqlHandler.getPlan(DefaultSqlHandler.java:133)
  ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at

[jira] [Updated] (DRILL-2688) Use of ORDER BY on right side of Union All results in SqlValidatorException

2015-04-06 Thread Sean Hsuan-Yi Chu (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Hsuan-Yi Chu updated DRILL-2688:
-
Fix Version/s: 1.0.0

 Use of ORDER BY on right side of Union All results in SqlValidatorException
 ---

 Key: DRILL-2688
 URL: https://issues.apache.org/jira/browse/DRILL-2688
 Project: Apache Drill
  Issue Type: Bug
  Components: Query Planning & Optimization
Affects Versions: 0.9.0
 Environment:  9d92b8e319f2d46e8659d903d355450e15946533 | DRILL-2580: 
 Exit early from HashJoinBatch if build side is empty | 26.03.2015 @ 16:13:53 
 EDT
Reporter: Khurram Faraaz
Assignee: Sean Hsuan-Yi Chu
Priority: Minor
 Fix For: 1.0.0


 Order by in the right leg of Union All results in SqlValidatorException.
 Executing the query individually (the query that is on right side of Union 
 All) gives expected results and no Exceptions are seen. However, executing 
 the query as part of the right leg of Union All results in Exception.
 Tests were executed on 4 node cluster on CentOS. Both sides of Union All got 
 input from CSV files.
 The query below returns correct results when executed individually. (this is 
 the same query used on right side of Union All query)
 {code}
 0: jdbc:drill:> select columns[0] from `testWindow.csv` t2 where 
 t2.columns[0] is not null group by columns[0] order by columns[0];
 +------------+
 |   EXPR$0   |
 +------------+
 | 10         |
 | 100        |
 | 113        |
 | 119        |
 | 2          |
 | 50         |
 | 55         |
 | 57         |
 | 61         |
 | 67         |
 | 89         |
 +------------+
 11 rows selected (0.265 seconds)
 {code}
 Note that the Union All query works when there is no order by in the right 
 leg.
 {code}
 0: jdbc:drill:> select columns[0] from `employee.csv` t1 where 
 t1.columns[0]=1 union all select columns[0] from `testWindow.csv` t2 where 
 t2.columns[0] is not null group by columns[0];
 +------------+
 |   EXPR$0   |
 +------------+
 | 1          |
 | 100        |
 | 10         |
 | 2          |
 | 50         |
 | 55         |
 | 67         |
 | 113        |
 | 119        |
 | 89         |
 | 57         |
 | 61         |
 +------------+
 12 rows selected (0.175 seconds)
 {code}
 In this case we do use an order by on the right side of Union All, which 
 results in SqlValidatorException. Same query as above, except that we have an 
 order by clause in the query on the right side of Union All.
 {code}
 0: jdbc:drill:> select columns[0] from `employee.csv` t1 where 
 t1.columns[0]=1 union all select columns[0] from `testWindow.csv` t2 where 
 t2.columns[0] is not null group by columns[0] order by columns[0];
 Query failed: SqlValidatorException: Column 'columns' not found in any table
 Error: exception while executing query: Failure while executing query. 
 (state=,code=0)
 {code}
 Stack trace from drillbit.log 
 {code}
 2015-04-04 04:31:22,882 [2ae096e5-4085-de95-5642-ae10d320a8f7:foreman] ERROR 
 o.a.drill.exec.work.foreman.Foreman - Error 
 c6a60f71-c959-4ae5-ba8c-23b9c7b6bb7e: SqlValidatorException: Column 'columns' 
 not found in any table
 org.apache.drill.exec.planner.sql.QueryInputException: Failure validating 
 SQL. org.eigenbase.util.EigenbaseContextException: From line 1, column 178 to 
 line 1, column 184: Column 'columns' not found in any table
 at 
 org.apache.drill.exec.planner.sql.DrillSqlWorker.getPlan(DrillSqlWorker.java:147)
  ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.work.foreman.Foreman.runSQL(Foreman.java:773) 
 ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at org.apache.drill.exec.work.foreman.Foreman.run(Foreman.java:204) 
 ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
  [na:1.7.0_75]
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
  [na:1.7.0_75]
 at java.lang.Thread.run(Thread.java:745) [na:1.7.0_75]
 Caused by: net.hydromatic.optiq.tools.ValidationException: 
 org.eigenbase.util.EigenbaseContextException: From line 1, column 178 to line 
 1, column 184: Column 'columns' not found in any table
 at 
 net.hydromatic.optiq.prepare.PlannerImpl.validate(PlannerImpl.java:176) 
 ~[optiq-core-0.9-drill-r20.jar:na]
 at 
 org.apache.drill.exec.planner.sql.handlers.DefaultSqlHandler.validateNode(DefaultSqlHandler.java:157)
  ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at 
 org.apache.drill.exec.planner.sql.handlers.DefaultSqlHandler.getPlan(DefaultSqlHandler.java:133)
  ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
 at

[jira] [Commented] (DRILL-2639) Planner bug - RelOptPlanner.CannotPlanException

2015-04-06 Thread Sean Hsuan-Yi Chu (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-2639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481882#comment-14481882
 ] 

Sean Hsuan-Yi Chu commented on DRILL-2639:
--

Under review at: https://reviews.apache.org/r/32886/

 Planner bug - RelOptPlanner.CannotPlanException
 ---

 Key: DRILL-2639
 URL: https://issues.apache.org/jira/browse/DRILL-2639
 Project: Apache Drill
  Issue Type: Bug
  Components: Query Planning & Optimization
Affects Versions: 0.9.0
 Environment: | 9d92b8e319f2d46e8659d903d355450e15946533 | DRILL-2580: 
 Exit early from HashJoinBatch if build side is empty | 26.03.2015 @ 16:13:53 
 EDT | Unknown | 26.03.2015 @ 16:53:21 EDT |
Reporter: Khurram Faraaz
Assignee: Sean Hsuan-Yi Chu
Priority: Critical
 Fix For: 0.9.0


 Reporting this as a separate JIRA as this issue is related to a bug in the 
 planner. Performing an aggregate on the output returned by Union All results 
 in CannotPlanException. Note that the two inputs to Union All are cast to 
 integer and hence the inputs from both legs are of the same datatype. 
 {code}
 0: jdbc:drill:> select count(c1) from (select cast(columns[0] as int) c1 from 
 `testWindow.csv`) union all (select cast(columns[0] as int) c2 from 
 `testWindow.csv`);
 Query failed: RelOptPlanner.CannotPlanException: Node 
 [rel#59393:Subset#4.LOGICAL.ANY([]).[]] could not be implemented; planner 
 state:
 Root: rel#59393:Subset#4.LOGICAL.ANY([]).[]
 Original rel:
 AbstractConverter(subset=[rel#59393:Subset#4.LOGICAL.ANY([]).[]], 
 convention=[LOGICAL], DrillDistributionTraitDef=[ANY([])], sort=[[]]): 
 rowcount = 1.7976931348623157E308, cumulative cost = {inf}, id = 59394
   UnionRel(subset=[rel#59392:Subset#4.NONE.ANY([]).[]], all=[true]): rowcount 
 = 1.7976931348623157E308, cumulative cost = {1.7976931348623157E308 rows, 
 1.7976931348623157E308 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 59391
 AggregateRel(subset=[rel#59388:Subset#2.NONE.ANY([]).[]], group=[{}], 
 EXPR$0=[COUNT($0)]): rowcount = 1.7976931348623158E307, cumulative cost = 
 {1.7976931348623158E307 rows, 0.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 
 59387
   ProjectRel(subset=[rel#59386:Subset#1.NONE.ANY([]).[]], 
 c1=[CAST(ITEM($1, 0)):INTEGER]): rowcount = 100.0, cumulative cost = {100.0 
 rows, 100.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 59385
 
 EnumerableTableAccessRel(subset=[rel#59384:Subset#0.ENUMERABLE.ANY([]).[]], 
 table=[[dfs, tmp, testWindow.csv]]): rowcount = 100.0, cumulative cost = 
 {100.0 rows, 101.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 59368
 ProjectRel(subset=[rel#59390:Subset#3.NONE.ANY([]).[]], c2=[CAST(ITEM($1, 
 0)):INTEGER]): rowcount = 100.0, cumulative cost = {100.0 rows, 100.0 cpu, 
 0.0 io, 0.0 network, 0.0 memory}, id = 59389
   
 EnumerableTableAccessRel(subset=[rel#59384:Subset#0.ENUMERABLE.ANY([]).[]], 
 table=[[dfs, tmp, testWindow.csv]]): rowcount = 100.0, cumulative cost = 
 {100.0 rows, 101.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 59368
 Sets:
 Set#0, type: (DrillRecordRow[*, columns])
   rel#59384:Subset#0.ENUMERABLE.ANY([]).[], best=rel#59368, 
 importance=0.6561
   
 rel#59368:EnumerableTableAccessRel.ENUMERABLE.ANY([]).[](table=[dfs, tmp, 
 testWindow.csv]), rowcount=100.0, cumulative cost={100.0 rows, 101.0 cpu, 0.0 
 io, 0.0 network, 0.0 memory}
   
 rel#59408:AbstractConverter.ENUMERABLE.ANY([]).[](child=rel#59407:Subset#0.LOGICAL.ANY([]).[],convention=ENUMERABLE,DrillDistributionTraitDef=ANY([]),sort=[]),
  rowcount=1.0, cumulative cost={inf}
   rel#59407:Subset#0.LOGICAL.ANY([]).[], best=rel#59415, 
 importance=0.59049001
   
 rel#59409:AbstractConverter.LOGICAL.ANY([]).[](child=rel#59384:Subset#0.ENUMERABLE.ANY([]).[],convention=LOGICAL,DrillDistributionTraitDef=ANY([]),sort=[]),
  rowcount=100.0, cumulative cost={inf}
   rel#59415:DrillScanRel.LOGICAL.ANY([]).[](table=[dfs, tmp, 
 testWindow.csv],groupscan=EasyGroupScan [selectionRoot=/tmp/testWindow.csv, 
 numFiles=1, columns=[`*`], files=[maprfs:/tmp/testWindow.csv]]), 
 rowcount=1.0, cumulative cost={1.0 rows, 1.0 cpu, 0.0 io, 0.0 network, 
 0.0 memory}
 Set#1, type: RecordType(INTEGER c1)
   rel#59386:Subset#1.NONE.ANY([]).[], best=null, 
 importance=0.7291
   
 rel#59385:ProjectRel.NONE.ANY([]).[](child=rel#59384:Subset#0.ENUMERABLE.ANY([]).[],c1=CAST(ITEM($1,
  0)):INTEGER), rowcount=100.0, cumulative cost={inf}
   rel#59404:Subset#1.LOGICAL.ANY([]).[], best=rel#59413, 
 importance=0.36455
   
 rel#59405:AbstractConverter.LOGICAL.ANY([]).[](child=rel#59386:Subset#1.NONE.ANY([]).[],convention=LOGICAL,DrillDistributionTraitDef=ANY([]),sort=[]),
  rowcount=1.7976931348623157E308,

[jira] [Updated] (DRILL-2561) Profile UI: Metrics displayed incorrectly for failed query

2015-04-06 Thread Chris Westin (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Westin updated DRILL-2561:

Fix Version/s: (was: 0.9.0)
   1.0.0

 Profile UI: Metrics displayed incorrectly for failed query
 --

 Key: DRILL-2561
 URL: https://issues.apache.org/jira/browse/DRILL-2561
 Project: Apache Drill
  Issue Type: Bug
  Components: Client - HTTP
Affects Versions: 0.9.0
Reporter: Krystal
Assignee: Sudheesh Katkam
 Fix For: 1.0.0


 git.commit.id=8493713cafe6e5d1f56f2dffc9d8bea294a6e013
 I have a query that failed to execute.  The profile UI for this query 
 displayed wrong metrics in columns.  Here is the url for that profile:
 http://10.10.100.115:8047/profiles/2aed1b79-17a0-312d-42a5-161a1c2c66a4




[jira] [Updated] (DRILL-2653) All additions to storage plugin configuration gets wiped out if info is not correct

2015-04-06 Thread Chris Westin (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Westin updated DRILL-2653:

Fix Version/s: (was: 0.9.0)
   1.0.0

 All additions to storage plugin configuration gets wiped out if info is not 
 correct 
 

 Key: DRILL-2653
 URL: https://issues.apache.org/jira/browse/DRILL-2653
 Project: Apache Drill
  Issue Type: Bug
  Components: Client - HTTP
Affects Versions: 0.9.0
Reporter: Krystal
Assignee: Sudheesh Katkam
 Fix For: 1.0.0


 Whenever a user enters invalid information into a storage plugin 
 configuration, all of the entered data gets wiped out.  This can be 
 frustrating to users if a lot of information has been entered.  The data 
 should be retained on the screen to allow users to correct the mistakes 
 rather than having to enter all the information again.




[jira] [Updated] (DRILL-1478) The order of query results for the selected fields seems to be different from sqlline vs Web UI

2015-04-06 Thread Chris Westin (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-1478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Westin updated DRILL-1478:

Fix Version/s: (was: 0.9.0)
   1.0.0

 The order of query results for the selected fields seems to be different from 
 sqlline vs Web UI
 ---

 Key: DRILL-1478
 URL: https://issues.apache.org/jira/browse/DRILL-1478
 Project: Apache Drill
  Issue Type: Bug
  Components: Client - HTTP
 Environment: I executed a query with aggregation; the order of the 
 results seems to differ between sqlline and the Web UI.
Reporter: B Anil Kumar
Assignee: Sudheesh Katkam
 Fix For: 1.0.0


 Here, "order" means the order of the selected columns. For example:
 {noformat}
 0: jdbc:drill:zk=localhost:2181> select state,city,avg(pop) from 
 mongo.test.`zips` zipcodes group by state, city limit 5;
 +------------+------------+------------+
 |   state    |    city    |   EXPR$2   |
 +------------+------------+------------+
 | MA         | AGAWAM     | 15338.0    |
 | MA         | CUSHMAN    | 36963.0    |
 | MA         | BARRE      | 4546.0     |
 | MA         | BELCHERTOWN | 10579.0    |
 | MA         | BLANDFORD  | 1240.0     |
 +------------+------------+------------+
 {noformat}
 The above is as expected, whereas the same query in the Web UI returns:
 {noformat}
 EXPR$2  state  city
 1,240   MA     BLANDFORD
 4,546   MA     BARRE
 10,579  MA     BELCHERTOWN
 15,338  MA     AGAWAM
 36,963  MA     CUSHMAN
 {noformat}




[jira] [Updated] (DRILL-2457) aggregation function on complex type should give appropriate error

2015-04-06 Thread Chun Chang (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chun Chang updated DRILL-2457:
--
Assignee: Parth Chandra  (was: Chun Chang)

 aggregation function on complex type should give appropriate error 
 ---

 Key: DRILL-2457
 URL: https://issues.apache.org/jira/browse/DRILL-2457
 Project: Apache Drill
  Issue Type: Bug
  Components: Functions - Drill
Affects Versions: 0.8.0
Reporter: Chun Chang
Assignee: Parth Chandra
Priority: Minor
 Fix For: 0.9.0


 #Fri Mar 06 16:51:10 EST 2015
 git.commit.id.abbrev=fb293ba
 Applying an aggregation function to complex data should fail with an 
 appropriate error message instead of returning null.
 {code}
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ> select t.soa[1] from 
 `complex.json` t limit 1;
 +------------+
 |   EXPR$0   |
 +------------+
 | {in:1,fl:1.12345} |
 +------------+
 1 row selected (0.169 seconds)
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ> select avg(t.soa[1]) from 
 `complex.json` t;
 +--+
 |  |
 +--+
 +--+
 No rows selected (0.164 seconds)
 {code}




[jira] [Commented] (DRILL-2401) Value 0 for monthOfYear must be in the range [1,12]

2015-04-06 Thread Parth Chandra (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-2401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14481898#comment-14481898
 ] 

Parth Chandra commented on DRILL-2401:
--

[~cch...@maprtech.com] Can you try to reproduce this?

 Value 0 for monthOfYear must be in the range [1,12]
 ---

 Key: DRILL-2401
 URL: https://issues.apache.org/jira/browse/DRILL-2401
 Project: Apache Drill
  Issue Type: Bug
  Components: Functions - Drill
Affects Versions: 0.7.0
 Environment: Sandbox, SE cluster
Reporter: Sudhakar Thota
Assignee: Chun Chang
Priority: Minor
 Fix For: 1.0.0

 Attachments: month-00-drill-MapR-DB.rtf


 0: jdbc:drill:zk=10.10.15.10:5181,10.10.15.11> select cast(row_key as 
 varchar(20)) as order_id, cast(t.CF1.`clerk` as varchar(20)) as order_clerk, 
 cast(t.CF1.orderstatus as varchar(20)) as ord_stat,cast(t.CF1.orderdate as 
 date) ordrdt from m7thotas.orders t where  cast(t.CF1.orderdate as date) 
 BETWEEN cast('1996-01-10' as date) AND cast('1996-01-11' as date) and 
 cast(t.CF1.`clerk` as varchar(20))cast('Clerk#00334' as varchar(20));
 .
 .
 .
 | 3592807| Clerk#00358 | O  | 1996-01-11 |
 | 6969024| Clerk#00185 | O  | 1996-01-10 |
 | 6973990| Clerk#00491 | O  | 1996-01-10 |
 | 6975170| Clerk#00493 | O  | 1996-01-10 |
 | 6975462| Clerk#00352 | O  | 1996-01-11 |
 | 6978503| Clerk#00482 | O  | 1996-01-11 |
 | 6979654| Clerk#00389 | O  | 1996-01-10 |
 | 5344355| Clerk#00653 | O  | 1996-01-11 |
 | 5351969| Clerk#00237 | O  | 1996-01-10 |
 | 5352003| Clerk#00756 | O  | 1996-01-10 |
 | 3608519| Clerk#00384 | O  | 1996-01-10 |
 Query failed: Query failed: Failure while running fragment., Value 0 for 
 monthOfYear must be in the range [1,12] [ 
 2b41018b-b35e-42a9-bd4e-0307e582b718 on se-node11.se.lab:31010 ]
 [ 2b41018b-b35e-42a9-bd4e-0307e582b718 on se-node11.se.lab:31010 ]
 java.lang.RuntimeException: java.sql.SQLException: Failure while executing 
 query.
   at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
   at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
   at sqlline.SqlLine.print(SqlLine.java:1809)
   at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
   at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
   at sqlline.SqlLine.dispatch(SqlLine.java:889)
   at sqlline.SqlLine.begin(SqlLine.java:763)
   at sqlline.SqlLine.start(SqlLine.java:498)
   at sqlline.SqlLine.main(SqlLine.java:460)
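The error text indicates that a date with a month field of 0 reached a date constructor during the cast. The sketch below shows the same class of failure and a guard against it using `java.time`; that choice is an assumption, since the report does not state which date library raised the error, and `MonthCheck` is a hypothetical name.

```java
import java.time.DateTimeException;
import java.time.LocalDate;

/** Illustrative guard; the date library used by the failing query is not stated. */
final class MonthCheck {
    /** Returns true if the three fields form a real calendar date. */
    static boolean isValidDate(int year, int month, int day) {
        try {
            // LocalDate.of rejects out-of-range fields, including month 0.
            LocalDate.of(year, month, day);
            return true;
        } catch (DateTimeException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(isValidDate(1996, 1, 10));
        System.out.println(isValidDate(1996, 0, 10));
    }
}
```

A check of this kind would let a bad stored value be reported per row rather than failing the whole fragment, if that is the desired behaviour.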




[jira] [Updated] (DRILL-1626) sqlline echoes statements with unnecessary wrapping

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-1626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-1626:
-
Fix Version/s: (was: 1.0.0)
   1.1.0

 sqlline echoes statements with unnecessary wrapping
 ---

 Key: DRILL-1626
 URL: https://issues.apache.org/jira/browse/DRILL-1626
 Project: Apache Drill
  Issue Type: Bug
  Components: Client - CLI
 Environment: sqlline running under Eclipse on OSX
Reporter: Chris Westin
Assignee: Mehant Baid
Priority: Minor
 Fix For: 1.1.0


 I type a command in sqlline, and when it is echoed, it is unnecessarily 
 wrapped (making it hard to read, as well as casting uncertainty on what I was 
 doing). Example:
 0: jdbc:drill:zk=local> create table donuts_parquet as select * from 
 `donuts.json`;
 create table donuts_parquet as select * from `donuts.jso 
 n`;
 +------------+---------------------------+
 |  Fragment  | Number of records written |
 +------------+---------------------------+
 | 0_0        | 5                         |
 That's sqlline adding a newline after jso and before n in the echoed command.




[jira] [Updated] (DRILL-2588) Profile UI: First Start field contains incorrect data

2015-04-06 Thread Chris Westin (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris Westin updated DRILL-2588:

Fix Version/s: (was: 0.9.0)
   1.0.0

 Profile UI: First Start field contains incorrect data  
 -

 Key: DRILL-2588
 URL: https://issues.apache.org/jira/browse/DRILL-2588
 Project: Apache Drill
  Issue Type: Bug
  Components: Client - HTTP
Affects Versions: 0.9.0
Reporter: Krystal
Assignee: Sudheesh Katkam
 Fix For: 1.0.0


 Some data under the Start Time column for a long running query contain 
 incorrect data.  Below is the data for major fragment 2: 
 Major Fragment: 02-xx-xx
 Minor Fragment  Host                 Start   End   Total Time  Max Records  Max Batches  Peak Memory  State
 02-00-xx        ucs-node4.perf.lab   2.224   7:59  7:57        57,346,551   7,012        2MB          FINISHED
 02-01-xx        ucs-node8.perf.lab   2.210   8:06  8:04        61,095,926   7,470        2MB          FINISHED
 02-02-xx        ucs-node9.perf.lab   2.223   8:13  8:11        61,329,266   7,496        2MB          FINISHED
 02-03-xx        ucs-node3.perf.lab   2.250   8:08  8:06        61,112,691   7,472        2MB          FINISHED
 02-04-xx        ucs-node7.perf.lab   2.235   8:04  8:01        61,173,642   7,478        2MB          FINISHED
 02-05-xx        ucs-node5.perf.lab   43.518  7:52  8:08        69,778,300   8,526        2MB          FINISHED
 02-06-xx        ucs-node11.perf.lab  2.258   7:35  7:33        57,324,207   7,009        2MB          FINISHED
 02-07-xx        ucs-node2.perf.lab   2.265   7:59  7:57        61,165,117   7,477        2MB          FINISHED
 02-08-xx        ucs-node10.perf.lab  2.276   7:01  6:59        54,521,464   6,667        2MB          FINISHED
 02-09-xx        ucs-node6.perf.lab   2.250   7:43  7:41        55,190,738   6,748        2MB          FINISHED
 Below is the summary for major fragment 2 as reported in the UI:
 Major Fragment  Minor Fragments Reporting  First Start  Last Start  First End  Last End  tmin  tavg  tmax  memmax
 02-xx-xx        10 / 10                    43.518       2.276       7:01       8:13      6:59  7:52  8:11  2MB
 The value for `First Start` should be 2.210.
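
The expected value follows from taking the minimum over the per-fragment Start values listed above. A minimal Python sketch of that check, assuming the Start values are plain numbers as shown in the table (the variable names are illustrative and not taken from the Drill code):

```python
# Start values for the minor fragments of major fragment 2,
# copied from the per-fragment table in this report.
starts = {
    "02-00-xx": 2.224,
    "02-01-xx": 2.210,
    "02-02-xx": 2.223,
    "02-03-xx": 2.250,
    "02-04-xx": 2.235,
    "02-05-xx": 43.518,
    "02-06-xx": 2.258,
    "02-07-xx": 2.265,
    "02-08-xx": 2.276,
    "02-09-xx": 2.250,
}

# "First Start" should be the earliest start among the minor fragments.
first_start = min(starts.values())
first_fragment = min(starts, key=starts.get)
```

Here `first_start` comes out as the 02-01-xx value, while the 43.518 reported by the UI is the largest Start value in the table rather than the smallest.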




[jira] [Updated] (DRILL-2551) Varbinary literals are not supported in Drill logical expressions

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2551:
-
Fix Version/s: (was: 1.0.0)
   Future

 Varbinary literals are not supported in Drill logical expressions
 -

 Key: DRILL-2551
 URL: https://issues.apache.org/jira/browse/DRILL-2551
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Affects Versions: 0.8.0
Reporter: Jason Altekruse
Assignee: Jason Altekruse
 Fix For: Future


 Drill does not support the varbinary literal type at the logical expression 
 level. We need to define the Type in ValueExpressions, RexToDrill and add 
 corresponding methods to ExprVisitor and its implementations. There may be 
 other areas that are needed to make the definition complete within Drill. I 
 will check the patches from adding other types and see what else was changed 
 when we added them.




[jira] [Updated] (DRILL-2257) Filter on nested repeated types on top of data with schema changes fails

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2257:
-
Fix Version/s: (was: 1.0.0)
   1.1.0

 Filter on nested repeated types on top of data with schema changes fails
 

 Key: DRILL-2257
 URL: https://issues.apache.org/jira/browse/DRILL-2257
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Reporter: Rahul Challapalli
Assignee: Hanifi Gunes
 Fix For: 1.1.0

 Attachments: error.log, file1.json, file2.json, file3.json


 git.commit.id.abbrev=6676f2d
 The data set contains 3 files and 2 records within each file. We 
 added/removed fields between these files. However the below query uses fields 
 which are present in all the files.
 {code}
 select d.uid from `schema_changes` d where d.map.rm[0].rptd[0].a = 'foo';
 +------+
 | uid  |
 +------+
 | 1    |
 Query failed: RemoteRpcException: Failure while running fragment., index: -4, 
 length: 4 (expected: range(0, 16384)) [ bebebc31-0ce7-4b77-9984-f5e548790bfd 
 on qa-node190.qa.lab:31010 ]
 [ bebebc31-0ce7-4b77-9984-f5e548790bfd on qa-node190.qa.lab:31010 ]
 java.lang.RuntimeException: java.sql.SQLException: Failure while executing 
 query.
   at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
   at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
   at sqlline.SqlLine.print(SqlLine.java:1809)
   at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
   at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
   at sqlline.SqlLine.dispatch(SqlLine.java:889)
   at sqlline.SqlLine.begin(SqlLine.java:763)
   at sqlline.SqlLine.start(SqlLine.java:498)
   at sqlline.SqlLine.main(SqlLine.java:460)
 {code}
 Running the same query on each of the individual files succeeds and returns 
 the data correctly.
 Let me know if you need anything else




[jira] [Created] (DRILL-2698) Add allocation policy for (direct memory) limits per-query-per-node

2015-04-06 Thread Chris Westin (JIRA)

Chris Westin created DRILL-2698:
---

 Summary: Add allocation policy for (direct memory) limits 
per-query-per-node
 Key: DRILL-2698
 URL: https://issues.apache.org/jira/browse/DRILL-2698
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Flow
Affects Versions: 0.8.0
Reporter: Chris Westin
Assignee: Chris Westin
 Fix For: 1.0.0


We want the allocator to support a policy that limits memory allocation 
per-query-per-node. Current policy is to divide the available direct memory by 
the number of fragments running on a node, and limit each fragment to that. 
This new policy would count fragments that belong to the same query and are 
running on the same node together. So, if query A has 3 fragments running on a 
node, and query B has 5 fragments running on the same node, and there are no 
other queries with fragments on that node, then there are two queries running 
on the node, and their allocation limit is the available direct memory divided 
by two. (Under the original policy, which divides the available memory by the 
number of fragments, A would be allowed 3/8 of the memory, and B would be 
allowed 5/8 of the memory.)

Note that this policy should be a user-settable boot-time option; the original 
policy would still also be available as an option.
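
The arithmetic in the example above can be sketched in Python as follows. This is only an illustration of the two policies as described here; the function names and the dictionary of fragment counts are hypothetical and do not correspond to the Drill allocator API.

```python
from fractions import Fraction


def per_fragment_limits(available, fragments_by_query):
    """Original policy: split memory evenly across all fragments on the node,
    so a query's total allowance grows with its fragment count."""
    total_fragments = sum(fragments_by_query.values())
    if total_fragments == 0:
        return {}
    per_fragment = available / total_fragments
    return {q: n * per_fragment for q, n in fragments_by_query.items()}


def per_query_limits(available, fragments_by_query):
    """Proposed policy: split memory evenly across the queries that have
    at least one fragment on the node, regardless of fragment count."""
    active = [q for q, n in fragments_by_query.items() if n > 0]
    if not active:
        return {}
    share = available / len(active)
    return {q: share for q in active}


# Query A has 3 fragments and query B has 5 fragments on the same node.
node = {"A": 3, "B": 5}
old = per_fragment_limits(Fraction(1), node)
new = per_query_limits(Fraction(1), node)
```

With the available memory normalized to 1, `old` gives A 3/8 and B 5/8, and `new` gives each query 1/2, matching the figures in the description.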




[jira] [Updated] (DRILL-2401) Value 0 for monthOfYear must be in the range [1,12]

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2401:
-
Assignee: Chun Chang  (was: Daniel Barclay (Drill))

 Value 0 for monthOfYear must be in the range [1,12]
 ---

 Key: DRILL-2401
 URL: https://issues.apache.org/jira/browse/DRILL-2401
 Project: Apache Drill
  Issue Type: Bug
  Components: Functions - Drill
Affects Versions: 0.7.0
 Environment: Sandbox, SE cluster
Reporter: Sudhakar Thota
Assignee: Chun Chang
Priority: Minor
 Fix For: 1.0.0

 Attachments: month-00-drill-MapR-DB.rtf


 0: jdbc:drill:zk=10.10.15.10:5181,10.10.15.11 select cast(row_key as 
 varchar(20)) as order_id, cast(t.CF1.`clerk` as varchar(20)) as order_clerk, 
 cast(t.CF1.orderstatus as varchar(20)) as ord_stat,cast(t.CF1.orderdate as 
 date) ordrdt from m7thotas.orders t where  cast(t.CF1.orderdate as date) 
 BETWEEN cast('1996-01-10' as date) AND cast('1996-01-11' as date) and 
 cast(t.CF1.`clerk` as varchar(20))cast('Clerk#00334' as varchar(20));
 .
 .
 .
 | 3592807| Clerk#00358 | O  | 1996-01-11 |
 | 6969024| Clerk#00185 | O  | 1996-01-10 |
 | 6973990| Clerk#00491 | O  | 1996-01-10 |
 | 6975170| Clerk#00493 | O  | 1996-01-10 |
 | 6975462| Clerk#00352 | O  | 1996-01-11 |
 | 6978503| Clerk#00482 | O  | 1996-01-11 |
 | 6979654| Clerk#00389 | O  | 1996-01-10 |
 | 5344355| Clerk#00653 | O  | 1996-01-11 |
 | 5351969| Clerk#00237 | O  | 1996-01-10 |
 | 5352003| Clerk#00756 | O  | 1996-01-10 |
 | 3608519| Clerk#00384 | O  | 1996-01-10 |
 Query failed: Query failed: Failure while running fragment., Value 0 for 
 monthOfYear must be in the range [1,12] [ 
 2b41018b-b35e-42a9-bd4e-0307e582b718 on se-node11.se.lab:31010 ]
 [ 2b41018b-b35e-42a9-bd4e-0307e582b718 on se-node11.se.lab:31010 ]
 java.lang.RuntimeException: java.sql.SQLException: Failure while executing 
 query.
   at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
   at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
   at sqlline.SqlLine.print(SqlLine.java:1809)
   at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
   at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
   at sqlline.SqlLine.dispatch(SqlLine.java:889)
   at sqlline.SqlLine.begin(SqlLine.java:763)
   at sqlline.SqlLine.start(SqlLine.java:498)
   at sqlline.SqlLine.main(SqlLine.java:460)




[jira] [Created] (DRILL-2699) Collect all cleanup errors before reporting a failure to the client

2015-04-06 Thread Deneche A. Hakim (JIRA)

Deneche A. Hakim created DRILL-2699:
---

 Summary: Collect all cleanup errors before reporting a failure to 
the client
 Key: DRILL-2699
 URL: https://issues.apache.org/jira/browse/DRILL-2699
 Project: Apache Drill
  Issue Type: Improvement
  Components: Execution - Flow
Affects Versions: 0.8.0
Reporter: Deneche A. Hakim
Assignee: Chris Westin
 Fix For: 0.9.0


If a query fails, the fragments and foreman should make sure to collect all 
failures and report them back to the client. Some known places where this isn't 
respected:

- If a fragment fails, it will report the failure to the foreman before 
cleaning up. Any failure that happens in the cleanup process will be dropped by 
the foreman.
- If multiple fragments fail, the Foreman will only report to the user the 
first failure it received and close immediately. All other failures will be 
dropped.
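
One way to picture the requested behaviour is a fragment that keeps running its cleanup steps after a failure and reports everything at once. The Python sketch below is only an illustration under that assumption; `QueryFailure` and `run_fragment` are hypothetical names, not Drill classes.

```python
class QueryFailure(Exception):
    """Hypothetical aggregate error carrying every failure seen for a query."""

    def __init__(self, failures):
        self.failures = list(failures)
        super().__init__("; ".join(str(f) for f in self.failures))


def run_fragment(work, cleanup_steps):
    """Run the work, then every cleanup step, and report all failures together."""
    failures = []
    try:
        work()
    except Exception as exc:
        # Remember the execution failure, but do not report it yet.
        failures.append(exc)
    for step in cleanup_steps:
        try:
            step()
        except Exception as exc:
            # Keep going so later cleanup steps still run.
            failures.append(exc)
    if failures:
        raise QueryFailure(failures)
```

The same pattern would apply on the Foreman side: wait for every fragment's final status and merge the collected failures before responding to the client.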





[jira] [Created] (DRILL-2700) Missing/deleted drillbit.log files on cluster are not reported

2015-04-06 Thread Khurram Faraaz (JIRA)

Khurram Faraaz created DRILL-2700:
-

 Summary: Missing/deleted drillbit.log files on cluster are not 
reported
 Key: DRILL-2700
 URL: https://issues.apache.org/jira/browse/DRILL-2700
 Project: Apache Drill
  Issue Type: Bug
  Components: Tools, Build & Test
Affects Versions: 0.9.0
 Environment: 4 node cluster on CentOS
Reporter: Khurram Faraaz
Assignee: Steven Phillips


Missing drillbit.log files under /opt/mapr/drill/drill-0.9.0/logs/ on the 
cluster are not reported as missing. Instead, the user can still issue and 
execute SQL queries from the sqlline prompt. Failing queries are also executed, 
and their errors/exceptions are reported at the sqlline prompt, but there is no 
drillbit.log file to which details of passing or failing queries can be 
written.

I manually deleted drillbit.log (users can delete it accidentally) and 
continued to execute SQL from the sqlline prompt. When a query then fails, 
there is no drillbit.log file, so the stack trace and other diagnostic 
information are lost.

{code}
[root@centos-01 bin]# clush -g khurram  service mapr-warden stop
10.10.100.201: stopping WARDEN
10.10.100.202: stopping WARDEN
10.10.100.203: stopping WARDEN
10.10.100.204: stopping WARDEN
10.10.100.202: looking to stop mapr-core processes not started by warden
10.10.100.203: looking to stop mapr-core processes not started by warden
10.10.100.204: looking to stop mapr-core processes not started by warden
10.10.100.201: looking to stop mapr-core processes not started by warden

[root@centos-01 bin]# clush -g khurram  service mapr-warden start
10.10.100.204: Starting WARDEN, logging to /opt/mapr/logs/warden.log.
10.10.100.203: Starting WARDEN, logging to /opt/mapr/logs/warden.log.
10.10.100.202: Starting WARDEN, logging to /opt/mapr/logs/warden.log.
10.10.100.201: Starting WARDEN, logging to /opt/mapr/logs/warden.log.
10.10.100.204: .
10.10.100.203: .
10.10.100.204: For diagnostics look at /opt/mapr/logs/ for 
createsystemvolumes.log, warden.log and configured services log files
10.10.100.202: .
10.10.100.201: .
10.10.100.203: For diagnostics look at /opt/mapr/logs/ for 
createsystemvolumes.log, warden.log and configured services log files
10.10.100.202: For diagnostics look at /opt/mapr/logs/ for 
createsystemvolumes.log, warden.log and configured services log files
10.10.100.201: For diagnostics look at /opt/mapr/logs/ for 
createsystemvolumes.log, warden.log and configured services log files

[root@centos-01 bin]# clush -g khurram ls -lrt 
/opt/mapr/drill/drill-0.9.0/logs/drillbit.log
10.10.100.201: -rw-r--r-- 1 mapr mapr 17522 Apr  6 20:49 
/opt/mapr/drill/drill-0.9.0/logs/drillbit.log
10.10.100.204: -rw-r--r-- 1 mapr mapr 17522 Apr  6 20:49 
/opt/mapr/drill/drill-0.9.0/logs/drillbit.log
10.10.100.202: -rw-r--r-- 1 mapr mapr 17522 Apr  6 20:49 
/opt/mapr/drill/drill-0.9.0/logs/drillbit.log
10.10.100.203: -rw-r--r-- 1 mapr mapr 17522 Apr  6 20:49 
/opt/mapr/drill/drill-0.9.0/logs/drillbit.log

Manually remove drillbit.log files from cluster nodes

[root@centos-01 bin]# clush -g khurram rm -rf 
/opt/mapr/drill/drill-0.9.0/logs/drillbit.log
[root@centos-01 bin]# clush -g khurram ls -lrt 
/opt/mapr/drill/drill-0.9.0/logs/drillbit.log
10.10.100.201: ls: cannot access /opt/mapr/drill/drill-0.9.0/logs/drillbit.log: 
No such file or directory
clush: 10.10.100.201: exited with exit code 2
10.10.100.204: ls: cannot access /opt/mapr/drill/drill-0.9.0/logs/drillbit.log: 
No such file or directory
clush: 10.10.100.204: exited with exit code 2
10.10.100.203: ls: cannot access /opt/mapr/drill/drill-0.9.0/logs/drillbit.log: 
No such file or directory
clush: 10.10.100.203: exited with exit code 2
10.10.100.202: ls: cannot access /opt/mapr/drill/drill-0.9.0/logs/drillbit.log: 
No such file or directory
clush: 10.10.100.202: exited with exit code 2

[root@centos-01 bin]# ./sqlline -u jdbc:drill:
Drill log directory: /opt/mapr/drill/drill-0.9.0/logs
sqlline version 1.1.6
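0: jdbc:drill:> use sys;
+------+---------------------------------+
| ok   | summary                         |
+------+---------------------------------+
| true | Default schema changed to 'sys' |
+------+---------------------------------+
1 row selected (1.199 seconds)
0: jdbc:drill:> select * from sys.version;
+------------+----------------+-------------+-------------+------------+
| commit_id  | commit_message | commit_time | build_email | build_time |
+------------+----------------+-------------+-------------+------------+
| 9d92b8e319f2d46e8659d903d355450e15946533 | DRILL-2580: Exit early from HashJoinBatch if build side is empty | 26.03.2015 @ 16:13:53 EDT | Unknown | 26.03.2015 @ 16:59:31 EDT |
+------------+----------------+-------------+-------------+------------+
1 row selected (0.608 seconds)
0: jdbc:drill:> select * from sys.drillbits;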

[jira] [Updated] (DRILL-2039) use forks-per-CPU units in forkCount

2015-04-06 Thread Hanifi Gunes (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hanifi Gunes updated DRILL-2039:

Assignee: Parth Chandra  (was: Hanifi Gunes)

 use forks-per-CPU units in forkCount
 

 Key: DRILL-2039
 URL: https://issues.apache.org/jira/browse/DRILL-2039
 Project: Apache Drill
  Issue Type: Improvement
  Components: Tools, Build & Test
Reporter: Daniel Barclay (Drill)
Assignee: Parth Chandra
Priority: Minor
 Fix For: 0.9.0

 Attachments: DRILL-2039.1.patch.txt


 Can Surefire's fork count be set using units of forks per CPU instead of the 
 absolute number of forks (i.e., {{<forkCount>1C</forkCount>}} instead of 
 {{<forkCount>4</forkCount>}})?
 That would let developers with more powerful machines still run tests in 
 parallel while letting others avoid or reduce test timeout errors without the 
 risk of forgetting to re-edit pom.xml after every switch to a different git 
 branch.
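
For illustration, the Maven Surefire documentation describes a forkCount value ending in C as being multiplied by the number of available CPU cores. The Python sketch below shows how such a setting could resolve to a concrete number of forks; `effective_forks` is a hypothetical helper, and its rounding rule (truncate, but keep at least one fork for positive values) is an assumption rather than a statement about Surefire's exact behaviour.

```python
def effective_forks(fork_count: str, cores: int) -> int:
    """Resolve a forkCount setting to a number of forked JVMs."""
    if fork_count.endswith("C"):
        # Per-core form, e.g. "1C" or "0.5C": scale by the core count.
        multiplier = float(fork_count[:-1])
        calculated = multiplier * cores
        # Assumption: truncate, but keep at least one fork for positive values.
        return max(int(calculated), 1) if calculated > 0 else 0
    # Absolute form, e.g. "4": the same number on every machine.
    return int(fork_count)
```

Under this sketch, a setting of 1C gives 2 forks on a two-core laptop and 16 on a sixteen-core workstation, whereas 4 gives 4 forks on both, which is the portability argument made above.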




[jira] [Updated] (DRILL-2039) use forks-per-CPU units in forkCount

2015-04-06 Thread Hanifi Gunes (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hanifi Gunes updated DRILL-2039:

Attachment: DRILL-2039.1.patch.txt

Use 1 fork per logical core

 use forks-per-CPU units in forkCount
 

 Key: DRILL-2039
 URL: https://issues.apache.org/jira/browse/DRILL-2039
 Project: Apache Drill
  Issue Type: Improvement
  Components: Tools, Build & Test
Reporter: Daniel Barclay (Drill)
Assignee: Hanifi Gunes
Priority: Minor
 Fix For: 0.9.0

 Attachments: DRILL-2039.1.patch.txt


 Can Surefire's fork count be set using units of forks per CPU instead of the 
 absolute number of forks (i.e., {{<forkCount>1C</forkCount>}} instead of 
 {{<forkCount>4</forkCount>}})?
 That would let developers with more powerful machines still run tests in 
 parallel while letting others avoid or reduce test timeout errors without the 
 risk of forgetting to re-edit pom.xml after every switch to a different git 
 branch.




[jira] [Resolved] (DRILL-2060) Support constant folding in expressions in Filter

2015-04-06 Thread Jason Altekruse (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Altekruse resolved DRILL-2060.

Resolution: Fixed

Fixed in 8d70709ddaddc7111517f4bed883fc9908ef94de

 Support constant folding in expressions in Filter
 -

 Key: DRILL-2060
 URL: https://issues.apache.org/jira/browse/DRILL-2060
 Project: Apache Drill
  Issue Type: New Feature
  Components: Execution - Flow, Query Planning & Optimization
Reporter: Aditya Kishore
Assignee: Jason Altekruse
 Fix For: 0.9.0

 Attachments: DRILL-2060-15-mar-15.patch, 
 DRILL-2060-const-folding-feb-6.diff, 
 DRILL-2060-part1-const-folding-feb-27.patch, 
 DRILL-2060-part2-const-folding-feb-27.patch, DRILL-2060.patch


 If one side of a filter condition is a constant expression, we should be able 
 to evaluate the expression in the planning phase and replace the expression 
 with the constant value.
 For example, the following filter
 {{Where shipdate  date '1997-01-01' + interval '1' year}}
 can be re-written as
 {{Where shipdate  date '1998-01-01'}}
 Or 
 {{Where age + 5  42}}
 can be re-written as
 {{Where age  37}}
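
As a rough illustration of the first rewrite, the constant side of the shipdate filter (a date literal plus a one-year interval) can be evaluated once at planning time. The Python sketch below uses a tiny hypothetical expression tree (`Const`, `Column`, `AddYears`); it is not Drill's planner code and ignores edge cases such as adding a year to 29 February.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Const:
    value: object


@dataclass(frozen=True)
class Column:
    name: str


@dataclass(frozen=True)
class AddYears:
    operand: object
    years: int


def fold(expr):
    """Replace constant-only subexpressions with their computed value."""
    if isinstance(expr, AddYears):
        operand = fold(expr.operand)
        if isinstance(operand, Const):
            d = operand.value
            # Both inputs are known at planning time, so compute the result now.
            return Const(d.replace(year=d.year + expr.years))
        # The operand depends on row data; leave it for execution time.
        return AddYears(operand, expr.years)
    return expr


# date '1997-01-01' + interval '1' year
folded = fold(AddYears(Const(date(1997, 1, 1)), 1))
```

Here `folded` is the constant date 1998-01-01, so the filter can compare shipdate against a literal instead of re-evaluating the addition for every row.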




[jira] [Updated] (DRILL-2383) add exception and pause injections for testing drillbit stability

2015-04-06 Thread Sudheesh Katkam (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sudheesh Katkam updated DRILL-2383:
---
Attachment: DRILL-2383.3.patch.txt

Addressed review comments and rebased on current master.

 add exception and pause injections for testing drillbit stability
 -

 Key: DRILL-2383
 URL: https://issues.apache.org/jira/browse/DRILL-2383
 Project: Apache Drill
  Issue Type: New Feature
  Components: Execution - Flow
Reporter: Chris Westin
Assignee: Sudheesh Katkam
 Fix For: 0.9.0

 Attachments: DRILL-2383.1.patch.txt, DRILL-2383.3.patch.txt


 Use the exception injection mechanism to add exception injections to test a 
 variety of distributed failure scenarios.
 Here are some scenarios we've worked out before:
 1. Cancellation:
   TC1: cancel before any result set is returned
   TC2: cancel in the middle of fetching result set
   TC3: cancel after all result sets are produced but not all are fetched
   TC4: cancel after everything is completed and fetched
 As test setup, we need:
   - query dataset large enough to be sent to different drillbits, e.g., TPCH 
 100
   - queries that force multiple drillbits to work on them; e.g., count ... 
 group by
 2. Completed (in each case check all drillbits are still up and running):
   TC1: success
   TC2: failed query - before query is executed - while sql parsing
   TC3: failed query - before query is executed - while sending fragments to 
 other drillbits for execution
   TC4: failed query - during query execution
 It is currently not possible to create a scenario in which a query may hang.
 To check all drillbits up and running and in a clean state, run:
 -select count(*) from sys.drillbits;-
 {code}
 select count(*) from sys.memory;
 {code}




[jira] [Reopened] (DRILL-2060) Support constant folding in expressions in Filter

2015-04-06 Thread Jason Altekruse (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Altekruse reopened DRILL-2060:

  Assignee: Jason Altekruse  (was: Jacques Nadeau)

Messed up the workflow, should have resolved instead of closed

 Support constant folding in expressions in Filter
 -

 Key: DRILL-2060
 URL: https://issues.apache.org/jira/browse/DRILL-2060
 Project: Apache Drill
  Issue Type: New Feature
  Components: Execution - Flow, Query Planning & Optimization
Reporter: Aditya Kishore
Assignee: Jason Altekruse
 Fix For: 0.9.0

 Attachments: DRILL-2060-15-mar-15.patch, 
 DRILL-2060-const-folding-feb-6.diff, 
 DRILL-2060-part1-const-folding-feb-27.patch, 
 DRILL-2060-part2-const-folding-feb-27.patch, DRILL-2060.patch


 If one side of a filter condition is a constant expression, we should be able 
 to evaluate the expression in the planning phase and replace the expression 
 with the constant value.
 For example, the following filter
 {{Where shipdate  date '1997-01-01' + interval '1' year}}
 can be re-written as
 {{Where shipdate  date '1998-01-01'}}
 Or 
 {{Where age + 5  42}}
 can be re-written as
 {{Where age  37}}




[jira] [Created] (DRILL-2704) Schema change not reported in Union All query

2015-04-06 Thread Khurram Faraaz (JIRA)

Khurram Faraaz created DRILL-2704:
-

 Summary: Schema change not reported in Union All query
 Key: DRILL-2704
 URL: https://issues.apache.org/jira/browse/DRILL-2704
 Project: Apache Drill
  Issue Type: Bug
  Components: Query Planning & Optimization
Affects Versions: 0.9.0
 Environment: 4 node cluster on CentOS 
Reporter: Khurram Faraaz
Assignee: Jinfeng Ni


Schema change is not reported in a Union All query. The inputs to Union All 
are read from CSV files.
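
The stack trace later in this report shows a NumberFormatException for 9.99 raised from varCharToInt, which suggests that comparing the text column with an integer literal casts every row to an integer. The Python sketch below mimics that behaviour with a few of the f1.csv values; it is an analogy under that assumption, not Drill code, and the function names are hypothetical.

```python
def filter_equal_int(values, literal):
    """Mimic an implicit text-to-int cast applied to every row before comparing."""
    return [v for v in values if int(v) == literal]


def filter_equal_str(values, literal):
    """Compare as text, as the varchar variant of the query in this report does."""
    return [v for v in values if v == literal]


# A few values from f1.csv, which the reader exposes as text.
rows = ["1", "2", "55", "13", "9.99", "13"]

try:
    filter_equal_int(rows, 13)
    cast_error = None
except ValueError as exc:
    # "9.99" is not a valid integer, so the whole filter fails.
    cast_error = exc

matches = filter_equal_str(rows, "13")
```

The integer comparison fails as soon as it reaches 9.99, while the text comparison returns both rows containing 13.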

{code}
0: jdbc:drill:> select columns[0] from `f1.csv` where columns[0] = 13 union all 
select columns[0] from `f2.csv` where columns[0] = 10;
Query failed: RemoteRpcException: Failure while running fragment., 9.99 [ 
a80303f5-781c-4288-875e-90ad60c8caac on centos-02.qa.lab:31010 ]
[ a80303f5-781c-4288-875e-90ad60c8caac on centos-02.qa.lab:31010 ]


Error: exception while executing query: Failure while executing query. 
(state=,code=0)

0: jdbc:drill:> select * from `f1.csv`;
+-------------------------+
| columns                 |
+-------------------------+
| [1]                     |
| [2]                     |
| [55]                    |
| [77]                    |
| [-100]                  |
| [13]                    |
| [17]                    |
| [19]                    |
| [2]                     |
| [1000]                  |
| [99]                    |
| [9.99]                  |
| [12345.987]             |
| [Hello World]           |
| [this is a test string] |
| [foobar]                |
| []                      |
| [new string]            |
| [34534534]              |
| [345345]                |
| [2321312]               |
| [67867867]              |
| [112319089]             |
| [-14695967]             |
| [100]                   |
| [99]                    |
| [7]                     |
| [5]                     |
| [13]                    |
| []                      |
| []                      |
| []                      |
| [0]                     |
| [1]                     |
| [-1]                    |
| null                    |
| null                    |
| null                    |
| null                    |
+-------------------------+
39 rows selected (0.16 seconds)
0: jdbc:drill:> select * from `f2.csv`;
+----------------+
| columns        |
+----------------+
| [1]            |
| [0]            |
| [-1]           |
| [99]           |
| [1000]         |
| [10]           |
| [-13]          |
| [-17]          |
| [19]           |
| [17]           |
| [13]           |
| []             |
| [no way]       |
| [good quality] |
| []             |
| [done!]        |
| [Yay!!!]       |
| null           |
+----------------+
18 rows selected (0.128 seconds)

Casting columns[0] to varchar gives correct results

0: jdbc:drill:> select cast(columns[0] as varchar(100)) from `f1.csv` where 
columns[0] = '13' union all select cast(columns[0] as varchar(100)) from 
`f2.csv` where columns[0] = '10';
+---------+
| EXPR$0  |
+---------+
| 13      |
| 13      |
| 10      |
+---------+
3 rows selected (0.474 seconds)

Stack trace from drillbit.log for failing query

2015-04-06 23:31:43,674 [2adce8a0-24c8-58ad-2f46-30c42a33ef2a:frag:0:0] ERROR 
o.a.d.e.w.f.AbstractStatusReporter - Error 
a4ec50e8-88ea-4474-9a36-b2250f34e340: Failure while running fragment.
java.lang.NumberFormatException: 9.99
at 
org.apache.drill.exec.expr.fn.impl.StringFunctionHelpers.nfeI(StringFunctionHelpers.java:96)
 ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
at 
org.apache.drill.exec.expr.fn.impl.StringFunctionHelpers.varCharToInt(StringFunctionHelpers.java:121)
 ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
at 
org.apache.drill.exec.test.generated.FiltererGen37.doEval(FilterTemplate2.java:42)
 ~[na:na]
at 
org.apache.drill.exec.test.generated.FiltererGen37.filterBatchNoSV(FilterTemplate2.java:98)
 ~[na:na]
at 
org.apache.drill.exec.test.generated.FiltererGen37.filterBatch(FilterTemplate2.java:71)
 ~[na:na]
at 
org.apache.drill.exec.physical.impl.filter.FilterRecordBatch.doWork(FilterRecordBatch.java:82)
 ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
at 
org.apache.drill.exec.record.AbstractSingleRecordBatch.innerNext(AbstractSingleRecordBatch.java:93)
 ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
at 
org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:142)
 ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
at 
org.apache.drill.exec.physical.impl.validate.IteratorValidatorBatchIterator.next(IteratorValidatorBatchIterator.java:118)
 ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
at 
org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:99)
 ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
at 
org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:89)
 ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
at 
org.apache.drill.exec.record.AbstractSingleRecordBatch.innerNext(AbstractSingleRecordBatch.java:51)
 ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
at 
org.apache.drill.exec.physical.impl.svremover.RemovingRecordBatch.innerNext(RemovingRecordBatch.java:96)
 ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
at 
org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:142)
 ~[drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]

[jira] [Closed] (DRILL-2681) add sql functions docs (string, convert_to/from)

2015-04-06 Thread Kristine Hahn (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kristine Hahn closed DRILL-2681.


 add sql functions docs (string, convert_to/from)
 

 Key: DRILL-2681
 URL: https://issues.apache.org/jira/browse/DRILL-2681
 Project: Apache Drill
  Issue Type: Task
  Components: Documentation
Reporter: Kristine Hahn
Assignee: Kristine Hahn

 Includes SQL scalar functions, maprdb plugin doc fixes, time zone limitations 
 section, Query HBase doc fixes, fix broken links in contribution guidelines 
 and contribution ideas, prelim. overview of config options




[jira] [Updated] (DRILL-1686) Formalize and document sqlline logging configuration

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-1686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-1686:
-
Fix Version/s: (was: 0.9.0)
   1.1.0

 Formalize and document sqlline logging configuration
 

 Key: DRILL-1686
 URL: https://issues.apache.org/jira/browse/DRILL-1686
 Project: Apache Drill
  Issue Type: Bug
  Components: Client - CLI
Affects Versions: 0.7.0
Reporter: Jacques Nadeau
Assignee: Daniel Barclay (Drill)
Priority: Minor
 Fix For: 1.1.0


 The drillbit install creates a zero-size sqlline.log and does not log any 
 failures from the sqlline CLI. All error/info/debug messages are logged to 
 drillbit.log.
 We need a way to log sqlline errors to sqlline.log. 




[jira] [Updated] (DRILL-1686) Formalize and document sqlline logging configuration

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-1686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-1686:
-
Assignee: Daniel Barclay (Drill)  (was: Mehant Baid)

 Formalize and document sqlline logging configuration
 

 Key: DRILL-1686
 URL: https://issues.apache.org/jira/browse/DRILL-1686
 Project: Apache Drill
  Issue Type: Bug
  Components: Client - CLI
Affects Versions: 0.7.0
Reporter: Jacques Nadeau
Assignee: Daniel Barclay (Drill)
Priority: Minor
 Fix For: 1.1.0


 The drillbit install creates a zero-size sqlline.log and does not log any 
 failures from the sqlline CLI. All error/info/debug messages are logged to 
 drillbit.log.
 We need a way to log sqlline errors to sqlline.log. 




[jira] [Comment Edited] (DRILL-2508) Add new column to sys.options table that exposes whether or not the current system value is the default

2015-04-06 Thread Sudheesh Katkam (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-2508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14394684#comment-14394684
 ] 

Sudheesh Katkam edited comment on DRILL-2508 at 4/6/15 9:43 PM:


Updated patch, use status instead of boolean.

Statuses can be: BOOT, DEFAULT, CHANGED


was (Author: sudheeshkatkam):
Updated patch with status instead of boolean.

Statuses can be: BOOT, DEFAULT, CHANGED

 Add new column to sys.options table that exposes whether or not the current 
 system value is the default
 ---

 Key: DRILL-2508
 URL: https://issues.apache.org/jira/browse/DRILL-2508
 Project: Apache Drill
  Issue Type: Improvement
  Components: Storage - Other
Reporter: Victoria Markman
Assignee: Sudheesh Katkam
 Fix For: 0.9.0

 Attachments: DRILL-2508.1.patch.txt, DRILL-2508.2.patch.txt


 Need to be able to see system parameters that I changed.
 There is an enhancement already to reset them to default values: drill-1065
 I don't necessarily want to do that, I just want to see only things that I 
 changed : default value vs. my change.




[jira] [Created] (DRILL-2702) Support more advanced constant expression folding

2015-04-06 Thread Jason Altekruse (JIRA)

Jason Altekruse created DRILL-2702:
--

 Summary: Support more advanced constant expression folding
 Key: DRILL-2702
 URL: https://issues.apache.org/jira/browse/DRILL-2702
 Project: Apache Drill
  Issue Type: Improvement
  Components: Query Planning & Optimization
Reporter: Jason Altekruse
Assignee: Jinfeng Ni


A variety of constant expressions are now folded at planning time with the work 
completed for DRILL-2060. One of the examples listed in that issue is not yet 
supported. Supporting an expression rewrite like it will require more work; 
this issue tracks that enhancement request. These rewrites are particularly 
useful for Drill because its speed is often limited by the types of filter 
pushdown supported by the underlying storage system. For systems that support 
range scans, we can rewrite inequalities to range scans in simple cases. 
Currently we cannot rewrite more complicated inequalities like the one below. 
This query would prompt a full table scan with a filter, but the 
transformation shown can turn it into a range scan.

Where age + 5  42
can be re-written as
Where age  37
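
A minimal Python sketch of this kind of rewrite is shown below. The comparison operator in the example above did not survive in this archive, so the sketch takes the operator as a parameter and passes it through unchanged, which is valid when a constant is only added to the column. The names `rewrite` and `evaluate` are hypothetical, and numeric overflow, which a real planner would have to consider, is ignored.

```python
import operator

# Comparison operators the sketch knows how to carry across the rewrite.
OPS = {
    "<": operator.lt,
    "<=": operator.le,
    ">": operator.gt,
    ">=": operator.ge,
    "=": operator.eq,
}


def rewrite(column, addend, op, bound):
    """Rewrite `column + addend <op> bound` as `column <op> bound - addend`."""
    if op not in OPS:
        raise ValueError(f"unsupported operator: {op}")
    # Subtracting the same constant from both sides keeps the comparison
    # direction, leaving the bare column on the left-hand side.
    return (column, op, bound - addend)


def evaluate(predicate, row):
    """Evaluate a rewritten (column, op, bound) predicate against one row."""
    column, op, bound = predicate
    return OPS[op](row[column], bound)


# age + 5 <op> 42 becomes age <op> 37 for any supported operator.
pushed_down = rewrite("age", 5, "<", 42)
```

Once the column stands alone on one side, a storage system that supports range scans can use the bound directly instead of scanning the whole table and filtering afterwards.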




[jira] [Resolved] (DRILL-2681) add sql functions docs (string, convert_to/from)

2015-04-06 Thread Kristine Hahn (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kristine Hahn resolved DRILL-2681.
--
Resolution: Fixed

 add sql functions docs (string, convert_to/from)
 

 Key: DRILL-2681
 URL: https://issues.apache.org/jira/browse/DRILL-2681
 Project: Apache Drill
  Issue Type: Task
  Components: Documentation
Reporter: Kristine Hahn
Assignee: Kristine Hahn

 Includes SQL scalar functions, maprdb plugin doc fixes, time zone limitations 
 section, Query HBase doc fixes, fix broken links in contribution guidelines 
 and contribution ideas, prelim. overview of config options




[jira] [Resolved] (DRILL-1840) Cloning the source repository as documented in the wiki

2015-04-06 Thread Kristine Hahn (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-1840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kristine Hahn resolved DRILL-1840.
--
Resolution: Fixed

Sorry for the delay. Thanks for letting us know about the broken links. 

We fixed the links in the docs at the new site http://drill.apache.org/docs/. 
We're moving away from the wiki docs.

The corrected links are in these sections:

* http://drill.apache.org/docs/apache-drill-contribution-guidelines/
* http://drill.apache.org/docs/apache-drill-contribution-ideas/

Thanks!


 Cloning the source repository as documented in the wiki
 ---

 Key: DRILL-1840
 URL: https://issues.apache.org/jira/browse/DRILL-1840
 Project: Apache Drill
  Issue Type: Improvement
  Components: Documentation
 Environment: wiki
Reporter: Karl Heinz Marbaise
Assignee: Kristine Hahn
 Fix For: Future


 On the following page 
 https://cwiki.apache.org/confluence/display/DRILL/Compiling+Drill+from+Source 
 the instructions say to clone the following repository:
 {{git clone https://git-wip-us.apache.org/repos/asf/incubator-drill.git}}
 which is no longer possible because the repository name has changed 
 with the migration from incubator to TLP. The command should now be:
 {{git clone https://git-wip-us.apache.org/repos/asf/drill.git}}
 Unfortunately I don't have the karma to change the wiki or to log in to the 
 wiki, so I couldn't fix this issue.




[jira] [Updated] (DRILL-1687) Remove references to deprecated dense Decimal representation

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-1687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-1687:
-
Fix Version/s: (was: 0.9.0)
   1.1.0

 Remove references to deprecated dense Decimal representation
 

 Key: DRILL-1687
 URL: https://issues.apache.org/jira/browse/DRILL-1687
 Project: Apache Drill
  Issue Type: Improvement
  Components: Execution - Data Types
Reporter: Jason Altekruse
Assignee: Mehant Baid
Priority: Minor
 Fix For: 1.1.0







[jira] [Closed] (DRILL-2060) Support constant folding in expressions in Filter

2015-04-06 Thread Jason Altekruse (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Altekruse closed DRILL-2060.
--
Resolution: Fixed

Fixed in 8d70709ddaddc7111517f4bed883fc9908ef94de

 Support constant folding in expressions in Filter
 -

 Key: DRILL-2060
 URL: https://issues.apache.org/jira/browse/DRILL-2060
 Project: Apache Drill
  Issue Type: New Feature
  Components: Execution - Flow, Query Planning & Optimization
Reporter: Aditya Kishore
Assignee: Jacques Nadeau
 Fix For: 0.9.0

 Attachments: DRILL-2060-15-mar-15.patch, 
 DRILL-2060-const-folding-feb-6.diff, 
 DRILL-2060-part1-const-folding-feb-27.patch, 
 DRILL-2060-part2-const-folding-feb-27.patch, DRILL-2060.patch


 If one side of a filter condition is a constant expression, we should be able 
 to evaluate the expression in the planning phase and replace the expression 
 with the constant value.
 For example, the following filter
 {{Where shipdate  date '1997-01-01' + interval '1' year}}
 can be re-written as
 {{Where shipdate  date '1998-01-01'}}
 Or 
 {{Where age + 5  42}}
 can be re-written as
 {{Where age  37}}
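Constant folding as described above can be sketched as a bottom-up pass over an expression tree. This is an illustrative Python sketch under stated assumptions, not the code from the attached patches: the tuple-based expression format and the `fold` function are hypothetical, the `<` comparison is an arbitrary placeholder, and leap-day handling for year intervals is ignored.

```python
from datetime import date


def fold(expr):
    """Fold constant sub-expressions bottom-up.

    Expressions are hypothetical tuples:
      ("col", name) | ("date", date) | ("years", n) | ("int", n)
      ("+", left, right) | ("cmp", op, left, right)
    """
    kind = expr[0]
    if kind == "+":
        left, right = fold(expr[1]), fold(expr[2])
        if left[0] == "date" and right[0] == "years":
            d = left[1]
            # Leap-day handling is ignored in this sketch.
            return ("date", d.replace(year=d.year + right[1]))
        if left[0] == "int" and right[0] == "int":
            return ("int", left[1] + right[1])
        return ("+", left, right)  # not constant on both sides: keep as is
    if kind == "cmp":
        return ("cmp", expr[1], fold(expr[2]), fold(expr[3]))
    return expr  # leaves: columns and literals


# shipdate <op> date '1997-01-01' + interval '1' year
filter_expr = ("cmp", "<", ("col", "shipdate"),
               ("+", ("date", date(1997, 1, 1)), ("years", 1)))
folded = fold(filter_expr)
print(folded)
```

Only the constant side is evaluated at planning time; a node such as `age + 5` is left untouched because one operand is a column. Rewriting `age + 5` against 42 into `age` against 37 additionally requires moving the constant across the comparison.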




[jira] [Resolved] (DRILL-2448) Remove outdated code to ignore type resolution with varchar vs varbinary now that implicit casting subsumes it

2015-04-06 Thread Jason Altekruse (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Altekruse resolved DRILL-2448.

   Resolution: Fixed
Fix Version/s: (was: 1.0.0)
   0.9.0

Fixed in a49b99ff1a679a0dfb696686d62449edbb231f37

 Remove outdated code to ignore type resolution with varchar vs varbinary now 
 that implicit casting subsumes it
 --

 Key: DRILL-2448
 URL: https://issues.apache.org/jira/browse/DRILL-2448
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Affects Versions: 0.7.0
Reporter: Jason Altekruse
Assignee: Jason Altekruse
Priority: Critical
 Fix For: 0.9.0

 Attachments: DRILL-2448-varchar-varbinary-implicit-cast.patch


 Function resolution included a small condition to allow varchar and varbinary 
 functions to be resolved for either incoming type. While it is valid to 
 implicitly cast between these two, this early workaround creates a 
 technically invalid expression tree that happens to work with the current 
 code generation system. However, this does create an issue for the interpreted 
 expression evaluator. Removing the code simply causes an implicit cast to be 
 added during materialization, which works for both generated-code expression 
 evaluation and the interpreter.




[jira] [Updated] (DRILL-2539) NullReader should allocate an empty vector in copy* methods

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2539:
-
Component/s: Execution - Data Types

 NullReader should allocate an empty vector in copy* methods
 ---

 Key: DRILL-2539
 URL: https://issues.apache.org/jira/browse/DRILL-2539
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types, Execution - Flow
Reporter: Hanifi Gunes
Assignee: Hanifi Gunes
 Fix For: 0.9.0


 Projecting a non-existent field from a repeated type fails with an NPE, 
 mainly because the projected vector is not allocated and the underlying buffer is 
 dead. This issue proposes to allocate an empty vector in NullReader's copy* 
 methods.




[jira] [Updated] (DRILL-2611) Fixed length value vectors report invalid value count

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2611:
-
Component/s: Execution - Data Types

 Fixed length value vectors report invalid value count
 -

 Key: DRILL-2611
 URL: https://issues.apache.org/jira/browse/DRILL-2611
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Reporter: Hanifi Gunes
Assignee: Hanifi Gunes
 Fix For: 0.9.0


 We maintain a dedicated value count variable in fixed vectors; however, we 
 don't update it upon calling set/Safe. The accessor reports the value count from 
 the variable, ignoring values that are already in the buffer or written via 
 set/Safe. This causes execution failures manifested as IOOB when the underlying 
 data is sparse. We should either remove the variable and report the value count 
 by inspecting the buffer directly, if that is not computationally expensive, or 
 update the variable each time we write to the vector.
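The second option (updating the count on every write) can be sketched as follows. This is a toy Python model, not Drill's vector classes: `FixedWidthVector` and its methods are hypothetical names, and a plain list stands in for the underlying buffer.

```python
class FixedWidthVector:
    """Toy stand-in for a fixed-width value vector (not Drill's class)."""

    def __init__(self, capacity):
        self._buffer = [0] * capacity
        self._value_count = 0

    def set(self, index, value):
        self._buffer[index] = value
        # Keep the count in step with the highest index written so far,
        # so sparse writes are still covered by the reported count.
        self._value_count = max(self._value_count, index + 1)

    def value_count(self):
        return self._value_count

    def get(self, index):
        if index >= self._value_count:
            raise IndexError(index)
        return self._buffer[index]


vec = FixedWidthVector(capacity=8)
vec.set(0, 10)
vec.set(5, 60)  # sparse write: indexes 1..4 are never set
print(vec.value_count())
```

With the count tracked on write, a reader that trusts `value_count()` no longer runs past the reported end when the data is sparse.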




[jira] [Updated] (DRILL-2164) Composite vectors should rely on VectorContainer for maintaining child vectors

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2164:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 Composite vectors should rely on VectorContainer for maintaining child vectors
 --

 Key: DRILL-2164
 URL: https://issues.apache.org/jira/browse/DRILL-2164
 Project: Apache Drill
  Issue Type: Sub-task
  Components: Execution - Data Types
Reporter: Hanifi Gunes
Assignee: Hanifi Gunes
 Fix For: 1.0.0


 As of DRILL-1885, the logic that handles maintaining vectors is mostly 
 scattered and repeated across AbstractContainerVector and VectorContainer. We 
 should come up with an abstraction unifying vector container logic for better 
 code re-use.




[jira] [Updated] (DRILL-2141) Data type error in group by and order by for JSON

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2141:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 Data type error in group by and order by for JSON
 -

 Key: DRILL-2141
 URL: https://issues.apache.org/jira/browse/DRILL-2141
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Affects Versions: 0.7.0
Reporter: Andries Engelbrecht
Assignee: Hanifi Gunes
 Fix For: 1.0.0

 Attachments: FlumeData.1422748800086, drillbit.log, new_drillbit.log


 When doing group by and order by on complex nested JSON, I get data type 
 errors.
 Query:
 select t.retweeted_status.`user`.name as name, count(t.retweeted_status.id) 
 as rt_count from `./nfl` t where t.retweeted_status.`user`.name is not null 
 group by t.retweeted_status.`user`.name order by count(t.retweeted_status.id) 
 desc limit 10;
 Screen output:
 Query failed: Query failed: Failure while running fragment., Failure while 
 reading vector.  Expected vector class of 
 org.apache.drill.exec.vector.NullableIntVector but was holding vector class 
 org.apache.drill.exec.vector.NullableVarCharVector. [ 
 c6ea670f-5fa0-491c-acfb-5ccd128ec324 on drilldemo:31010 ]
 [ c6ea670f-5fa0-491c-acfb-5ccd128ec324 on drilldemo:31010 ]
 java.lang.RuntimeException: java.sql.SQLException: Failure while executing 
 query.
   at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
   at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
   at sqlline.SqlLine.print(SqlLine.java:1809)
   at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
   at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
   at sqlline.SqlLine.dispatch(SqlLine.java:889)
   at sqlline.SqlLine.begin(SqlLine.java:763)
   at sqlline.SqlLine.start(SqlLine.java:498)
   at sqlline.SqlLine.main(SqlLine.java:460)
 Drill log attached




[jira] [Updated] (DRILL-2504) Aggregate query with grouping results in Error

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2504:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 Aggregate query with grouping results in Error
 --

 Key: DRILL-2504
 URL: https://issues.apache.org/jira/browse/DRILL-2504
 Project: Apache Drill
  Issue Type: Bug
  Components: Functions - Drill
Affects Versions: 0.8.0
 Environment: 4 node cluster
 {code}
 0: jdbc:drill: select * from sys.version;
 +++-+-++
 | commit_id  | commit_message | commit_time | build_email | build_time |
 +++-+-++
 | f658a3c513ddf7f2d1b0ad7aa1f3f65049a594fe | DRILL-2209 Insert 
 ProjectOperator with MuxExchange | 09.03.2015 @ 01:49:18 EDT | Unknown | 
 09.03.2015 @ 04:52:49 EDT |
 +++-+-++
 1 row selected (0.062 seconds)
 {code}
Reporter: Khurram Faraaz
Assignee: Mehant Baid
 Fix For: 1.0.0


 The below aggregate query with group by over distinct/non-distinct data 
 results in an Exception. Please note that I had set enable_hashagg=false and 
 I was querying from a CSV file. Query was run over a four node cluster.
 alter system set `planner.enable_hashagg`=true;
 alter session set `planner.enable_hashagg`=true;
 {code}
 0: jdbc:drill: alter system set `planner.enable_hashagg`=false;
 +++
 | ok |  summary   |
 +++
 | true   | planner.enable_hashagg updated. |
 +++
 1 row selected (0.075 seconds)
 0: jdbc:drill: select columns[4], sum(columns[0]), count(distinct 
 columns[1]), max(columns[2]), count(distinct columns[3]), max(columns[5]), 
 min(columns[6]), avg(columns[7])
 . . . . . . .  from `conftest.csv`
 . . . . . . .  group by columns[4];
 Query failed: Query stopped., Failure while trying to materialize incoming 
 schema.  Errors:
  
 Error in expression at index -1.  Error: Missing function implementation: 
 [castINT(BIT-OPTIONAL)].  Full expression: --UNKNOWN EXPRESSION--.. [ 
 6cd09ba7-3e4b-4b3b-b111-39f74f53e1b0 on centos-01.qa.lab:31010 ]
 Error: exception while executing query: Failure while executing query. 
 (state=,code=0)
 {code}
 {code}
 Stack trace from drillbit.log
 2015-03-19 17:47:43,123 [2af4f441-8c04-99f9-1a12-a55a7c72ece7:frag:0:0] ERROR 
 o.a.d.e.w.f.AbstractStatusReporter - Error 
 bab1babd-48fe-4719-8a77-dc5826027ba7: Failure while running fragment.
 org.apache.drill.exec.exception.SchemaChangeException: Failure while trying 
 to materialize incoming schema.  Errors:
 Error in expression at index -1.  Error: Missing function implementation: 
 [castINT(BIT-OPTIONAL)].  Full expression: --UNKNOWN EXPRESSION--..
 at 
 org.apache.drill.exec.physical.impl.project.ProjectRecordBatch.setupNewSchema(ProjectRecordBatch.java:390)
  ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
 at 
 org.apache.drill.exec.record.AbstractSingleRecordBatch.innerNext(AbstractSingleRecordBatch.java:78)
  ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
 at 
 org.apache.drill.exec.physical.impl.project.ProjectRecordBatch.innerNext(ProjectRecordBatch.java:134)
  ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
 at 
 org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:142)
  ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
 at 
 org.apache.drill.exec.physical.impl.validate.IteratorValidatorBatchIterator.next(IteratorValidatorBatchIterator.java:118)
  ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
 at 
 org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:99)
  ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
 at 
 org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:89)
  ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
 at 
 org.apache.drill.exec.record.AbstractSingleRecordBatch.innerNext(AbstractSingleRecordBatch.java:51)
  ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
 at 
 org.apache.drill.exec.physical.impl.project.ProjectRecordBatch.innerNext(ProjectRecordBatch.java:134)
  ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
 at 
 org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:142)
  ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
 at 
 org.apache.drill.exec.physical.impl.validate.IteratorValidatorBatchIterator.next(IteratorValidatorBatchIterator.java:118)
  ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]

[jira] [Updated] (DRILL-2598) Order by with limit on complex type throw IllegalStateException

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2598:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 Order by with limit on complex type throw IllegalStateException
 ---

 Key: DRILL-2598
 URL: https://issues.apache.org/jira/browse/DRILL-2598
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Affects Versions: 0.8.0
Reporter: Chun Chang
Assignee: Hanifi Gunes
Priority: Blocker
 Fix For: 1.0.0


 drill 0.8 release candidate:
 {code}
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ select * from sys.version;
 +++-+-++
 | commit_id  | commit_message | commit_time | build_email | build_time |
 +++-+-++
 | 462e50ce9c4b829c2a4bafdeb9763bfba677c726 | DRILL-2575: 
 FragmentExecutor.cancel() blasts through state transitions regardless of 
 current state | 25.03.2015 @ 21:11:23 PDT |
 {code}
 The following query involving order by and limit caused the 
 IllegalStateException:
 {code}
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ select t.id, t.soa from 
 `complex.json` t order by t.id limit 10;
 +++
 | id |soa |
 +++
 | 1  | 
 [{in:1},{in:1,fl:1.12345},{in:1,fl:10.12345},{in:1,fl:10.6789,bool:true,str:here
  is a string at row 1}] |
 | 2  | 
 [{in:2},{in:2,fl:2.12345},{in:2,fl:20.12345},{in:2,fl:20.6789,bool:true,str:here
  is a string at row 2,nul:not null}] |
 | 3  | 
 [{in:3},{in:3,fl:3.12345},{in:3,fl:30.12345,nul:not 
 null},{in:3,fl:30.6789,bool:true,str:here is a string at row 3}] |
 | 4  | 
 [{in:4},{in:4,fl:4.12345},{in:4,fl:40.12345,nul:not 
 null},{in:4,fl:40.6789,bool:false,str:here is a string at row 
 4,nul:not null}] |
 | 5  | 
 [{in:5},{in:5,fl:5.12345},{in:5,fl:50.12345,nul:not 
 null},{in:5,fl:50.6789,bool:false,str:here is a string at row 5}] |
 | 6  | 
 [{in:6},{in:6,fl:6.12345},{in:6,fl:60.12345,nul:not 
 null},{in:6,fl:60.6789,bool:false,str:here is a string at row 6}] |
 | 7  | 
 [{in:7},{in:7,fl:7.12345},{in:7,fl:70.12345,nul:not 
 null},{in:7,fl:70.6789,bool:false,str:here is a string at row 
 7,nul:not null}] |
 | 8  | 
 [{in:8},{in:8,fl:8.12345},{in:8,fl:80.12345,nul:not 
 null},{in:8,fl:80.6789,bool:true,str:here is a string at row 
 8,nul:not null}] |
 | 9  | 
 [{in:9},{in:9,fl:9.12345},{in:9,fl:90.12345,nul:not 
 null},{in:9,fl:90.6789,bool:true,str:here is a string at row 9}] |
 | 10 | 
 [{in:10},{in:10,fl:10.12345},{in:10,fl:100.12345,nul:not 
 null},{in:10,fl:100.6789,bool:false,str:here is a string at row 
 10,nul:not null}] |
 Query failed: RemoteRpcException: Failure while running fragment., Attempted 
 to close accountor with 25 buffer(s) still allocatedfor QueryId: 
 2aeb3baf-acc1-5615-4537-f215a47d4893, MajorFragmentId: 0, MinorFragmentId: 0.
   Total 25 allocation(s) of byte size(s): 512, 512, 512, 512, 512, 512, 
 512, 512, 512, 512, 512, 512, 512, 512, 512, 512, 512, 512, 512, 512, 512, 
 512, 512, 512, 512, at stack location:
   
 org.apache.drill.exec.memory.TopLevelAllocator$ChildAllocator.buffer(TopLevelAllocator.java:231)
   
 org.apache.drill.exec.vector.BitVector.allocateNewSafe(BitVector.java:95)
   
 org.apache.drill.exec.vector.BitVector.allocateNew(BitVector.java:78)
   
 org.apache.drill.exec.vector.NullableBitVector.allocateNew(NullableBitVector.java:168)
   
 org.apache.drill.exec.vector.complex.RepeatedMapVector$RepeatedMapTransferPair.init(RepeatedMapVector.java:282)
   
 org.apache.drill.exec.vector.complex.RepeatedMapVector$RepeatedMapTransferPair.init(RepeatedMapVector.java:260)
   
 org.apache.drill.exec.vector.complex.RepeatedMapVector.getTransferPair(RepeatedMapVector.java:126)
   
 org.apache.drill.exec.physical.impl.sort.RecordBatchData.init(RecordBatchData.java:57)
   
 org.apache.drill.exec.physical.impl.TopN.TopNBatch.innerNext(TopNBatch.java:222)
   
 org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:142)
   
 org.apache.drill.exec.physical.impl.validate.IteratorValidatorBatchIterator.next(IteratorValidatorBatchIterator.java:118)
   
 org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:99)
   
 org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:89)
   
 org.apache.drill.exec.record.AbstractSingleRecordBatch.innerNext(AbstractSingleRecordBatch.java:51)

[jira] [Updated] (DRILL-1951) Can't cast numeric value with decimal point read from CSV file into integer data type

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-1951:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 Can't cast numeric value with decimal point read from CSV file into integer 
 data type
 -

 Key: DRILL-1951
 URL: https://issues.apache.org/jira/browse/DRILL-1951
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Affects Versions: 0.8.0
Reporter: Victoria Markman
Assignee: Mehant Baid
 Fix For: 1.0.0


 sales.csv file:
 {code}
 997,Ford,ME350,3000.00, comment#1
 1999,Chevy,Venture,4900.00, comment#2
 1999,Chevy,Venture,5000.00, comment#3
 1996,Jeep,Cherokee,1.01, comment#4
 0: jdbc:drill:schema=dfs select cast(columns[3] as decimal(18,2))  from 
 `sales.csv`;
 ++
 |   EXPR$0   |
 ++
 | 3000.00|
 | 4900.00|
 | 5000.00|
 | 1.01   |
 ++
 4 rows selected (0.093 seconds)
 {code}
 -- Can cast to decimal
 {code}
 0: jdbc:drill:schema=dfs select cast(columns[3] as decimal(18,2))  from 
 `sales.csv`;
 ++
 |   EXPR$0   |
 ++
 | 3000.00|
 | 4900.00|
 | 5000.00|
 | 1.01   |
 ++
 4 rows selected (0.095 seconds)
 {code}
 -- Can cast to float
 {code}
 0: jdbc:drill:schema=dfs select cast(columns[3] as float)  from `sales.csv`;
 ++
 |   EXPR$0   |
 ++
 | 3000.0 |
 | 4900.0 |
 | 5000.0 |
 | 1.01   |
 ++
 4 rows selected (0.112 seconds)
 {code}
 -- Can't cast to INT/BIGINT
 {code}
 0: jdbc:drill:schema=dfs select cast(columns[3] as bigint)  from `sales.csv`;
 Query failed: Query failed: Failure while running fragment., 3000.00 [ 
 4818451a-c731-48a9-9992-1e81ab1d520d on atsqa4-134.qa.lab:31010 ]
 [ 4818451a-c731-48a9-9992-1e81ab1d520d on atsqa4-134.qa.lab:31010 ]
 Error: exception while executing query: Failure while executing query. 
 (state=,code=0)
 {code}
 -- Same works with json/parquet files
 {code}
 0: jdbc:drill:schema=dfs select a1  from `t1.json`;
 ++
 | a1 |
 ++
 | 10.01  |
 ++
 1 row selected (0.077 seconds)
 0: jdbc:drill:schema=dfs select cast(a1 as int)  from `t1.json`;
 ++
 |   EXPR$0   |
 ++
 | 10 |
 ++
 0: jdbc:drill:schema=dfs select * from test_cast;
 ++
 | a1 |
 ++
 | 10.0100|
 ++
 1 row selected (0.06 seconds)
 0: jdbc:drill:schema=dfs select cast(a1 as int) from test_cast;
 ++
 |   EXPR$0   |
 ++
 | 10 |
 ++
 1 row selected (0.094 seconds)
 {code}
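The failure above has a close analogue in other languages: a string such as "3000.00" is not a valid integer literal, so a direct string-to-integer parse fails, while going through a decimal type first succeeds. A small Python sketch of that idea, offered as an analogy rather than a description of Drill's cast functions; `cast_to_bigint` is a hypothetical helper, and truncation toward zero is an assumption (the `10.01` to `10` result above is consistent with either truncation or rounding).

```python
from decimal import Decimal


def cast_to_bigint(text):
    """Convert a numeric string, with or without a fraction, to an int.

    Hypothetical helper: parses as Decimal first, then truncates toward zero.
    """
    return int(Decimal(text))


values = ["3000.00", "4900.00", "5000.00", "1.01"]
print([cast_to_bigint(v) for v in values])

# A direct integer parse of the same text is rejected.
try:
    int("3000.00")
except ValueError as err:
    print("direct parse failed:", err)
```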




[jira] [Updated] (DRILL-2150) Create an abstraction for repeated value vectors.

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2150:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 Create an abstraction for repeated value vectors.
 -

 Key: DRILL-2150
 URL: https://issues.apache.org/jira/browse/DRILL-2150
 Project: Apache Drill
  Issue Type: Sub-task
  Components: Execution - Data Types
Reporter: Hanifi Gunes
Assignee: Hanifi Gunes
 Fix For: 1.0.0


 This sub-task is concerned with creating an abstraction for repeated value 
 vectors. The existing abstraction seems invalid. The purpose is to provide a 
 minimal interface that enables code re-usability.
 The proposal is to preserve existing functionalities such as exposing group 
 count, group size as well as providing low level access to underlying offsets 
 and data vectors.
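One way to picture the minimal interface described above is an offsets list layered over a flat data list. The Python sketch below is illustrative only: `RepeatedVector` and its method names are hypothetical and are not the interface proposed for Drill.

```python
class RepeatedVector:
    """Toy repeated-value vector: an offsets list over a flat data list."""

    def __init__(self):
        self.offsets = [0]  # offsets[i]..offsets[i + 1] delimit group i
        self.data = []      # values of all groups, stored back to back

    def add_group(self, values):
        self.data.extend(values)
        self.offsets.append(len(self.data))

    def group_count(self):
        return len(self.offsets) - 1

    def group_size(self, index):
        return self.offsets[index + 1] - self.offsets[index]

    def group(self, index):
        return self.data[self.offsets[index]:self.offsets[index + 1]]


rv = RepeatedVector()
rv.add_group([1, 2, 3])
rv.add_group([])       # an empty group still takes an offsets entry
rv.add_group([4])
print(rv.group_count(), rv.group_size(1), rv.group(2), rv.offsets)
```

Group count and group size are both derived from the offsets, so exposing `offsets` and `data` directly gives the low-level access mentioned above without duplicating state.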




[jira] [Updated] (DRILL-2669) Error happening without limit clause and works with limit clause

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2669:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 Error happening without limit clause and works with limit clause
 

 Key: DRILL-2669
 URL: https://issues.apache.org/jira/browse/DRILL-2669
 Project: Apache Drill
  Issue Type: Bug
  Components: Functions - Drill
Affects Versions: 0.8.0
 Environment: mapr sandbox 4.0.2
Reporter: Sudhakar Thota
Assignee: Mehant Baid
 Fix For: 1.0.0


 Perhaps this could be a bug. I get the same results.
 But the plans are very different: the UnionExchange is set up immediately after 
 the scan operation in the successful case (Case 1), whereas the UnionExchange 
 happens after the scan and project in the unsuccessful case (Case 2).
 Case 1. Successful case:
 {code}
 0: jdbc:drill: explain plan for select to_timestamp(t.t, 
 '-MM-dd''T''HH:mm:ss.SSS''Z''') FROM (select * from 
 dfs.sthota_prq.`/tstamp_test/*.parquet` limit 13015351) t;
 --+
 text  json
 --+
 00-00 Screen
 00-01 Project(EXPR$0=[TO_TIMESTAMP(ITEM($0, 't'), 
 '-MM-dd''T''HH:mm:ss.SSS''Z''')])
 00-02 SelectionVectorRemover
 00-03 Limit(fetch=[13015351])
 00-04 UnionExchange
 01-01 Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath 
 [path=maprfs:/mapr/demo.mapr.com/user/sthota/parquet/tstamp_test/1_2_0.parquet],
  ReadEntryWithPath 
 [path=maprfs:/mapr/demo.mapr.com/user/sthota/parquet/tstamp_test/1_1_0.parquet],
  ReadEntryWithPath 
 [path=maprfs:/mapr/demo.mapr.com/user/sthota/parquet/tstamp_test/1_0_0.parquet]],
  selectionRoot=/mapr/demo.mapr.com/user/sthota/parquet/tstamp_test, 
 numFiles=3, columns=[`*`]]])
 {
 head :
 Unknown macro: { version }
 ,
 {code}
 Case 2. Unsuccessful case:
 {code}
 0: jdbc:drill: explain plan for select to_timestamp(t.t, 
 '-MM-dd''T''HH:mm:ss.SSS''Z''') FROM (select * from 
 dfs.sthota_prq.`/tstamp_test/*.parquet` ) t;
 --+
 text  json
 --+
 00-00 Screen
 00-01 UnionExchange
 01-01 Project(EXPR$0=[TO_TIMESTAMP(ITEM($0, 't'), 
 '-MM-dd''T''HH:mm:ss.SSS''Z''')])
 01-02 Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath 
 [path=maprfs:/mapr/demo.mapr.com/user/sthota/parquet/tstamp_test/1_2_0.parquet],
  ReadEntryWithPath 
 [path=maprfs:/mapr/demo.mapr.com/user/sthota/parquet/tstamp_test/1_1_0.parquet],
  ReadEntryWithPath 
 [path=maprfs:/mapr/demo.mapr.com/user/sthota/parquet/tstamp_test/1_0_0.parquet]],
  selectionRoot=/mapr/demo.mapr.com/user/sthota/parquet/tstamp_test, 
 numFiles=3, columns=[`*`]]])
 {
 head :
 Unknown macro: { version }
 ,
 {code}
 {code}
 0: jdbc:drill: select to_timestamp(t.t, '-MM-dd''T''HH:mm:ss.SSS''Z''') 
 FROM (select * from dfs.sthota_prq.`/tstamp_test/*.parquet` limit 10) t;
 
 EXPR$0
 
 2015-01-27 13:43:53.0
 2015-01-27 13:43:49.0
 2015-01-27 13:43:47.0
 2015-01-27 13:43:47.0
 2015-01-27 13:43:47.0
 2015-01-27 13:43:45.0
 2015-01-27 13:43:43.0
 2015-01-27 13:43:43.0
 2015-01-27 13:43:43.0
 2015-01-27 13:43:39.0
 
 10 rows selected (1.127 seconds)
 {code}
 {code}
 0: jdbc:drill: select to_timestamp(t.t, '-MM-dd''T''HH:mm:ss.SSS''Z''') 
 FROM (select * from dfs.sthota_prq.`/tstamp_test/*.parquet`) t;
 Query failed: RemoteRpcException: Failure while trying to start remote 
 fragment, Expression has syntax error! line 1:30:mismatched input 'T' 
 expecting CParen [ ab817e5a-9b74-47dd-b3c6-3bbf025c7de9 on maprdemo:31010 ]
 Error: exception while executing query: Failure while executing query. 
 (state=,code=0)
 {code}




[jira] [Updated] (DRILL-2685) Unique-ify local Hive metastore directory or unit test fails

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2685:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 Unique-ify local Hive metastore directory or unit test fails
 

 Key: DRILL-2685
 URL: https://issues.apache.org/jira/browse/DRILL-2685
 Project: Apache Drill
  Issue Type: Bug
Reporter: Hanifi Gunes
Assignee: Hanifi Gunes
Priority: Blocker
 Fix For: 1.0.0


 Hive test suites subclass HiveTestBase, which in turn generates data. When 
 tests are run concurrently, one removes the common directory while 
 another is working in it, causing unit tests to fail.
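The fix suggested by the title is to give each run its own directory. A minimal Python sketch of that pattern, assuming nothing about how HiveTestBase is actually implemented: `make_metastore_dir` and the `hive-metastore-` prefix are hypothetical.

```python
import os
import shutil
import tempfile


def make_metastore_dir():
    """Create a fresh directory that no other test run shares."""
    return tempfile.mkdtemp(prefix="hive-metastore-")


first = make_metastore_dir()
second = make_metastore_dir()

shutil.rmtree(first)                 # one suite cleans up its own directory...
still_there = os.path.isdir(second)  # ...without touching the other's data
shutil.rmtree(second)

print(first != second, still_there)
```

Because each directory name is unique, cleanup in one suite cannot remove data that another concurrently running suite still needs.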




[jira] [Updated] (DRILL-2125) Add input template file in the source files generated by freemarker

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2125:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 Add input template file in the source files generated by freemarker
 ---

 Key: DRILL-2125
 URL: https://issues.apache.org/jira/browse/DRILL-2125
 Project: Apache Drill
  Issue Type: Improvement
  Components: Tools, Build & Test
Reporter: Mehant Baid
Assignee: Mehant Baid
 Fix For: 1.0.0

 Attachments: DRILL-2125.patch


 Currently only some generated source files include information about which 
 template was used to create them. For better readability, and to make it easier 
 to modify the right template, it would be good to include which template was 
 used to generate the sources.
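The idea can be sketched as a generator step that prepends a header naming the template. This Python sketch only illustrates the pattern and is not the FreeMarker change in the attached patch: `render_with_header`, the header wording, and the template name `ExampleTemplate.java` are all hypothetical.

```python
def render_with_header(template_name, body):
    """Prefix generated source with a comment naming its template."""
    header = (
        "/*\n"
        " * Generated from template: {}\n"
        " * Edit the template, not this file.\n"
        " */\n"
    ).format(template_name)
    return header + body


generated = render_with_header("ExampleTemplate.java",
                               "public class Example {}\n")
print(generated)
```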




[jira] [Updated] (DRILL-2277) COUNT(*) should return 0 instead of an empty result set when there are no records

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2277:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 COUNT(*) should return 0 instead of an empty result set when there are no 
 records
 -

 Key: DRILL-2277
 URL: https://issues.apache.org/jira/browse/DRILL-2277
 Project: Apache Drill
  Issue Type: Bug
  Components: Functions - Drill
Reporter: Rahul Challapalli
Assignee: Mehant Baid
Priority: Critical
 Fix For: 1.0.0


 git.commit.id.abbrev=6676f2d
 Data Set :
 {code}
 {
  id:1
 }
 {code}
 Query :
 {code}
 select count(*) from `temp.json` where uid  1;
 ++
 |   EXPR$0   |
 ++
 ++
 {code}
 Postgres returns 0 in this case.
 Marking it as critical since we return an incorrect result
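The expected behaviour can be checked against any engine that follows standard SQL aggregate semantics. The Python sketch below uses SQLite as a stand-in (not Postgres, and not Drill); the table name `temp_json`, the explicit `uid` column, and the `>` comparison are assumptions made for the illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite needs the column declared; here uid simply stays NULL.
conn.execute("CREATE TABLE temp_json (id INTEGER, uid INTEGER)")
conn.execute("INSERT INTO temp_json (id) VALUES (1)")

# No row satisfies the filter, yet COUNT(*) without GROUP BY
# still produces exactly one row holding 0.
rows = conn.execute(
    "SELECT COUNT(*) FROM temp_json WHERE uid > 1"
).fetchall()
print(rows)
conn.close()
```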




[jira] [Updated] (DRILL-2425) Wrong results when identifier change cases within the same data file

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2425:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 Wrong results when identifier change cases within the same data file
 

 Key: DRILL-2425
 URL: https://issues.apache.org/jira/browse/DRILL-2425
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Affects Versions: 0.8.0
Reporter: Chun Chang
Assignee: Hanifi Gunes
Priority: Critical
 Fix For: 1.0.0


 #Fri Mar 06 16:51:10 EST 2015
 git.commit.id.abbrev=fb293ba
 I have the following JSON file in which one of the identifiers changes case:
 {code}
 [root@qa-node120 md-83]# hadoop fs -cat 
 /drill/testdata/complex_type/json/schema/a.json
 {SOURCE: ebm,msAddressIpv6Array: null}
 {SOURCE: ebm,msAddressIpv6Array: {msAddressIpv6_1:99.111.222.0, 
 msAddressIpv6_2:88.222.333.0}}
 {SOURCE: ebm,msAddressIpv6Array: {msAddressIpv6_1:99.111.222.1, 
 msAddressIpv6_2:88.222.333.1}}
 {SOURCE: ebm,msAddressIpv6Array: {msaddressipv6_1:99.111.222.2, 
 msAddressIpv6_2:88.222.333.2}}
 {code}
 Querying this file through Drill gives wrong results:
 {code}
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ select 
 t.msAddressIpv6Array.msAddressIpv6_1 as msAddressIpv6_1 from `schema/a.json` 
 t;
 +-+
 | msAddressIpv6_1 |
 +-+
 | null|
 | null|
 | null|
 | 99.111.222.2|
 +-+
 {code}
 plan:
 {code}
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ explain plan for select 
 t.msAddressIpv6Array.msAddressIpv6_1 as msAddressIpv6_1 from `schema/a.json` 
 t;
 +++
 |text|json|
 +++
 | 00-00Screen
 00-01  Project(msAddressIpv6_1=[ITEM($0, 'msAddressIpv6_1')])
 00-02Scan(groupscan=[EasyGroupScan 
 [selectionRoot=/drill/testdata/complex_type/json/schema/a.json, numFiles=1, 
 columns=[`msAddressIpv6Array`.`msAddressIpv6_1`], 
 files=[maprfs:/drill/testdata/complex_type/json/schema/a.json]]])
 {code}
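If identifiers are meant to match regardless of case, the lookup has to normalise field names before comparing them. The Python sketch below illustrates that assumption only; the report does not state the expected output, `get_field` is a hypothetical helper, and the string quoting in the sample rows is assumed.

```python
def get_field(record, name):
    """Look up `name` in a dict, ignoring the case of the keys."""
    if record is None:
        return None
    wanted = name.lower()
    for key, value in record.items():
        if key.lower() == wanted:
            return value
    return None


rows = [
    {"SOURCE": "ebm", "msAddressIpv6Array": None},
    {"SOURCE": "ebm", "msAddressIpv6Array": {
        "msAddressIpv6_1": "99.111.222.0", "msAddressIpv6_2": "88.222.333.0"}},
    {"SOURCE": "ebm", "msAddressIpv6Array": {
        "msAddressIpv6_1": "99.111.222.1", "msAddressIpv6_2": "88.222.333.1"}},
    {"SOURCE": "ebm", "msAddressIpv6Array": {
        "msaddressipv6_1": "99.111.222.2", "msAddressIpv6_2": "88.222.333.2"}},
]

result = [
    get_field(get_field(row, "msAddressIpv6Array"), "msAddressIpv6_1")
    for row in rows
]
print(result)
```

Under case-insensitive matching, only the first row (where the map itself is null) yields a null; the other three rows each return their address.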




[jira] [Updated] (DRILL-2171) Test framework throws IOOB for tests changing schema

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2171:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 Test framework throws IOOB for tests changing schema
 

 Key: DRILL-2171
 URL: https://issues.apache.org/jira/browse/DRILL-2171
 Project: Apache Drill
  Issue Type: Bug
  Components: Tools, Build & Test
Reporter: Hanifi Gunes
Assignee: Hanifi Gunes
 Fix For: 1.0.0


 I added a unit test as part of DRILL-1605 that resolves a problem with schema 
 change. Unfortunately, the test framework suffers from a similar problem, throwing 
 IOOB while trying to verify the results. 
 TestSchemaChange#testMultiFilesWithDifferentSchema is currently ignored until 
 a patch is available for this issue.




[jira] [Updated] (DRILL-2234) IOOB when streaming aggregate is on the left side of hash join

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2234:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 IOOB when streaming aggregate is on the left side of hash join
 --

 Key: DRILL-2234
 URL: https://issues.apache.org/jira/browse/DRILL-2234
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Relational Operators
Reporter: Mehant Baid
Assignee: Mehant Baid
 Fix For: 1.0.0


 This issue is similar to DRILL-2107. 
 It can be reproduced by enabling SwapJoinRule in DrillRuleSets and running 
 the following query:
 alter session set `planner.slice_target` = 1;
 alter session set `planner.enable_hashagg` = false;
 alter session set `planner.enable_streamagg` = true;
 select l_suppkey, sum(l_extendedprice)/sum(l_quantity) as avg_price 
 from cp.`tpch/lineitem.parquet` where l_orderkey in
 (select o_orderkey from cp.`tpch/orders.parquet` where o_custkey = 2) 
 group by l_suppkey having sum(l_extendedprice)/sum(l_quantity)  1850.0;




[jira] [Updated] (DRILL-2340) count(*) fails with subquery not containing limit

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2340:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 count(*) fails with subquery not containing limit
 -

 Key: DRILL-2340
 URL: https://issues.apache.org/jira/browse/DRILL-2340
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Reporter: Chun Chang
Assignee: Hanifi Gunes
Priority: Critical
 Fix For: 1.0.0


 #Wed Feb 25 17:07:31 EST 2015
 git.commit.id.abbrev=f7ef5ec
 count(*) with subquery containing limit works fine:
 {code}
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ> select count(*) from (select 
 t.soa[0] soa0, t.soa[1] soa1, t.soa[2] soa2 from `complex.json` t limit 
 20) tt;
 +------------+
 |   EXPR$0   |
 +------------+
 | 20         |
 +------------+
 {code}
 But if I remove the limit, the query fails with an IllegalStateException:
 {code}
 0: jdbc:drill:schema=dfs.drillTestDirComplexJ> select count(*) from (select 
 t.soa[0] soa0, t.soa[1] soa1, t.soa[2] soa2 from `complex.json` t) tt;
 +------------+
 |   EXPR$0   |
 +------------+
 Query failed: RemoteRpcException: Failure while running fragment., You tried 
 to do a batch data read operation when you were in a state of STOP.  You can 
 only do this type of operation when you are in a state of OK or 
 OK_NEW_SCHEMA. [ d3226020-a2b0-4497-948f-34ea2309ddb7 on 
 qa-node120.qa.lab:31010 ]
 [ d3226020-a2b0-4497-948f-34ea2309ddb7 on qa-node120.qa.lab:31010 ]
 java.lang.RuntimeException: java.sql.SQLException: Failure while executing 
 query.
   at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
   at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
   at sqlline.SqlLine.print(SqlLine.java:1809)
   at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
   at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
   at sqlline.SqlLine.dispatch(SqlLine.java:889)
   at sqlline.SqlLine.begin(SqlLine.java:763)
   at sqlline.SqlLine.start(SqlLine.java:498)
   at sqlline.SqlLine.main(SqlLine.java:460)
 {code}
 Here is the exception in drillbit.log:
 {code}
 2015-02-27 14:17:32,247 [2b0f1303-61ec-2350-4b62-b6b29d11c534:foreman] INFO  
 o.a.drill.exec.work.foreman.Foreman - State change requested.  PENDING --> 
 RUNNING
 2015-02-27 14:17:32,267 [2b0f1303-61ec-2350-4b62-b6b29d11c534:frag:0:0] ERROR 
 o.a.drill.exec.ops.FragmentContext - Fragment Context received failure.
 java.lang.IllegalStateException: Needed to be in state INIT or IN_FLOAT8 but 
 in mode IN_BIGINT
   at 
 org.apache.drill.exec.vector.complex.impl.SingleListWriter.float8(SingleListWriter.java:427)
  ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
   at 
 org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:418)
  ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
   at 
 org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:256)
  ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
   at 
 org.apache.drill.exec.vector.complex.fn.JsonReader.writeDataSwitch(JsonReader.java:208)
  ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
   at 
 org.apache.drill.exec.vector.complex.fn.JsonReader.writeToVector(JsonReader.java:182)
  ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
   at 
 org.apache.drill.exec.vector.complex.fn.JsonReader.write(JsonReader.java:156) 
 ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
   at 
 org.apache.drill.exec.store.easy.json.JSONRecordReader.next(JSONRecordReader.java:125)
  ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
   at 
 org.apache.drill.exec.physical.impl.ScanBatch.next(ScanBatch.java:165) 
 ~[drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
   at 
 org.apache.drill.exec.physical.impl.validate.IteratorValidatorBatchIterator.next(IteratorValidatorBatchIterator.java:118)
  [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
   at 
 org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:99)
  [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
   at 
 org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:89)
  [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
   at 
 org.apache.drill.exec.record.AbstractSingleRecordBatch.innerNext(AbstractSingleRecordBatch.java:51)
  [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
   at 
 org.apache.drill.exec.physical.impl.project.ProjectRecordBatch.innerNext(ProjectRecordBatch.java:134)
  [drill-java-exec-0.8.0-SNAPSHOT-rebuffed.jar:0.8.0-SNAPSHOT]
   at

[jira] [Updated] (DRILL-1782) Error in drill-root pom - Typo in excludeSubprojects

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-1782:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 Error in drill-root pom - Typo in excludeSubprojects
 

 Key: DRILL-1782
 URL: https://issues.apache.org/jira/browse/DRILL-1782
 Project: Apache Drill
  Issue Type: Bug
  Components: Tools, Build & Test
Affects Versions: 0.7.0
Reporter: Yash Sharma
Assignee: Mehant Baid
 Fix For: 1.0.0

 Attachments: DRILL-1782.patch







[jira] [Updated] (DRILL-2227) Using flatten in a project, filter (where in ...), and order by throws an ClassCastException

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2227:
-
Fix Version/s: (was: 0.9.0)
   1.0.0

 Using flatten in a project, filter (where in ...), and order by throws an 
 ClassCastException
 

 Key: DRILL-2227
 URL: https://issues.apache.org/jira/browse/DRILL-2227
 Project: Apache Drill
  Issue Type: Bug
  Components: Functions - Drill
Reporter: Rahul Challapalli
Assignee: Hanifi Gunes
 Fix For: 1.0.0

 Attachments: data.json, error.log


 git.commit.id.abbrev=3d863b5
 The query below fails:
 {code}
 select uid, flatten(lst_lst[0]) from `data.json` where 2 in (select 
 flatten(lst_lst[1]) from `data.json`) order by flatten(lst_lst[1]);
 Query failed: RemoteRpcException: Failure while running fragment., 
 org.apache.drill.exec.vector.BigIntVector cannot be cast to 
 org.apache.drill.exec.vector.RepeatedVector 
 {code}
 However, if we apply flatten to the same column in the select list, the 
 query succeeds:
 {code}
 0: jdbc:drill:schema=dfs_eea> select uid, flatten(lst_lst[1]) from 
 `data.json` where 2 in (select flatten(lst_lst[1]) from `data.json`) order by 
 flatten(lst_lst[1]);
 +------------+------------+
 |    uid     |   EXPR$1   |
 +------------+------------+
 | 1          | 2          |
 | 2          | 2          |
 | 1          | 3          |
 | 2          | 3          |
 | 1          | 4          |
 | 2          | 4          |
 | 1          | 5          |
 | 2          | 5          |
 | 1          | 6          |
 | 2          | 6          |
 +------------+------------+
 {code}




[jira] [Updated] (DRILL-2681) add sql functions docs (string, convert_to/from)

2015-04-06 Thread Kristine Hahn (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kristine Hahn updated DRILL-2681:
-
Description: Includes SQL scalar functions, maprdb plugin doc fixes, time 
zone limitations section, Query HBase doc fixes, fix broken links in 
contribution guidelines and contribution ideas, prelim. overview of config 
options

 add sql functions docs (string, convert_to/from)
 

 Key: DRILL-2681
 URL: https://issues.apache.org/jira/browse/DRILL-2681
 Project: Apache Drill
  Issue Type: Task
  Components: Documentation
Reporter: Kristine Hahn
Assignee: Kristine Hahn

 Includes SQL scalar functions, maprdb plugin doc fixes, time zone limitations 
 section, Query HBase doc fixes, fix broken links in contribution guidelines 
 and contribution ideas, prelim. overview of config options




[jira] [Updated] (DRILL-2684) Unique-ify view names in TestUnionAll & TestExampleQueries test suites

2015-04-06 Thread Hanifi Gunes (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hanifi Gunes updated DRILL-2684:

Assignee: Parth Chandra  (was: Hanifi Gunes)

 Unique-ify view names in TestUnionAll & TestExampleQueries test suites
 --

 Key: DRILL-2684
 URL: https://issues.apache.org/jira/browse/DRILL-2684
 Project: Apache Drill
  Issue Type: Bug
  Components: Tools, Build & Test
Reporter: Hanifi Gunes
Assignee: Parth Chandra
Priority: Blocker
 Fix For: 0.9.0


 Two test classes rely on the same view, region_name. One drops the view while 
 the other is working on it and suddenly execution fails. Bummer!




[jira] [Updated] (DRILL-2684) Unique-ify view names in TestUnionAll & TestExampleQueries test suites

2015-04-06 Thread Hanifi Gunes (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hanifi Gunes updated DRILL-2684:

Attachment: DRILL-2684.1.patch.txt

Use unique view names to avoid failing concurrent test runs.

 Unique-ify view names in TestUnionAll & TestExampleQueries test suites
 --

 Key: DRILL-2684
 URL: https://issues.apache.org/jira/browse/DRILL-2684
 Project: Apache Drill
  Issue Type: Bug
  Components: Tools, Build & Test
Reporter: Hanifi Gunes
Assignee: Parth Chandra
Priority: Blocker
 Fix For: 0.9.0

 Attachments: DRILL-2684.1.patch.txt


 Two test classes rely on the same view, region_name. One drops the view while 
 the other is working on it and suddenly execution fails. Bummer!
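
The collision described above can be avoided by giving each test class its own view name. What follows is only a minimal sketch in Python and not the content of DRILL-2684.1.patch.txt; the helper name unique_view_name and the random-suffix scheme are assumptions made for illustration.

```python
import uuid


def unique_view_name(base: str, owner: str) -> str:
    """Build a view name that is unlikely to collide across test classes.

    `base` is the shared logical name (for example "region_name") and
    `owner` is the test class that creates the view. Both parameters and
    the naming scheme are hypothetical.
    """
    # A short random suffix also keeps repeated runs of the same class apart.
    suffix = uuid.uuid4().hex[:8]
    return f"{base}_{owner}_{suffix}".lower()


# Each suite creates and drops only its own view.
union_all_view = unique_view_name("region_name", "TestUnionAll")
example_queries_view = unique_view_name("region_name", "TestExampleQueries")
```

With names like these, a drop issued by one suite cannot remove the view the other suite is still querying.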




[jira] [Commented] (DRILL-2039) use forks-per-CPU units in forkCount

2015-04-06 Thread Jacques Nadeau (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-2039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482137#comment-14482137
 ] 

Jacques Nadeau commented on DRILL-2039:
---

Should this be 0.5 per logical core? I assume that a quad-core machine with 
hyper-threading has 8 logical cores. However, we find that we hit timeouts 
with more than 4 threads on that type of machine.

 use forks-per-CPU units in forkCount
 

 Key: DRILL-2039
 URL: https://issues.apache.org/jira/browse/DRILL-2039
 Project: Apache Drill
  Issue Type: Improvement
  Components: Tools, Build & Test
Reporter: Daniel Barclay (Drill)
Assignee: Parth Chandra
Priority: Minor
 Fix For: 0.9.0

 Attachments: DRILL-2039.1.patch.txt


 Can Surefire's fork count be set using units of forks per CPU instead of the 
 absolute number of forks (i.e., {{<forkCount>1C</forkCount>}} instead of 
 {{<forkCount>4</forkCount>}})?
 That would let developers with more powerful machines still run tests in 
 parallel while letting others avoid or reduce test timeout errors without the 
 risk of forgetting to re-edit pom.xml after every switch to a different git 
 branch.
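
To make the per-CPU units concrete, here is a rough Python sketch of how a forkCount value such as 1C, 0.5C, or 4 could resolve to a number of forked JVMs. The function effective_forks is hypothetical and is not Surefire code; the rounding rule (truncate, but use at least one fork for a positive multiplier) is an assumption and should be checked against the Surefire documentation.

```python
def effective_forks(fork_count: str, cores: int) -> int:
    """Resolve a Surefire-style forkCount string to a number of forks.

    A trailing "C" means "multiply by the number of available cores";
    a plain integer is an absolute number of forks.
    """
    spec = fork_count.strip()
    if spec.upper().endswith("C"):
        scaled = float(spec[:-1]) * cores
        # Assumed rounding: truncate, with a floor of one fork when positive.
        return max(int(scaled), 1) if scaled > 0 else 0
    return int(spec)


# On a machine with 8 logical cores:
print(effective_forks("1C", 8))    # one fork per logical core
print(effective_forks("0.5C", 8))  # half a fork per logical core
print(effective_forks("4", 8))     # absolute value, independent of cores
```

Under these assumptions, 0.5C on 8 logical cores yields 4 forks, the same as the absolute setting of 4.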




[jira] [Commented] (DRILL-658) Add support for multiple distinct counts without a grouping key.

2015-04-06 Thread Victoria Markman (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14482235#comment-14482235
 ] 

Victoria Markman commented on DRILL-658:


Can we please try to fix this error for GA, if possible? This will have to be 
documented as a limitation in the aggregate functions section.

 Add support for multiple distinct counts without a grouping key.
 

 Key: DRILL-658
 URL: https://issues.apache.org/jira/browse/DRILL-658
 Project: Apache Drill
  Issue Type: New Feature
  Components: Query Planning & Optimization
Reporter: Chun Chang
Assignee: Jinfeng Ni
 Fix For: Future


 postgres:
 foodmart=# select count(distinct(c_int)), count(distinct(c_bigint)), 
 count(distinct(c_float4)), count(distinct(c_float8)) from data;
  count | count | count | count
 -------+-------+-------+-------
     21 |    23 |    22 |    20
 (1 row)
 drill:
 0: jdbc:drill:schema=dfs> select count(distinct(c_int)), 
 count(distinct(c_bigint)), count(distinct(c_float4)), 
 count(distinct(c_float8)) from data;
 Query failed: org.apache.drill.exec.rpc.RpcException: Remote failure while 
 running query.[error_id: 658d1c88-3968-457f-82e1-26c76ff7b54b
 endpoint {
   address: qa-node120.qa.lab
   user_port: 31010
   control_port: 31011
   data_port: 31012
 }
 error_type: 0
 message: Failure while parsing sql.  CannotPlanException:[ Node 
 [rel#5852:Subset#33.PHYSICAL.SINGLETON([]).[]] could not be implemented;




[jira] [Updated] (DRILL-2611) Fixed length value vectors report invalid value count

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2611:
-
Priority: Critical  (was: Major)

 Fixed length value vectors report invalid value count
 -

 Key: DRILL-2611
 URL: https://issues.apache.org/jira/browse/DRILL-2611
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Reporter: Hanifi Gunes
Assignee: Hanifi Gunes
Priority: Critical
 Fix For: 0.9.0


 We maintain a dedicated value count variable in fixed-length vectors; 
 however, we do not update it when set/setSafe is called. The accessor reports 
 the value count from this variable, ignoring values that are already in the 
 buffer or were written via set/setSafe. This causes execution failures that 
 manifest as IOOB when the underlying data is sparse. We should either remove 
 the variable and report the value count by inspecting the buffer directly, if 
 that is not computationally expensive, or update the variable each time we 
 write to the vector.
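
The mismatch described above can be illustrated with a toy model. The Python sketch below is not Drill's vector code: the class FixedWidthVector, its track_writes flag, and the rule that the value count equals the highest written index plus one are all assumptions chosen to show the second proposed fix (updating the counter on every write).

```python
class FixedWidthVector:
    """Toy fixed-width vector with a separately stored value count."""

    def __init__(self, capacity: int, track_writes: bool):
        self.buffer = [0] * capacity
        self.value_count = 0
        # When False, writes never touch value_count (the reported bug).
        self.track_writes = track_writes

    def set(self, index: int, value: int) -> None:
        self.buffer[index] = value
        if self.track_writes:
            # Assumed rule: the count covers everything up to the last write.
            self.value_count = max(self.value_count, index + 1)

    def get_value_count(self) -> int:
        # The accessor trusts the stored counter, not the buffer contents.
        return self.value_count


# Sparse write: only index 5 is populated.
stale = FixedWidthVector(8, track_writes=False)
stale.set(5, 42)

fixed = FixedWidthVector(8, track_writes=True)
fixed.set(5, 42)

print(stale.get_value_count())  # counter never updated by the write
print(fixed.get_value_count())  # counter follows the write
```

In the first case a consumer that sizes its reads from the reported count disagrees with what is actually in the buffer, which is the kind of inconsistency that can surface as an out-of-bounds access.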




[jira] [Updated] (DRILL-2611) Fixed length value vectors report invalid value count

2015-04-06 Thread Parth Chandra (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-2611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Parth Chandra updated DRILL-2611:
-
Fix Version/s: 0.9.0

 Fixed length value vectors report invalid value count
 -

 Key: DRILL-2611
 URL: https://issues.apache.org/jira/browse/DRILL-2611
 Project: Apache Drill
  Issue Type: Bug
  Components: Execution - Data Types
Reporter: Hanifi Gunes
Assignee: Hanifi Gunes
 Fix For: 0.9.0


 We maintain a dedicated value count variable in fixed-length vectors; 
 however, we do not update it when set/setSafe is called. The accessor reports 
 the value count from this variable, ignoring values that are already in the 
 buffer or were written via set/setSafe. This causes execution failures that 
 manifest as IOOB when the underlying data is sparse. We should either remove 
 the variable and report the value count by inspecting the buffer directly, if 
 that is not computationally expensive, or update the variable each time we 
 write to the vector.




[jira] [Created] (DRILL-2701) configuration options

2015-04-06 Thread Kristine Hahn (JIRA)

Kristine Hahn created DRILL-2701:


 Summary: configuration options
 Key: DRILL-2701
 URL: https://issues.apache.org/jira/browse/DRILL-2701
 Project: Apache Drill
  Issue Type: Task
  Components: Documentation
Reporter: Kristine Hahn
Assignee: Kristine Hahn






