[jira] [Commented] (DRILL-6253) HashAgg Unit Testing And Refactoring

2018-04-02 Thread Boaz Ben-Zvi (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423429#comment-16423429
 ] 

Boaz Ben-Zvi commented on DRILL-6253:
-

Comments added to the design doc ...

> HashAgg Unit Testing And Refactoring
> 
>
> Key: DRILL-6253
> URL: https://issues.apache.org/jira/browse/DRILL-6253
> Project: Apache Drill
>  Issue Type: Improvement
>Reporter: Timothy Farkas
>Assignee: Timothy Farkas
>Priority: Major
> Fix For: 1.14.0
>
>
> This is a parent issue to hold all the subtasks required to refactor HashAgg 
> to make it unit testable. Design doc
> https://docs.google.com/document/d/110BAWg3QXMfdmuqB0p3HuaoKpPGY-lqCRtHFxdh53Ds/edit?usp=sharing



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (DRILL-5977) predicate pushdown support kafkaMsgOffset

2018-04-02 Thread Pritesh Maker (JIRA)

 [ 
https://issues.apache.org/jira/browse/DRILL-5977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pritesh Maker reassigned DRILL-5977:


Assignee: Abhishek Ravi

> predicate pushdown support kafkaMsgOffset
> -
>
> Key: DRILL-5977
> URL: https://issues.apache.org/jira/browse/DRILL-5977
> Project: Apache Drill
>  Issue Type: Improvement
>Reporter: B Anil Kumar
>Assignee: Abhishek Ravi
>Priority: Major
> Fix For: 1.14.0
>
>
> As part of Kafka storage plugin review, below is the suggestion from Paul.
> {noformat}
> Does it make sense to provide a way to select a range of messages: a starting 
> point or a count? Perhaps I want to run my query every five minutes, scanning 
> only those messages since the previous scan. Or, I want to limit my take to, 
> say, the next 1000 messages. Could we use a pseudo-column such as 
> "kafkaMsgOffset" for that purpose? Maybe
> SELECT * FROM  WHERE kafkaMsgOffset > 12345
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-6202) Deprecate usage of IndexOutOfBoundsException to re-alloc vectors

2018-04-02 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423360#comment-16423360
 ] 

ASF GitHub Bot commented on DRILL-6202:
---

Github user vrozov commented on the issue:

https://github.com/apache/drill/pull/1144
  
It is not clear why get/set Byte/Char/Short/Int/Long/Float/Double do not 
delegate to UDLE, while get/set Bytes delegates to UDLE and relies on netty 
'AbstractByteBuf` for bounds checking. IMO, it will be good to have the 
behavior consistent for all methods.

In many cases including `VariableLengthVectors`, there is no need to rely 
on UDLE boundary checking as a caller already provides or can provide a 
guarantee that an index is within a buffer boundaries. In those cases, boundary 
check becomes an extra cost. IMO, it will be good to have a consistent behavior 
with ability to enable bounds checking for debugging.


> Deprecate usage of IndexOutOfBoundsException to re-alloc vectors
> 
>
> Key: DRILL-6202
> URL: https://issues.apache.org/jira/browse/DRILL-6202
> Project: Apache Drill
>  Issue Type: Bug
>Reporter: Vlad Rozov
>Assignee: Vlad Rozov
>Priority: Major
> Fix For: 1.14.0
>
>
> As bounds checking may be enabled or disabled, using 
> IndexOutOfBoundsException to resize vectors is unreliable. It works only when 
> bounds checking is enabled.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (DRILL-6287) apache-release profile should be disabled by default

2018-04-02 Thread Vlad Rozov (JIRA)

 [ 
https://issues.apache.org/jira/browse/DRILL-6287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vlad Rozov updated DRILL-6287:
--
Labels: ready-to-commit  (was: )

> apache-release profile should be disabled by default
> 
>
> Key: DRILL-6287
> URL: https://issues.apache.org/jira/browse/DRILL-6287
> Project: Apache Drill
>  Issue Type: Bug
>Reporter: Vlad Rozov
>Assignee: Vlad Rozov
>Priority: Minor
>  Labels: ready-to-commit
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-6272) Remove binary jars files from source distribution

2018-04-02 Thread Vlad Rozov (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423315#comment-16423315
 ] 

Vlad Rozov commented on DRILL-6272:
---

[~volodymyr.tkach] There is a difference between a json and a jar file. A json 
file represents original source submitted to ASF, while a jar file is a 
compilation from a source code.

> Remove binary jars files from source distribution
> -
>
> Key: DRILL-6272
> URL: https://issues.apache.org/jira/browse/DRILL-6272
> Project: Apache Drill
>  Issue Type: Task
>Reporter: Vlad Rozov
>Assignee: Arina Ielchiieva
>Priority: Critical
> Fix For: 1.14.0
>
>
> Per [~vrozov] the source distribution contains binary jar files under 
> exec/java-exec/src/test/resources/jars



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-4091) Support more functions in gis contrib module

2018-04-02 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-4091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16423150#comment-16423150
 ] 

ASF GitHub Bot commented on DRILL-4091:
---

Github user ChrisSandison commented on a diff in the pull request:

https://github.com/apache/drill/pull/258#discussion_r178652351
  
--- Diff: 
contrib/gis/src/main/java/org/apache/drill/exec/expr/fn/impl/gis/STUnionAggregate.java
 ---
@@ -0,0 +1,114 @@
+/**
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.drill.exec.expr.fn.impl.gis;
+
+import javax.inject.Inject;
+
+import org.apache.drill.exec.expr.DrillAggFunc;
+import org.apache.drill.exec.expr.annotations.FunctionTemplate;
+import org.apache.drill.exec.expr.annotations.Output;
+import org.apache.drill.exec.expr.annotations.Param;
+import org.apache.drill.exec.expr.annotations.Workspace;
+import org.apache.drill.exec.expr.holders.BigIntHolder;
+import org.apache.drill.exec.expr.holders.IntHolder;
+import org.apache.drill.exec.expr.holders.NullableVarBinaryHolder;
+import org.apache.drill.exec.expr.holders.ObjectHolder;
+import org.apache.drill.exec.expr.holders.UInt1Holder;
+
+import com.esri.core.geometry.SpatialReference;
+
+import io.netty.buffer.DrillBuf;
+
+@FunctionTemplate(name = "st_unionaggregate", scope = 
FunctionTemplate.FunctionScope.POINT_AGGREGATE)
--- End diff --

Is there documentation for that for aggregate functions?


> Support more functions in gis contrib module
> 
>
> Key: DRILL-4091
> URL: https://issues.apache.org/jira/browse/DRILL-4091
> Project: Apache Drill
>  Issue Type: Improvement
>  Components: Functions - Drill
>Reporter: Karol Potocki
>Assignee: Karol Potocki
>Priority: Major
>
> Support for commonly used gis functions in gis contrib module: relate, 
> contains, crosses, intersects, touches, difference, disjoint, buffer, union 
> etc.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Closed] (DRILL-6009) No drillbits on index page

2018-04-02 Thread Venkata Jyothsna Donapati (JIRA)

 [ 
https://issues.apache.org/jira/browse/DRILL-6009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Venkata Jyothsna Donapati closed DRILL-6009.

Resolution: Cannot Reproduce

> No drillbits on index page
> --
>
> Key: DRILL-6009
> URL: https://issues.apache.org/jira/browse/DRILL-6009
> Project: Apache Drill
>  Issue Type: Bug
>Reporter: Arina Ielchiieva
>Assignee: Venkata Jyothsna Donapati
>Priority: Minor
> Fix For: 1.14.0
>
> Attachments: empty_drillbits.JPG
>
>
> After DRILL-4286 once I saw that index page showed no drillbits at all but it 
> was working, so at least one drillbit was online (empty_drillbits.JPG). After 
> refresh everything was fine.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-4091) Support more functions in gis contrib module

2018-04-02 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-4091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422981#comment-16422981
 ] 

ASF GitHub Bot commented on DRILL-4091:
---

Github user ChrisSandison commented on a diff in the pull request:

https://github.com/apache/drill/pull/258#discussion_r178619522
  
--- Diff: 
contrib/gis/src/main/java/org/apache/drill/exec/expr/fn/impl/gis/STXFunc.java 
---
@@ -0,0 +1,64 @@
+/**
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.drill.exec.expr.fn.impl.gis;
+
+import java.sql.Types;
+
+import javax.inject.Inject;
+
+import org.apache.drill.exec.expr.DrillSimpleFunc;
+import org.apache.drill.exec.expr.annotations.FunctionTemplate;
+import org.apache.drill.exec.expr.annotations.Output;
+import org.apache.drill.exec.expr.annotations.Param;
+import org.apache.drill.exec.expr.holders.Float8Holder;
+import org.apache.drill.exec.expr.holders.VarBinaryHolder;
+
+import com.esri.core.geometry.Geometry.Type;
+import com.esri.core.geometry.ogc.OGCPoint;
+
+import io.netty.buffer.DrillBuf;
+
+@FunctionTemplate(name = "st_x", scope = 
FunctionTemplate.FunctionScope.SIMPLE,
+  nulls = FunctionTemplate.NullHandling.NULL_IF_NULL)
+public class STXFunc implements DrillSimpleFunc {
+  @Param
+  VarBinaryHolder geomParam;
+
+  @Output
+  Float8Holder out;
+
+  @Inject
+  DrillBuf buffer;
+
+  public void setup() {
+  }
+
+  public void eval() {
+
+com.esri.core.geometry.ogc.OGCGeometry geom;
+
+geom = com.esri.core.geometry.ogc.OGCGeometry
+.fromBinary(geomParam.buffer.nioBuffer(geomParam.start, 
geomParam.end - geomParam.start));
+
+if(geom != null && geom.geometryType().equals("Point")){
+  out.value = ((com.esri.core.geometry.ogc.OGCPoint) geom).X();
+} else {
+  out.value = Double.NaN;
--- End diff --

@cgivre it looks like the assigning of `NaN` is breaking the test suite. Is 
this the behaviour, or could this be the default null handling that is 
provided? Ditto for other times this is assigned


> Support more functions in gis contrib module
> 
>
> Key: DRILL-4091
> URL: https://issues.apache.org/jira/browse/DRILL-4091
> Project: Apache Drill
>  Issue Type: Improvement
>  Components: Functions - Drill
>Reporter: Karol Potocki
>Assignee: Karol Potocki
>Priority: Major
>
> Support for commonly used gis functions in gis contrib module: relate, 
> contains, crosses, intersects, touches, difference, disjoint, buffer, union 
> etc.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Comment Edited] (DRILL-6272) Remove binary jars files from source distribution

2018-04-02 Thread Volodymyr Tkach (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422931#comment-16422931
 ] 

Volodymyr Tkach edited comment on DRILL-6272 at 4/2/18 6:30 PM:


[~vrozov] those binaries serve us as data input for testing purposes, they are 
not intended to be executed/compiled by users. I think that this paragraph 
[http://www.apache.org/legal/release-policy.html#compiled-packages] is saying 
about binaries that users will execute manually or programmatically, but that 
is not our case, for us those files are  raw data, like json files for testing 
purposes.


was (Author: volodymyr.tkach):
[~vrozov] those binaries serve us as data input for testing purposes, they are 
not intended to be executed/compiled by users. I think that this paragraph 
[http://www.apache.org/legal/release-policy.html#compiled-packages] is saying 
about binaries that users will execute manually or programmatically, but that 
is not our case, for us those files just raw data, like json files for testing 
purposes.

> Remove binary jars files from source distribution
> -
>
> Key: DRILL-6272
> URL: https://issues.apache.org/jira/browse/DRILL-6272
> Project: Apache Drill
>  Issue Type: Task
>Reporter: Vlad Rozov
>Assignee: Arina Ielchiieva
>Priority: Critical
> Fix For: 1.14.0
>
>
> Per [~vrozov] the source distribution contains binary jar files under 
> exec/java-exec/src/test/resources/jars



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-6272) Remove binary jars files from source distribution

2018-04-02 Thread Volodymyr Tkach (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422931#comment-16422931
 ] 

Volodymyr Tkach commented on DRILL-6272:


[~vrozov] those binaries serve us as data input for testing purposes, they are 
not intended to be executed/compiled by users. I think that this paragraph 
[http://www.apache.org/legal/release-policy.html#compiled-packages] is saying 
about binaries that users will execute manually or programmatically, but that 
is not our case, for us those files just raw data, like json files for testing 
purposes.

> Remove binary jars files from source distribution
> -
>
> Key: DRILL-6272
> URL: https://issues.apache.org/jira/browse/DRILL-6272
> Project: Apache Drill
>  Issue Type: Task
>Reporter: Vlad Rozov
>Assignee: Arina Ielchiieva
>Priority: Critical
> Fix For: 1.14.0
>
>
> Per [~vrozov] the source distribution contains binary jar files under 
> exec/java-exec/src/test/resources/jars



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (DRILL-6306) Should not be able to run queries against disabled storage plugins

2018-04-02 Thread Krystal (JIRA)
Krystal created DRILL-6306:
--

 Summary: Should not be able to run queries against disabled 
storage plugins
 Key: DRILL-6306
 URL: https://issues.apache.org/jira/browse/DRILL-6306
 Project: Apache Drill
  Issue Type: Bug
  Components: Storage - Other
Affects Versions: 1.13.0
Reporter: Krystal


Currently, queries against disabled storage plugins are returning data.  This 
should not be the case.  Queries against disabled storage plugins should fail.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (DRILL-5977) predicate pushdown support kafkaMsgOffset

2018-04-02 Thread Kunal Khatua (JIRA)

 [ 
https://issues.apache.org/jira/browse/DRILL-5977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kunal Khatua reassigned DRILL-5977:
---

Assignee: (was: Bhallamudi Venkata Siva Kamesh)

> predicate pushdown support kafkaMsgOffset
> -
>
> Key: DRILL-5977
> URL: https://issues.apache.org/jira/browse/DRILL-5977
> Project: Apache Drill
>  Issue Type: Improvement
>Reporter: B Anil Kumar
>Priority: Major
> Fix For: 1.14.0
>
>
> As part of Kafka storage plugin review, below is the suggestion from Paul.
> {noformat}
> Does it make sense to provide a way to select a range of messages: a starting 
> point or a count? Perhaps I want to run my query every five minutes, scanning 
> only those messages since the previous scan. Or, I want to limit my take to, 
> say, the next 1000 messages. Could we use a pseudo-column such as 
> "kafkaMsgOffset" for that purpose? Maybe
> SELECT * FROM  WHERE kafkaMsgOffset > 12345
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-4547) Javadoc fails with Java8

2018-04-02 Thread Vitalii Diravka (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-4547?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422648#comment-16422648
 ] 

Vitalii Diravka commented on DRILL-4547:


So what is the next step for this Jira? 
I suppose it should be the refactoring of Drill java docs and moving onto Java8 
doclint. But the task is really minor.

> Javadoc fails with Java8
> 
>
> Key: DRILL-4547
> URL: https://issues.apache.org/jira/browse/DRILL-4547
> Project: Apache Drill
>  Issue Type: Bug
>  Components: Tools, Build  Test
>Affects Versions: 1.6.0
>Reporter: Laurent Goujon
>Assignee: Venkata Jyothsna Donapati
>Priority: Major
> Fix For: 1.14.0
>
>
> Javadoc cannot be generated when using Java8 (likely because the parser is 
> now more strict).
> Here's an example of issues when trying to generate javadocs in module 
> {{drill-fmpp-maven-plugin}}
> {noformat}
> [ERROR] Failed to execute goal 
> org.apache.maven.plugins:maven-javadoc-plugin:2.9.1:jar (attach-javadocs) on 
> project drill-fmpp-maven-plugin: MavenReportException: Error while creating 
> archive:
> [ERROR] Exit code: 1 - 
> /Users/laurent/devel/drill/tools/fmpp/src/main/java/org/apache/drill/fmpp/mojo/FMPPMojo.java:44:
>  error: unknown tag: goal
> [ERROR] * @goal generate
> [ERROR] ^
> [ERROR] 
> /Users/laurent/devel/drill/tools/fmpp/src/main/java/org/apache/drill/fmpp/mojo/FMPPMojo.java:45:
>  error: unknown tag: phase
> [ERROR] * @phase generate-sources
> [ERROR] ^
> [ERROR] 
> /Users/laurent/devel/drill/tools/fmpp/target/generated-sources/plugin/org/apache/drill/fmpp/mojo/HelpMojo.java:25:
>  error: unknown tag: goal
> [ERROR] * @goal help
> [ERROR] ^
> [ERROR] 
> /Users/laurent/devel/drill/tools/fmpp/target/generated-sources/plugin/org/apache/drill/fmpp/mojo/HelpMojo.java:26:
>  error: unknown tag: requiresProject
> [ERROR] * @requiresProject false
> [ERROR] ^
> [ERROR] 
> /Users/laurent/devel/drill/tools/fmpp/target/generated-sources/plugin/org/apache/drill/fmpp/mojo/HelpMojo.java:27:
>  error: unknown tag: threadSafe
> [ERROR] * @threadSafe
> [ERROR] ^
> [ERROR] 
> [ERROR] Command line was: 
> /Library/Java/JavaVirtualMachines/jdk1.8.0_72.jdk/Contents/Home/bin/javadoc 
> @options @packages
> [ERROR] 
> [ERROR] Refer to the generated Javadoc files in 
> '/Users/laurent/devel/drill/tools/fmpp/target/apidocs' dir.
> [ERROR] -> [Help 1]
> [ERROR] 
> [ERROR] To see the full stack trace of the errors, re-run Maven with the -e 
> switch.
> [ERROR] Re-run Maven using the -X switch to enable full debug logging.
> [ERROR] 
> [ERROR] For more information about the errors and possible solutions, please 
> read the following articles:
> [ERROR] [Help 1] 
> http://cwiki.apache.org/confluence/display/MAVEN/MojoExecutionException
> [ERROR] 
> [ERROR] After correcting the problems, you can resume the build with the 
> command
> [ERROR]   mvn  -rf :drill-fmpp-maven-plugin
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-6287) apache-release profile should be disabled by default

2018-04-02 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422641#comment-16422641
 ] 

ASF GitHub Bot commented on DRILL-6287:
---

Github user vdiravka commented on a diff in the pull request:

https://github.com/apache/drill/pull/1182#discussion_r178572285
  
--- Diff: pom.xml ---
@@ -66,6 +66,7 @@
 
 4096
 4096
+-Xdoclint:none
--- End diff --

Thanks


> apache-release profile should be disabled by default
> 
>
> Key: DRILL-6287
> URL: https://issues.apache.org/jira/browse/DRILL-6287
> Project: Apache Drill
>  Issue Type: Bug
>Reporter: Vlad Rozov
>Assignee: Vlad Rozov
>Priority: Minor
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Comment Edited] (DRILL-6272) Remove binary jars files from source distribution

2018-04-02 Thread Vlad Rozov (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422629#comment-16422629
 ] 

Vlad Rozov edited comment on DRILL-6272 at 4/2/18 3:40 PM:
---

[~arina] Please see 
http://www.apache.org/legal/release-policy.html#compiled-packages and 
http://www.apache.org/dev/release-publishing.html#valid: ??The Apache Software 
Foundation exists to create open source software. Thus, the fundamental 
requirement for a release is that it consist of the necessary *source code* to 
build the project. Optionally, a release may also be accompanied by *compiled 
binaries* for the convenience of users.??


was (Author: vrozov):
[~arina] Please see http://www.apache.org/dev/release-publishing.html#valid: 
??The Apache Software Foundation exists to create open source software. Thus, 
the fundamental requirement for a release is that it consist of the necessary 
*source code* to build the project. Optionally, a release may also be 
accompanied by *compiled binaries* for the convenience of users.??

> Remove binary jars files from source distribution
> -
>
> Key: DRILL-6272
> URL: https://issues.apache.org/jira/browse/DRILL-6272
> Project: Apache Drill
>  Issue Type: Task
>Reporter: Vlad Rozov
>Assignee: Arina Ielchiieva
>Priority: Critical
> Fix For: 1.14.0
>
>
> Per [~vrozov] the source distribution contains binary jar files under 
> exec/java-exec/src/test/resources/jars



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-6272) Remove binary jars files from source distribution

2018-04-02 Thread Vlad Rozov (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422629#comment-16422629
 ] 

Vlad Rozov commented on DRILL-6272:
---

[~arina] Please see http://www.apache.org/dev/release-publishing.html#valid: 
??The Apache Software Foundation exists to create open source software. Thus, 
the fundamental requirement for a release is that it consist of the necessary 
*source code* to build the project. Optionally, a release may also be 
accompanied by *compiled binaries* for the convenience of users.??

> Remove binary jars files from source distribution
> -
>
> Key: DRILL-6272
> URL: https://issues.apache.org/jira/browse/DRILL-6272
> Project: Apache Drill
>  Issue Type: Task
>Reporter: Vlad Rozov
>Assignee: Arina Ielchiieva
>Priority: Critical
> Fix For: 1.14.0
>
>
> Per [~vrozov] the source distribution contains binary jar files under 
> exec/java-exec/src/test/resources/jars



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-6287) apache-release profile should be disabled by default

2018-04-02 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422616#comment-16422616
 ] 

ASF GitHub Bot commented on DRILL-6287:
---

Github user vrozov commented on a diff in the pull request:

https://github.com/apache/drill/pull/1182#discussion_r178567909
  
--- Diff: pom.xml ---
@@ -66,6 +66,7 @@
 
 4096
 4096
+-Xdoclint:none
--- End diff --

@vdiravka Please see DRILL-4547.


> apache-release profile should be disabled by default
> 
>
> Key: DRILL-6287
> URL: https://issues.apache.org/jira/browse/DRILL-6287
> Project: Apache Drill
>  Issue Type: Bug
>Reporter: Vlad Rozov
>Assignee: Vlad Rozov
>Priority: Minor
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-6287) apache-release profile should be disabled by default

2018-04-02 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422590#comment-16422590
 ] 

ASF GitHub Bot commented on DRILL-6287:
---

Github user vdiravka commented on a diff in the pull request:

https://github.com/apache/drill/pull/1182#discussion_r178561972
  
--- Diff: pom.xml ---
@@ -66,6 +66,7 @@
 
 4096
 4096
+-Xdoclint:none
--- End diff --

Do we need a task (new Jira) for refactoring the Drill java docs and moving 
onto Java 8 doclint?


> apache-release profile should be disabled by default
> 
>
> Key: DRILL-6287
> URL: https://issues.apache.org/jira/browse/DRILL-6287
> Project: Apache Drill
>  Issue Type: Bug
>Reporter: Vlad Rozov
>Assignee: Vlad Rozov
>Priority: Minor
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-6287) apache-release profile should be disabled by default

2018-04-02 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422560#comment-16422560
 ] 

ASF GitHub Bot commented on DRILL-6287:
---

Github user parthchandra commented on the issue:

https://github.com/apache/drill/pull/1182
  
Sorry, Maven not being a strong point, I didn't understand initially what I 
was looking at.

+1



> apache-release profile should be disabled by default
> 
>
> Key: DRILL-6287
> URL: https://issues.apache.org/jira/browse/DRILL-6287
> Project: Apache Drill
>  Issue Type: Bug
>Reporter: Vlad Rozov
>Assignee: Vlad Rozov
>Priority: Minor
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-6287) apache-release profile should be disabled by default

2018-04-02 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422255#comment-16422255
 ] 

ASF GitHub Bot commented on DRILL-6287:
---

Github user vrozov commented on the issue:

https://github.com/apache/drill/pull/1182
  
There are two issues with enabling `apache-release` by default:
-  it triggers creating source `apache-drill-...-src.tar.gz` and 
`apache-drill-...-src.zip` archives.
- maven build for any sub-module fails.

The change disables activation of the `apache-release` profile based on JDK 
version and requires explicit activation during the Apache release process.

JDK 1.7 is not supported. See DRILL-1491 and #1143.


> apache-release profile should be disabled by default
> 
>
> Key: DRILL-6287
> URL: https://issues.apache.org/jira/browse/DRILL-6287
> Project: Apache Drill
>  Issue Type: Bug
>Reporter: Vlad Rozov
>Assignee: Vlad Rozov
>Priority: Minor
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-6016) Error reading INT96 created by Apache Spark

2018-04-02 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422075#comment-16422075
 ] 

ASF GitHub Bot commented on DRILL-6016:
---

Github user parthchandra commented on the issue:

https://github.com/apache/drill/pull/1166
  
@rajrahul thanks for making all the changes (and of course for the fix)!


> Error reading INT96 created by Apache Spark
> ---
>
> Key: DRILL-6016
> URL: https://issues.apache.org/jira/browse/DRILL-6016
> Project: Apache Drill
>  Issue Type: Bug
>Affects Versions: 1.13.0
>Reporter: Rahul Raj
>Assignee: Rahul Raj
>Priority: Major
>  Labels: ready-to-commit
> Fix For: 1.14.0
>
>
> Hi,
> I am getting the error - SYSTEM ERROR : ClassCastException: 
> org.apache.drill.exec.vector.TimeStampVector cannot be cast to 
> org.apache.drill.exec.vector.VariableWidthVector while trying to read a spark 
> INT96 datetime field on Drill 1.11 in spite of setting the property 
> store.parquet.reader.int96_as_timestamp to  true.
> I believe this was fixed in drill 
> 1.10(https://issues.apache.org/jira/browse/DRILL-4373). What could be wrong.
> I have attached the dataset at 
> https://github.com/rajrahul/files/blob/master/result.tar.gz



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-6223) Drill fails on Schema changes

2018-04-02 Thread Parth Chandra (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422069#comment-16422069
 ] 

Parth Chandra commented on DRILL-6223:
--

{quote}Sorry to say, I still disagree with this statement: "This pull request 
adds logic to detect and eliminate dangling columns".
{quote}
Adding some more background -

[~paul-rogers] has this exactly right. At the high level, dropping any columns 
for star queries is not the solution as a star queries explicitly asks for all 
columns.

More importantly, as Paul pointed out, the solution (in some cases) is to 
provide a schema.

For Parquet (or any other source that has schema information), we might solve 
the problem by creating a union of the columns (and also take care of type 
changes by promoting columns to a common type). Parquet currently does not do 
that. Worse, it overrides the schema seen so far with the new schema when a 
change is encountered (I would suggest, once again, a perusal of the Parquet 
metadata cache code). This means in the presence of schema change with Parquet 
files, even the planning gets off on the wrong foot.

The only solution, again, as Paul mentioned, is to provide a composite schema 
by inferring it or asking the user to provide one. The latter is hard because 
sometimes the user does not have the schema or because it is mutating all the 
time (it happens when people move fast and break things). This is exactly when 
they want to use Drill, and, in fact, this was a primary use case for the early 
design of Drill. As a design constraint, we had to assume that we could not 
know the schema until runtime (we may have taken it too far :( ).

We could try to infer the schema by doing a complete scan but that has the same 
issues as asking the user to specify as schema and, in addition, can take 
really really long.

 

 

 

 

> Drill fails on Schema changes 
> --
>
> Key: DRILL-6223
> URL: https://issues.apache.org/jira/browse/DRILL-6223
> Project: Apache Drill
>  Issue Type: Improvement
>  Components: Execution - Relational Operators
>Affects Versions: 1.10.0, 1.12.0
>Reporter: salim achouche
>Assignee: salim achouche
>Priority: Major
> Fix For: 1.14.0
>
>
> Drill Query Failing when selecting all columns from a Complex Nested Data 
> File (Parquet) Set). There are differences in Schema among the files:
>  * The Parquet files exhibit differences both at the first level and within 
> nested data types
>  * A select * will not cause an exception but using a limit clause will
>  * Note also this issue seems to happen only when multiple Drillbit minor 
> fragments are involved (concurrency higher than one)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-6009) No drillbits on index page

2018-04-02 Thread Arina Ielchiieva (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422046#comment-16422046
 ] 

Arina Ielchiieva commented on DRILL-6009:
-

[~vdonapati] as I have mentioned I saw this scenario only once, so I guess if 
you cannot reproduce it, just close Jira with won't reproduce. If issue pops up 
again, we'll re-open it.

> No drillbits on index page
> --
>
> Key: DRILL-6009
> URL: https://issues.apache.org/jira/browse/DRILL-6009
> Project: Apache Drill
>  Issue Type: Bug
>Reporter: Arina Ielchiieva
>Assignee: Venkata Jyothsna Donapati
>Priority: Minor
> Fix For: 1.14.0
>
> Attachments: empty_drillbits.JPG
>
>
> After DRILL-4286 once I saw that index page showed no drillbits at all but it 
> was working, so at least one drillbit was online (empty_drillbits.JPG). After 
> refresh everything was fine.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-6223) Drill fails on Schema changes

2018-04-02 Thread Parth Chandra (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422042#comment-16422042
 ] 

Parth Chandra commented on DRILL-6223:
--

{quote}To your point about compensation logic in the context of Schema Changes
{quote} * 
{quote}Why do you think it is ok to dynamically include new columns?{quote}
 * 
{quote}Yet it is not ok to exclude them?{quote}

Usually, in real world data with dynamically changing schema's, new columns are 
added and not removed. 
 * 
{quote}Consider a batch of 32k rows{quote}
 * 
{quote}A VV with null integer values will require 32kb (bits) + 32kb * 4 = 
160kb{quote}
 * 
{quote}Each missing column will require that much memory per 
mini-fragment{quote}

One of the guarantees provided by value vectors is that elements can be 
accessed by index in constant time (or, in the case of nested elements in O(m) 
where m is the level of nesting) . The representation is based on providing 
this guarantee. It comes at the cost of additional memory usage, which is a 
deliberate tradeoff.
{quote}This is unless (similarly to the implicit columns) we optimize the VV 
storage representation or / and push the column preservation to higher layers 
such as the client or foreman
{quote}
It would be wonderful to improve vectors to use much less memory while 
providing the same guarantees. A proposal would be welcome. 

 

> Drill fails on Schema changes 
> --
>
> Key: DRILL-6223
> URL: https://issues.apache.org/jira/browse/DRILL-6223
> Project: Apache Drill
>  Issue Type: Improvement
>  Components: Execution - Relational Operators
>Affects Versions: 1.10.0, 1.12.0
>Reporter: salim achouche
>Assignee: salim achouche
>Priority: Major
> Fix For: 1.14.0
>
>
> Drill Query Failing when selecting all columns from a Complex Nested Data 
> File (Parquet) Set). There are differences in Schema among the files:
>  * The Parquet files exhibit differences both at the first level and within 
> nested data types
>  * A select * will not cause an exception but using a limit clause will
>  * Note also this issue seems to happen only when multiple Drillbit minor 
> fragments are involved (concurrency higher than one)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-6287) apache-release profile should be disabled by default

2018-04-02 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16421992#comment-16421992
 ] 

ASF GitHub Bot commented on DRILL-6287:
---

Github user parthchandra commented on the issue:

https://github.com/apache/drill/pull/1182
  
I don't understand why the apache-release be disabled by default. And I 
don't see how this change achieves that anyway.

Also, moving -Xdoclint:none to all profiles implies we are no longer 
supporting development using JDK7 ? I'm OK with that, but not sure if we 
concluded that at the time of the 1.13 release.

If that's what we want to do, I'm fine with this change.



> apache-release profile should be disabled by default
> 
>
> Key: DRILL-6287
> URL: https://issues.apache.org/jira/browse/DRILL-6287
> Project: Apache Drill
>  Issue Type: Bug
>Reporter: Vlad Rozov
>Assignee: Vlad Rozov
>Priority: Minor
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (DRILL-6272) Remove binary jars files from source distribution

2018-04-02 Thread Arina Ielchiieva (JIRA)

 [ 
https://issues.apache.org/jira/browse/DRILL-6272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arina Ielchiieva reassigned DRILL-6272:
---

Assignee: Arina Ielchiieva  (was: Volodymyr Tkach)

> Remove binary jars files from source distribution
> -
>
> Key: DRILL-6272
> URL: https://issues.apache.org/jira/browse/DRILL-6272
> Project: Apache Drill
>  Issue Type: Task
>Reporter: Vlad Rozov
>Assignee: Arina Ielchiieva
>Priority: Critical
> Fix For: 1.14.0
>
>
> Per [~vrozov] the source distribution contains binary jar files under 
> exec/java-exec/src/test/resources/jars



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-6272) Remove binary jars files from source distribution

2018-04-02 Thread Arina Ielchiieva (JIRA)

[ 
https://issues.apache.org/jira/browse/DRILL-6272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16421975#comment-16421975
 ] 

Arina Ielchiieva commented on DRILL-6272:
-

[~vrozov] these files are needed for Dynamic UDF support tests (when we 
dynamically load functions into Drill). For that purpose we need both source 
and binary files present. We cannot compile them during Drill build since they 
are created from separate project, neither they can be pulled from maven repo, 
since this project is not published.
Example of the project: https://github.com/arina-ielchiieva/drillUDF. 
Also can you please point to Apache documentation where it is indicated that we 
cannot have such files in release.


> Remove binary jars files from source distribution
> -
>
> Key: DRILL-6272
> URL: https://issues.apache.org/jira/browse/DRILL-6272
> Project: Apache Drill
>  Issue Type: Task
>Reporter: Vlad Rozov
>Assignee: Volodymyr Tkach
>Priority: Critical
> Fix For: 1.14.0
>
>
> Per [~vrozov] the source distribution contains binary jar files under 
> exec/java-exec/src/test/resources/jars



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)