[Impala-ASF-CR] IMPALA-7929: Allow null qualifier in THBaseFilter

2019-02-07 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12213 )

Change subject: IMPALA-7929: Allow null qualifier in THBaseFilter
..


Patch Set 8:

> Change has been successfully rebased and submitted as
 > 06d078e76bad6944ad487fd432ff6d0aa9174960 by Tim Armstrong

Thanks so much again Joe, Tim and Paul!


--
To view, visit http://gerrit.cloudera.org:8080/12213
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
Gerrit-Change-Number: 12213
Gerrit-PatchSet: 8
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Fri, 08 Feb 2019 01:17:10 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7929: Allow null qualifier in THBaseFilter

2019-02-07 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12213 )

Change subject: IMPALA-7929: Allow null qualifier in THBaseFilter
..


Patch Set 7:

> Uploaded patch set 7: Commit message was updated.

Noticed a duplicate word in the commit message. Submitted the patch again with 
the duplicate word removed.


--
To view, visit http://gerrit.cloudera.org:8080/12213
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
Gerrit-Change-Number: 12213
Gerrit-PatchSet: 7
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Thu, 07 Feb 2019 20:20:35 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7929: Allow null qualifier in THBaseFilter

2019-02-07 Thread Yongjun Zhang (Code Review)
Hello Paul Rogers, Tim Armstrong, Joe McDonnell, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/12213

to look at the new patch set (#7).

Change subject: IMPALA-7929: Allow null qualifier in THBaseFilter
..

IMPALA-7929: Allow null qualifier in THBaseFilter

Impala query failed with "IntenalException: Required field 'qualifier'
was not present!" on table created via hive and mapped to HBase, because
the qualifier of the HBase key column is null in the mapped table, and
Impala required non-null qualifier. The fix here is to relax this
requirement.

Test:
Added unit test.
Tested in real cluster.

Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
---
M common/thrift/PlanNodes.thrift
M fe/src/main/java/org/apache/impala/planner/HBaseScanNode.java
A testdata/workloads/functional-query/queries/QueryTest/hbase-col-filter.test
M tests/query_test/test_hbase_queries.py
4 files changed, 70 insertions(+), 4 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/13/12213/7
--
To view, visit http://gerrit.cloudera.org:8080/12213
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
Gerrit-Change-Number: 12213
Gerrit-PatchSet: 7
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-7929: Allow null qualifier in THBaseFilter

2019-02-07 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12213 )

Change subject: IMPALA-7929: Allow null qualifier in THBaseFilter
..


Patch Set 6:

> Thanks for fixing this!

Thank Joe, Tim and Paul a lot for the review!


--
To view, visit http://gerrit.cloudera.org:8080/12213
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
Gerrit-Change-Number: 12213
Gerrit-PatchSet: 6
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Thu, 07 Feb 2019 19:04:07 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7929: Allow null qualifier in THBaseFilter

2019-02-07 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12213 )

Change subject: IMPALA-7929: Allow null qualifier in THBaseFilter
..


Patch Set 6:

Uploaded patch set 6 that removed an extra empty line.


--
To view, visit http://gerrit.cloudera.org:8080/12213
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
Gerrit-Change-Number: 12213
Gerrit-PatchSet: 6
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Thu, 07 Feb 2019 17:14:05 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7929: Allow null qualifier in THBaseFilter

2019-02-07 Thread Yongjun Zhang (Code Review)
Hello Paul Rogers, Tim Armstrong, Joe McDonnell, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/12213

to look at the new patch set (#6).

Change subject: IMPALA-7929: Allow null qualifier in THBaseFilter
..

IMPALA-7929: Allow null qualifier in THBaseFilter

Impala query failed with "IntenalException: Required field 'qualifier'
was not present!" on table created via hive and mapped to HBase, because
the qualifier of the HBase key key column is null in the mapped table,
and Impala required non-null qualifier. The fix here is to relax this
requirement.

Test:
Added unit test.
Tested in real cluster.

Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
---
M common/thrift/PlanNodes.thrift
M fe/src/main/java/org/apache/impala/planner/HBaseScanNode.java
A testdata/workloads/functional-query/queries/QueryTest/hbase-col-filter.test
M tests/query_test/test_hbase_queries.py
4 files changed, 70 insertions(+), 4 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/13/12213/6
--
To view, visit http://gerrit.cloudera.org:8080/12213
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
Gerrit-Change-Number: 12213
Gerrit-PatchSet: 6
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-7929: Allow null qualifier in THBaseFilter

2019-02-06 Thread Yongjun Zhang (Code Review)
Hello Paul Rogers, Tim Armstrong, Joe McDonnell, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/12213

to look at the new patch set (#5).

Change subject: IMPALA-7929: Allow null qualifier in THBaseFilter
..

IMPALA-7929: Allow null qualifier in THBaseFilter

Impala query failed with "IntenalException: Required field 'qualifier'
was not present!" on table created via hive and mapped to HBase, because
the qualifier of the HBase key key column is null in the mapped table,
and Impala required non-null qualifier. The fix here is to relax this
requirement.

Test:
Added unit test.
Tested in real cluster.

Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
---
M common/thrift/PlanNodes.thrift
M fe/src/main/java/org/apache/impala/planner/HBaseScanNode.java
A testdata/workloads/functional-query/queries/QueryTest/hbase-col-filter.test
M tests/query_test/test_hbase_queries.py
4 files changed, 71 insertions(+), 4 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/13/12213/5
--
To view, visit http://gerrit.cloudera.org:8080/12213
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
Gerrit-Change-Number: 12213
Gerrit-PatchSet: 5
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-7929: Allow null qualifier in THBaseFilter

2019-02-06 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12213 )

Change subject: IMPALA-7929: Allow null qualifier in THBaseFilter
..


Patch Set 5:

(6 comments)

HI Joe and Paul,

Thanks a lot for the review. Sorry for late response. I just uploaded new rev 
to address all of them. Added another new rev to address two cosmetic thing 
(indentation and extra empty line). Hope it looks good.

Would you please take a look at the new rev? thanks.

http://gerrit.cloudera.org:8080/#/c/12213/3//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/12213/3//COMMIT_MSG@7
PS3, Line 7: IMPALA-7929:
> We always use ":" after the JIRA in our commit titles. i.e.
Thanks Joe. Addressed in new rev.


http://gerrit.cloudera.org:8080/#/c/12213/3//COMMIT_MSG@9
PS3, Line 9: Impala query failed with "IntenalException: Required field 
'qualifier'
   : was not present!" on table created via hive and mapped to HBase, 
because
   : the qualifier of the HBase key key column is null in the mapped 
table,
   : and Impala required non-null q
> Please wrap commit messages at about 70-75 characters. "git log" adds about
Addressed in new rev.


http://gerrit.cloudera.org:8080/#/c/12213/3/common/thrift/PlanNodes.thrift
File common/thrift/PlanNodes.thrift:

http://gerrit.cloudera.org:8080/#/c/12213/3/common/thrift/PlanNodes.thrift@287
PS3, Line 287:   // The qualifier for HBase K
> Please add a comment describing why this is optional.
Addressed in new rev.


http://gerrit.cloudera.org:8080/#/c/12213/3/fe/src/main/java/org/apache/impala/planner/HBaseScanNode.java
File fe/src/main/java/org/apache/impala/planner/HBaseScanNode.java:

http://gerrit.cloudera.org:8080/#/c/12213/3/fe/src/main/java/org/apache/impala/planner/HBaseScanNode.java@285
PS3, Line 285: nalyzer);
 : FeHBaseTable tbl = (FeHBaseTable) desc_.getTable();
 :
> A couple notes on this comment:
Addressed in new rev.


http://gerrit.cloudera.org:8080/#/c/12213/3/fe/src/main/java/org/apache/impala/planner/HBaseScanNode.java@285
PS3, Line 285: nalyzer);
 : FeHBaseTable tbl = (FeHBaseTable) desc_.getTable();
 :
> Well said.
Agreed.


http://gerrit.cloudera.org:8080/#/c/12213/3/tests/query_test/test_hbase_queries.py
File tests/query_test/test_hbase_queries.py:

http://gerrit.cloudera.org:8080/#/c/12213/3/tests/query_test/test_hbase_queries.py@94
PS3, Line 94:   self.run_stmt_in_hive(del_table)
:
:
:
:
:
:
:
> I would prefer that this use a .test file and run_test_case() to verify the
Addressed in new rev.



--
To view, visit http://gerrit.cloudera.org:8080/12213
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
Gerrit-Change-Number: 12213
Gerrit-PatchSet: 5
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Thu, 07 Feb 2019 04:18:28 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7929: Allow null qualifier in THBaseFilter

2019-02-06 Thread Yongjun Zhang (Code Review)
Hello Paul Rogers, Tim Armstrong, Joe McDonnell, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/12213

to look at the new patch set (#4).

Change subject: IMPALA-7929: Allow null qualifier in THBaseFilter
..

IMPALA-7929: Allow null qualifier in THBaseFilter

Impala query failed with "IntenalException: Required field 'qualifier'
was not present!" on table created via hive and mapped to HBase, because
the qualifier of the HBase key key column is null in the mapped table,
and Impala required non-null qualifier. The fix here is to relax this
requirement.

Test:
Added unit test.
Tested in real cluster.

Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
---
M common/thrift/PlanNodes.thrift
M fe/src/main/java/org/apache/impala/planner/HBaseScanNode.java
A testdata/workloads/functional-query/queries/QueryTest/hbase-col-filter.test
M tests/query_test/test_hbase_queries.py
4 files changed, 73 insertions(+), 4 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/13/12213/4
--
To view, visit http://gerrit.cloudera.org:8080/12213
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
Gerrit-Change-Number: 12213
Gerrit-PatchSet: 4
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-7929. Impala query on HBASE table failing with InternalException.

2019-01-19 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12213 )

Change subject: IMPALA-7929. Impala query on HBASE table failing with 
InternalException.
..


Patch Set 2:

I found that even if I comment out the code at

https://github.com/apache/impala/blob/1c94450ca92606fb6b708de2ea07445cc6610dbf/be/src/exec/hbase-table-scanner.cc#L394

and it still works, which seems mysterious. After some study, I found out why:  
when no filters are passed to hbase, impala get the all rows from hbase, then 
impala evaluate the predicates itself
See
https://github.infra.cloudera.com/CDH/Impala/blob/cdh6.x/be/src/exec/hbase-scan-node.cc#L252

This does the filtering for the predicates that involve column that has no 
qualifier. It's not very efficient, because we get back all rows from hbase 
scan, and filter out majority. It would be nice to have hbase to filter before 
returning to impala.

Given this understanding, I think We can create a separate jira to address this 
performance issue, and push the fix forward.

Thanks.


--
To view, visit http://gerrit.cloudera.org:8080/12213
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
Gerrit-Change-Number: 12213
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Sun, 20 Jan 2019 05:33:50 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7929. Impala query on HBASE table failing with InternalException.

2019-01-14 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12213 )

Change subject: IMPALA-7929. Impala query on HBASE table failing with 
InternalException.
..


Patch Set 2:

> > (1 comment)
 >
 > HI Joe, thanks for the very good observation. I did some checking,
 > indeed the code you referred to skip this kind of entries. However,
 > the fact that the tests I added get the right results. Need to
 > understand why. Thanks.

I figured out that for column mapped to hbase key, the filter is added here
https://github.com/apache/impala/blob/1c94450ca92606fb6b708de2ea07445cc6610dbf/be/src/exec/hbase-table-scanner.cc#L394
I think this could explain why it still works for my unit test even though the 
filters are skipped in the code blocks before this one.


--
To view, visit http://gerrit.cloudera.org:8080/12213
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
Gerrit-Change-Number: 12213
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Mon, 14 Jan 2019 22:44:26 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7929. Impala query on HBASE table failing with InternalException.

2019-01-14 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12213 )

Change subject: IMPALA-7929. Impala query on HBASE table failing with 
InternalException.
..


Patch Set 2:

> (1 comment)

HI Joe, thanks for the very good observation. I did some checking, indeed the 
code you referred to skip this kind of entries. However, the fact that the 
tests I added get the right results. Need to understand why. Thanks.


--
To view, visit http://gerrit.cloudera.org:8080/12213
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
Gerrit-Change-Number: 12213
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Mon, 14 Jan 2019 16:24:17 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7929. Impala query on HBASE table failing with InternalException.

2019-01-11 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12213 )

Change subject: IMPALA-7929. Impala query on HBASE table failing with 
InternalException.
..


Patch Set 2:

(8 comments)

http://gerrit.cloudera.org:8080/#/c/12213/1/common/thrift/PlanNodes.thrift
File common/thrift/PlanNodes.thrift:

http://gerrit.cloudera.org:8080/#/c/12213/1/common/thrift/PlanNodes.thrift@287
PS1, Line 287:   2: optional string qualifier
> We write this in Java and read it in C++ (hbase-table-scanner.cc). In Java,
Thanks Joe, I thought once we make something optional in the middle, the ones 
that follow it need to be too. But it seems I was wrong about that. I changed 
it in the new rev, such that only qualifier is optional.
In C++ world, I saw that qualifier is by default constructed as empty string. I 
also observed the same from here 
https://github.com/gdb/impala/blob/master/be/src/exec/hbase-table-scanner.cc#L315


http://gerrit.cloudera.org:8080/#/c/12213/1/common/thrift/PlanNodes.thrift@291
PS1, Line 291:   3: required i32 op_ordinal
 :   4: required string filter_constant
> Do these need to be optional? Is filter_constant ever null?
Thanks Joe, Tim and Paul. I made only qualifier optional in the new rev now. I 
originally thought once we make something optional in the middle, the ones that 
follow it need to be too. But I was wrong about that.


http://gerrit.cloudera.org:8080/#/c/12213/1/tests/query_test/test_hbase_queries.py
File tests/query_test/test_hbase_queries.py:

http://gerrit.cloudera.org:8080/#/c/12213/1/tests/query_test/test_hbase_queries.py@28
PS1, Line 28: )
> flake8: E123 closing bracket does not match indentation of opening bracket'
Oops, sorry, I missed this one in the new rev. Will do in next.


http://gerrit.cloudera.org:8080/#/c/12213/1/tests/query_test/test_hbase_queries.py@73
PS1, Line 73:   def test_hbase_col_name(self, unique_database):
> line has trailing whitespace
Solved in new rev. thanks.


http://gerrit.cloudera.org:8080/#/c/12213/1/tests/query_test/test_hbase_queries.py@77
PS1, Line 77: table_name = 
"{0}.hbase_col_name_testkeyx".format(unique_database)
> We generally prefer using {0}, {1} and .format() over the % operator in new
Solved in new rev. Thanks.


http://gerrit.cloudera.org:8080/#/c/12213/1/tests/query_test/test_hbase_queries.py@78
PS1, Line 78: cr_table = """CREATE TABLE {0} (k STRING, c STRING) STORED BY
:'org.apache.hadoop.hive.hbase.HBaseStorageHandler' 
WITH SERDEPROPERTIES
:('hbase.columns.mapping'=':key,cf:c', 
'serialization.format'='1')
:TBLPROPERTIES ('hbase.table.name'=\'{1}\',
:
'storage_handler'='org.apache.hadoop.hive.hbase.HBaseStorageHandler')
:""".format(table_name, table_name)
:
> Couple things here:
Solved in new rev. Thanks.


http://gerrit.cloudera.org:8080/#/c/12213/1/tests/query_test/test_hbase_queries.py@89
PS1, Line 89: self.run_stmt_in_hive(cr_table)
> We should validate that the results are correct just to make sure that we h
Added data and new queries to do that in the new rev. Since I have to create a 
table, then run query, I modified the test I added to do so. Hope it looks 
good. Thanks.


http://gerrit.cloudera.org:8080/#/c/12213/1/tests/query_test/test_hbase_queries.py@91
PS1, Line 91:
> This is not needed - the unique_database handles deleting the database
When I run the same tests again and again, if I don't have this, it will fail 
complaining the hbase already exists. After I added this, then the test can be 
run multiple times without issue. Seems there is something for us to understand?



--
To view, visit http://gerrit.cloudera.org:8080/12213
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
Gerrit-Change-Number: 12213
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Fri, 11 Jan 2019 22:26:05 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7929. Impala query on HBASE table failing with InternalException.

2019-01-11 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded a new patch set (#2). ( 
http://gerrit.cloudera.org:8080/12213 )

Change subject: IMPALA-7929. Impala query on HBASE table failing with 
InternalException.
..

IMPALA-7929. Impala query on HBASE table failing with InternalException.

Impala query failed with "IntenalException: Required field 'qualifier' was not 
present!"
on table created via hive and mapped to hbase, because the qualifier of the 
hbase key
key column is null in the mapped table, and Impala requires non-null qualifier. 
The fix
here relaxes this requirement.

Test:
Added unit test.
Tested in real cluster.

Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
---
M common/thrift/PlanNodes.thrift
M fe/src/main/java/org/apache/impala/planner/HBaseScanNode.java
M tests/query_test/test_hbase_queries.py
3 files changed, 54 insertions(+), 4 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/13/12213/2
--
To view, visit http://gerrit.cloudera.org:8080/12213
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
Gerrit-Change-Number: 12213
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-7929. Impala query on HBASE table failing with InternalException.

2019-01-10 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12213 )

Change subject: IMPALA-7929. Impala query on HBASE table failing with 
InternalException.
..


Patch Set 1:

Hi Joe and Tim, thanks a lot for the prompt review, all great comments! I will 
address them in next rev asap.


--
To view, visit http://gerrit.cloudera.org:8080/12213
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
Gerrit-Change-Number: 12213
Gerrit-PatchSet: 1
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Thu, 10 Jan 2019 22:32:06 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7929. Impala query on HBASE table failing with InternalException.

2019-01-10 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12213 )

Change subject: IMPALA-7929. Impala query on HBASE table failing with 
InternalException.
..


Patch Set 1:

Hello Joe and Tim,

I just posted a patch. Would you please help taking a look? Thanks a lot.

Many thanks to Joe for the discussions about solutions and testing.


--
To view, visit http://gerrit.cloudera.org:8080/12213
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
Gerrit-Change-Number: 12213
Gerrit-PatchSet: 1
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Thu, 10 Jan 2019 16:49:50 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7929. Impala query on HBASE table failing with InternalException.

2019-01-10 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/12213


Change subject: IMPALA-7929. Impala query on HBASE table failing with 
InternalException.
..

IMPALA-7929. Impala query on HBASE table failing with InternalException.

Impala query failed with "IntenalException: Required field 'qualifier' was not 
present!"
on table created via hive and mapped to hbase, because the qualifier of the 
hbase key
key column is null in the mapped table, and Impala requires non-null qualifier. 
The fix
here relaxes this requirement.

Test:
Added unit test.
Tested in real cluster.

Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
---
M common/thrift/PlanNodes.thrift
M fe/src/main/java/org/apache/impala/planner/HBaseScanNode.java
M tests/query_test/test_hbase_queries.py
3 files changed, 41 insertions(+), 6 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/13/12213/1
--
To view, visit http://gerrit.cloudera.org:8080/12213
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I378c2249604481067b5b1c3a3bbb28c30ad4d751
Gerrit-Change-Number: 12213
Gerrit-PatchSet: 1
Gerrit-Owner: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-6742: Profiles of running queries should include execution summary.

2019-01-09 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11591 )

Change subject: IMPALA-6742: Profiles of running queries should include 
execution summary.
..


Patch Set 6:

> Change has been successfully rebased and submitted as
 > e9652a48dd00c3c076ddccaa940d074b6970b7fc by Joe McDonnell

Many thanks Joe and Tim for the review and commit!

--Yongjun


--
To view, visit http://gerrit.cloudera.org:8080/11591
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
Gerrit-Change-Number: 11591
Gerrit-PatchSet: 6
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Wed, 09 Jan 2019 17:31:53 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6742: Profiles of running queries should include execution summary.

2019-01-04 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11591 )

Change subject: IMPALA-6742: Profiles of running queries should include 
execution summary.
..


Patch Set 5:

> Patch Set 4: Code-Review+1

Many thanks again for the review Joe. I just updated a new rev that fixes the 
flake8 issue.


--
To view, visit http://gerrit.cloudera.org:8080/11591
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
Gerrit-Change-Number: 11591
Gerrit-PatchSet: 5
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Fri, 04 Jan 2019 23:19:56 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6742: Profiles of running queries should include execution summary.

2019-01-04 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded a new patch set (#5). ( 
http://gerrit.cloudera.org:8080/11591 )

Change subject: IMPALA-6742: Profiles of running queries should include 
execution summary.
..

IMPALA-6742: Profiles of running queries should include execution summary.

Currently execution summary is not included in the profiles of running
queries, and it's only reported when the query is finished. This jira makes
the execution summary to the profile reported when queries are still running.

Testing:
Added unit test.
Done with real cluster.

Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
---
M be/src/service/impala-server.cc
M be/src/service/impala-server.h
M tests/query_test/test_observability.py
3 files changed, 42 insertions(+), 7 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/91/11591/5
--
To view, visit http://gerrit.cloudera.org:8080/11591
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
Gerrit-Change-Number: 11591
Gerrit-PatchSet: 5
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-5474: Adding a trivial subquery turns error into warning

2019-01-03 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded a new patch set (#5). ( 
http://gerrit.cloudera.org:8080/12022 )

Change subject: IMPALA-5474: Adding a trivial subquery turns error into warning
..

IMPALA-5474: Adding a trivial subquery turns error into warning

After adding a subquery to a query that fails with ERROR, it fails with WARNING.
The fix here makes it return ERROR.

Testing:
Added unit tests;
Done real cluster testing with reported cases.

Change-Id: Ibedb11dd3d50bcdb21d508f7d21691925491946e
---
M shell/impala_client.py
M tests/shell/test_shell_commandline.py
2 files changed, 36 insertions(+), 4 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/22/12022/5
--
To view, visit http://gerrit.cloudera.org:8080/12022
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ibedb11dd3d50bcdb21d508f7d21691925491946e
Gerrit-Change-Number: 12022
Gerrit-PatchSet: 5
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-5474: Adding a trivial subquery turns error into warning

2019-01-03 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12022 )

Change subject: IMPALA-5474: Adding a trivial subquery turns error into warning
..


Patch Set 5:

> (1 comment)

Thanks Tim. I had some discussions with Joe, and updated the comment in rev5. 
Would you please take a look? thanks.


--
To view, visit http://gerrit.cloudera.org:8080/12022
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ibedb11dd3d50bcdb21d508f7d21691925491946e
Gerrit-Change-Number: 12022
Gerrit-PatchSet: 5
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Fri, 04 Jan 2019 06:23:49 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-5474: Adding a trivial subquery turns error into warning

2019-01-03 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12022 )

Change subject: IMPALA-5474: Adding a trivial subquery turns error into warning
..


Patch Set 4:

(1 comment)

> Please file a bug for the flaky test - we need to investigate that
 > and fix it (it seems like it could be a product bug).

Agree, we can file a jira after the jira is committed, since the failed test is 
from testing uncommitted code.

http://gerrit.cloudera.org:8080/#/c/12022/4/shell/impala_client.py
File shell/impala_client.py:

http://gerrit.cloudera.org:8080/#/c/12022/4/shell/impala_client.py@543
PS4, Line 543: """Returns ERROR message for the last_query_handle. Caller 
knows that the query has
> This comment is misleading now, which is worse than no comment - this retur
Hi Tim, thanks for your review and comment. Sorry for my late reply. This 
method does get error or ERROR when there is, because "True" is passed to the 
"warn" parameter of get_warn_or_error_log. Maybe we can change the method name 
from get_error_log to get_error_log_if_there_is? If so, we can change the other 
places accordingly. Or we can modify the comment to indicate the same thing. 
e.g., "Returns ERROR message for the last_query_handle when applicable. ...". I 
prefer the latter. What do you think? thanks.



--
To view, visit http://gerrit.cloudera.org:8080/12022
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ibedb11dd3d50bcdb21d508f7d21691925491946e
Gerrit-Change-Number: 12022
Gerrit-PatchSet: 4
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Thu, 03 Jan 2019 19:55:00 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6742: Profiles of running queries should include execution summary.

2018-12-26 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11591 )

Change subject: IMPALA-6742: Profiles of running queries should include 
execution summary.
..


Patch Set 4:

BTW, the newly added unit test does fail without my fix. Thanks.


--
To view, visit http://gerrit.cloudera.org:8080/11591
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
Gerrit-Change-Number: 11591
Gerrit-PatchSet: 4
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Thu, 27 Dec 2018 05:43:45 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6742: Profiles of running queries should include execution summary.

2018-12-26 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11591 )

Change subject: IMPALA-6742: Profiles of running queries should include 
execution summary.
..


Patch Set 4:

> I would like there to be a test that breaks if this change is not
 > there (or somehow this breaks later).
 > query_test/test_observability.py is the right place to add this
 > test.
 > Here are a couple ways to do this:
 > 1. Use a sleep in the query. See query_test/test_rows_availability.py:
 > "select * from functional.alltypestiny where month = 1 and bool_col
 > = sleep(1000);" will take two seconds. Read that test and
 > understand why the sleep works.
 > 2. Execute a query that returns rows, and get the profile before
 > fetching any of the rows. If we haven't called fetch, then the
 > client has not called ImpalaServer::UnregisterQuery().
 > Of these, I think I prefer #2. See test_exec_summary() for an
 > example.
 > Unrelated to your change, I would like you to hand test your change
 > by putting a sleep between these two lines:
 > https://github.com/apache/impala/blob/master/be/src/service/client-request-state.cc#L516-L517
 > Then fetch the profile and verify it doesn't crash. I have a theory
 > that this might not work.
Hi Joe,

Thanks a lot for the review and comments and sorry for getting back to this 
late. Very good suggestions!

I just uploaded a new rev to address it by adding a new unit test per your 
comment #2. I also did the manual test of adding a sleep call between 
L516-L517, I did not observe behavior difference. Would you please help taking 
a look?

Thanks.


--
To view, visit http://gerrit.cloudera.org:8080/11591
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
Gerrit-Change-Number: 11591
Gerrit-PatchSet: 4
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Thu, 27 Dec 2018 04:19:35 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6742: Profiles of running queries should include execution summary.

2018-12-26 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded a new patch set (#4). ( 
http://gerrit.cloudera.org:8080/11591 )

Change subject: IMPALA-6742: Profiles of running queries should include 
execution summary.
..

IMPALA-6742: Profiles of running queries should include execution summary.

Currently execution summary is not included in the profiles of running
queries, and it's only reported when the query is finished. This jira makes
the execution summary to the profile reported when queries are still running.

Testing:
Added unit test.
Done with real cluster.

Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
---
M be/src/service/impala-server.cc
M be/src/service/impala-server.h
M tests/query_test/test_observability.py
3 files changed, 41 insertions(+), 7 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/91/11591/4
--
To view, visit http://gerrit.cloudera.org:8080/11591
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
Gerrit-Change-Number: 11591
Gerrit-PatchSet: 4
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-5474: Adding a trivial subquery turns error into warning

2018-12-12 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12022 )

Change subject: IMPALA-5474: Adding a trivial subquery turns error into warning
..


Patch Set 4:

> Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/3561/

The failed test https://jenkins.impala.io/job/ubuntu-16.04-from-scratch/3863/ 
appears to be not caused by my change. This new rev only has some additional 
comments added, comparing with last rev that had successful test run.

Thanks.


--
To view, visit http://gerrit.cloudera.org:8080/12022
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ibedb11dd3d50bcdb21d508f7d21691925491946e
Gerrit-Change-Number: 12022
Gerrit-PatchSet: 4
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Thu, 13 Dec 2018 04:45:36 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-5474: Adding a trivial subquery turns error into warning

2018-12-12 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12022 )

Change subject: IMPALA-5474: Adding a trivial subquery turns error into warning
..


Patch Set 4:

Thanks much for the review and comments Tim! I added comments to the new 
methods.


--
To view, visit http://gerrit.cloudera.org:8080/12022
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ibedb11dd3d50bcdb21d508f7d21691925491946e
Gerrit-Change-Number: 12022
Gerrit-PatchSet: 4
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Wed, 12 Dec 2018 23:09:45 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-5474: Adding a trivial subquery turns error into warning

2018-12-12 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded a new patch set (#4). ( 
http://gerrit.cloudera.org:8080/12022 )

Change subject: IMPALA-5474: Adding a trivial subquery turns error into warning
..

IMPALA-5474: Adding a trivial subquery turns error into warning

After adding a subquery to a query that fails with ERROR, it fails with WARNING.
The fix here makes it return ERROR.

Testing:
Added unit tests;
Done real cluster testing with reported cases.

Change-Id: Ibedb11dd3d50bcdb21d508f7d21691925491946e
---
M shell/impala_client.py
M tests/shell/test_shell_commandline.py
2 files changed, 33 insertions(+), 4 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/22/12022/4
--
To view, visit http://gerrit.cloudera.org:8080/12022
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ibedb11dd3d50bcdb21d508f7d21691925491946e
Gerrit-Change-Number: 12022
Gerrit-PatchSet: 4
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-5474: Adding a trivial subquery turns error into warning

2018-12-11 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12022 )

Change subject: IMPALA-5474: Adding a trivial subquery turns error into warning
..


Patch Set 3:

> Yeah WARNING isn't a query state, it's just that the error log
 > mechanism can be used to return warnings that don't fail the query.
 >
 > It should be valid to call fetch on any query. Since
 > https://issues.apache.org/jira/browse/IMPALA-5903 was fixed every
 > query type has some kind of result set (even if it's just a status
 > message). impala-shell tries to be a bit more clever and report DML
 > results and similar in a special way, but other clients just fetch
 > the result set.
 >
 > If you look at ImpalaServer::FetchInternal() if called once the
 > query is in the EXCEPTION state it will end up raising the query
 > status as a BeeswaxException.

 > Yeah WARNING isn't a query state, it's just that the error log
 > mechanism can be used to return warnings that don't fail the query.
 >
 > It should be valid to call fetch on any query. Since
 > https://issues.apache.org/jira/browse/IMPALA-5903 was fixed every
 > query type has some kind of result set (even if it's just a status
 > message). impala-shell tries to be a bit more clever and report DML
 > results and similar in a special way, but other clients just fetch
 > the result set.
 >
 > If you look at ImpalaServer::FetchInternal() if called once the
 > query is in the EXCEPTION state it will end up raising the query
 > status as a BeeswaxException.

HI Tim,

Thanks a lot for the review and comments. I agree with you that calling fetch() 
would make the two queries end that the same state. However, adding the logic I 
mentioned earlier would complicate the wait_to_finish() method quite a bit.

My thinking is that, we call get_warning_log() all over the places, even though 
we know some are ERROR and some are warning. So I propose adding a new method 
get_error_log() so we have two methods to choose from. To fix the jira here, we 
simply need to replace the one called when EXCEPTION state is detected with 
get_error_log. This way, the code looks clear, and we don't need to touch any 
other testcases I touched with previous rev.

Would you please take a look at rev3 I just uploaded?

Thanks a lot.


--
To view, visit http://gerrit.cloudera.org:8080/12022
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ibedb11dd3d50bcdb21d508f7d21691925491946e
Gerrit-Change-Number: 12022
Gerrit-PatchSet: 3
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Wed, 12 Dec 2018 00:29:15 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-5474: Adding a trivial subquery turns error into warning

2018-12-11 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded a new patch set (#3). ( 
http://gerrit.cloudera.org:8080/12022 )

Change subject: IMPALA-5474: Adding a trivial subquery turns error into warning
..

IMPALA-5474: Adding a trivial subquery turns error into warning

After adding a subquery to a query that fails with ERROR, it fails with WARNING.
The fix here makes it return ERROR.

Testing:
Added unit tests;
Done real cluster testing with reported cases.

Change-Id: Ibedb11dd3d50bcdb21d508f7d21691925491946e
---
M shell/impala_client.py
M tests/shell/test_shell_commandline.py
2 files changed, 27 insertions(+), 4 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/22/12022/3
--
To view, visit http://gerrit.cloudera.org:8080/12022
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ibedb11dd3d50bcdb21d508f7d21691925491946e
Gerrit-Change-Number: 12022
Gerrit-PatchSet: 3
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-5474: Adding a trivial subquery turns error into warning

2018-12-03 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12022 )

Change subject: IMPALA-5474: Adding a trivial subquery turns error into warning
..


Patch Set 2:

> (1 comment)

Thanks Tim, good points. The experiment you did is exactly what I did to find 
out where the WARN/ERROR messages are from.

Internally we only have the following states:

beeswax::QueryState::type ClientRequestState::BeeswaxQueryState() const { 
  switch (operation_state_) { 
case TOperationState::INITIALIZED_STATE: return 
beeswax::QueryState::CREATED;
case TOperationState::PENDING_STATE: return beeswax::QueryState::COMPILED;
case TOperationState::RUNNING_STATE: return beeswax::QueryState::RUNNING;   
 
case TOperationState::FINISHED_STATE: return beeswax::QueryState::FINISHED;
case TOperationState::ERROR_STATE: return beeswax::QueryState::EXCEPTION;
default: {   
  DCHECK(false) << "Add explicit translation for all used TOperationState 
values";
  return beeswax::QueryState::EXCEPTION;

}   
  } 
}

We can see here that ERROR is treated the same as EXCEPTION in the above code. 
If some EXCEPTION is not the same as ERROR, then we want to have a distinct 
state to distinguish. For example, we may have EXCEPTION_ERROR, and 
EXCEPTION_WARN states.

Due to lack of two distinct states, we get into the situation that either way 
the wait_to_finish function can be wrong: if we issue a WARN there, it may be 
an ERROR; if we issue an ERROR, it may be just a WARN.

Your suggestion to call fetch() when in EXCEPTION state sounds good, but is it 
always valid to try fetch() when in EXCEPTION state? There is some logic/code 
between wait_to_finish and fetch() in impala_shell.py. We may need to apply the 
same logic inside wait_to_finish when detecting EXCEPTION state.

Thanks.
 > (1 comment)


--
To view, visit http://gerrit.cloudera.org:8080/12022
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ibedb11dd3d50bcdb21d508f7d21691925491946e
Gerrit-Change-Number: 12022
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Tue, 04 Dec 2018 06:41:40 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-5474: Adding a trivial subquery turns error into warning

2018-12-02 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded a new patch set (#2). ( 
http://gerrit.cloudera.org:8080/12022 )

Change subject: IMPALA-5474: Adding a trivial subquery turns error into warning
..

IMPALA-5474: Adding a trivial subquery turns error into warning

After adding a subquery to a query that fails with ERROR, it fails with WARNING.
The fix here makes it return ERROR.

Testing:
Added unit tests;
Done real cluster testing with reported cases.

Change-Id: Ibedb11dd3d50bcdb21d508f7d21691925491946e
---
M shell/impala_client.py
M shell/impala_shell.py
M tests/shell/test_shell_commandline.py
3 files changed, 33 insertions(+), 12 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/22/12022/2
--
To view, visit http://gerrit.cloudera.org:8080/12022
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ibedb11dd3d50bcdb21d508f7d21691925491946e
Gerrit-Change-Number: 12022
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 


[Impala-ASF-CR] IMPALA-5474: Adding a trivial subquery turns error into warning

2018-12-02 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12022 )

Change subject: IMPALA-5474: Adding a trivial subquery turns error into warning
..


Patch Set 2:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/12022/1/tests/shell/test_shell_commandline.py
File tests/shell/test_shell_commandline.py:

http://gerrit.cloudera.org:8080/#/c/12022/1/tests/shell/test_shell_commandline.py@220
PS1, Line 220:
> flake8: E501 line too long (115 > 90 characters)
Done


http://gerrit.cloudera.org:8080/#/c/12022/1/tests/shell/test_shell_commandline.py@220
PS1, Line 220:
> The error message has specific numbers, including the actual number of colu
Hi Paul, Thanks a lot for the review and comments. I reused the existing table 
that I believe was handcrafted to have the 11/10 mismatch. This test is also 
used in other places in other tests (see 
./testdata/workloads/functional-query/queries/QueryTest/parquet-abort-on-error.test).
 So the 11/10 numbers are essentially hardcoded for this testdata. So I'm not 
sure whether it's worth more work to parameterize the test. But in general I 
agreeing that parameterizing stuff is helpful. I just updated new rev to fix 
the line length, and I also added some comments to the test. Thanks.



--
To view, visit http://gerrit.cloudera.org:8080/12022
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ibedb11dd3d50bcdb21d508f7d21691925491946e
Gerrit-Change-Number: 12022
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Sun, 02 Dec 2018 18:42:42 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-5474: Adding a trivial subquery turns error into warning

2018-12-01 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/12022


Change subject: IMPALA-5474: Adding a trivial subquery turns error into warning
..

IMPALA-5474: Adding a trivial subquery turns error into warning

After adding a subquery to a query that fails with ERROR, it fails with WARNING.
The fix here makes it return ERROR.

Testing:
Added unit tests;
Done real cluster testing with reported cases.

Change-Id: Ibedb11dd3d50bcdb21d508f7d21691925491946e
---
M shell/impala_client.py
M shell/impala_shell.py
M tests/shell/test_shell_commandline.py
3 files changed, 24 insertions(+), 12 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/22/12022/1
--
To view, visit http://gerrit.cloudera.org:8080/12022
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Ibedb11dd3d50bcdb21d508f7d21691925491946e
Gerrit-Change-Number: 12022
Gerrit-PatchSet: 1
Gerrit-Owner: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-6742: Profiles of running queries should include execution summary.

2018-10-30 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11591 )

Change subject: IMPALA-6742: Profiles of running queries should include 
execution summary.
..


Patch Set 2:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/11591/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/11591/1//COMMIT_MSG@9
PS1, Line 9: Currently execution summary is not included in the profiles of 
running
   : queries, and it's only reported when the query is finished. This 
jira makes
   : the execution summary to the profile reported wh
> Use shorter lines. My recommendation is to wrap at 70 characters.
Thanks, addressed in new rev.


http://gerrit.cloudera.org:8080/#/c/11591/1/be/src/service/impala-server.h
File be/src/service/impala-server.h:

http://gerrit.cloudera.org:8080/#/c/11591/1/be/src/service/impala-server.h@637
PS1, Line 637: void UpdateE
> You don't want [[noreturn]]
Thanks Joe, my misunderstanding of this annotation. Fixed in new rev.


http://gerrit.cloudera.org:8080/#/c/11591/1/be/src/service/impala-server.cc
File be/src/service/impala-server.cc:

http://gerrit.cloudera.org:8080/#/c/11591/1/be/src/service/impala-server.cc@1128
PS1, Line 1128:<< PrettyPrinter::Print(cpu_limit_
> I'm thinking this might not hold for some calls to GetRuntimeProfileStr() t
Good catch Joe. Indeed. Fixed in new rev by adding a check.

I tried to run test_observability both locally and at jenkins, the former had 
some failures, however, the latter is clean. Looking into why it failed locally.



--
To view, visit http://gerrit.cloudera.org:8080/11591
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
Gerrit-Change-Number: 11591
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Tue, 30 Oct 2018 21:51:06 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6742: Profiles of running queries should include execution summary.

2018-10-30 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11591 )

Change subject: IMPALA-6742: Profiles of running queries should include 
execution summary.
..


Patch Set 2:

Hi Joe, thanks a lot for the very good review and sorry for late update. I just 
uploaded a new rev. Interestingly, some tests in test_observability failed 
locally but all is clean in jenkins. One question about your comment about line 
637, I saw other places included [[noreturn]], what's the guideline for having 
it or not? thanks.


--
To view, visit http://gerrit.cloudera.org:8080/11591
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
Gerrit-Change-Number: 11591
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Tue, 30 Oct 2018 17:36:59 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6742: Profiles of running queries should include execution summary.

2018-10-30 Thread Yongjun Zhang (Code Review)
Hello Tim Armstrong, Joe McDonnell, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/11591

to look at the new patch set (#2).

Change subject: IMPALA-6742: Profiles of running queries should include 
execution summary.
..

IMPALA-6742: Profiles of running queries should include execution summary.

Currently execution summary is not included in the profiles of running
queries, and it's only reported when the query is finished. This jira makes
the execution summary to the profile reported when queries are still running.

Testing:
Done with real cluster.

Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
---
M be/src/service/impala-server.cc
M be/src/service/impala-server.h
2 files changed, 19 insertions(+), 7 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/91/11591/2
--
To view, visit http://gerrit.cloudera.org:8080/11591
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
Gerrit-Change-Number: 11591
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-6742: Profiles of running queries should include execution summary

2018-10-30 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/11591


Change subject: IMPALA-6742: Profiles of running queries should include 
execution summary
..

IMPALA-6742: Profiles of running queries should include execution summary

Currently execution summary is not included in the profiles of running queries, 
and it's
only reported when the query is finished. This jira makes the execution summary 
to the
profile reported when queries are still running.

Testing:
This is a draft, tests are yet to be done.

Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
---
M be/src/service/impala-server.cc
M be/src/service/impala-server.h
2 files changed, 15 insertions(+), 4 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/91/11591/1
--
To view, visit http://gerrit.cloudera.org:8080/11591
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
Gerrit-Change-Number: 11591
Gerrit-PatchSet: 1
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-7166: ExecSummary should be a first class object.

2018-10-04 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded a new patch set (#3). ( 
http://gerrit.cloudera.org:8080/11555 )

Change subject: IMPALA-7166: ExecSummary should be a first class object.
..

IMPALA-7166: ExecSummary should be a first class object.

Impala RuntimeProfile currently contains "ExecSummary" as a string. We should 
make it a
first class thrift object, so that tools can extract these fields (Est rows 
etc..).

Testing:
Modified unit test.

Change-Id: I4791237a5579f16c9efda8e57876d48980739e13
---
M be/src/service/impala-server.cc
M be/src/util/runtime-profile-test.cc
M be/src/util/runtime-profile.cc
M be/src/util/runtime-profile.h
M common/thrift/RuntimeProfile.thrift
5 files changed, 48 insertions(+), 3 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/55/11555/3
--
To view, visit http://gerrit.cloudera.org:8080/11555
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I4791237a5579f16c9efda8e57876d48980739e13
Gerrit-Change-Number: 11555
Gerrit-PatchSet: 3
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-7166: ExecSummary should be a first class object.

2018-10-04 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11555 )

Change subject: IMPALA-7166: ExecSummary should be a first class object.
..


Patch Set 3:

Thanks for the review Tim. I uploaded new rev to address other comments for 
completeness here. But I think let me fix IMPALA-6742 first.


--
To view, visit http://gerrit.cloudera.org:8080/11555
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4791237a5579f16c9efda8e57876d48980739e13
Gerrit-Change-Number: 11555
Gerrit-PatchSet: 3
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Thu, 04 Oct 2018 23:19:04 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7166: ExecSummary should be a first class object.

2018-10-03 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11555 )

Change subject: IMPALA-7166: ExecSummary should be a first class object.
..


Patch Set 2:

> (1 comment)
 >
 > We're having some non-trivial discussions here so let's publish
 > this code review so it's visible to everyone else.

 > (1 comment)
 >
 > We're having some non-trivial discussions here so let's publish
 > this code review so it's visible to everyone else.

Thanks Tim.

 > (1 comment)
 >
 > We're having some non-trivial discussions here so let's publish
 > this code review so it's visible to everyone else.

 > (1 comment)
 >
 > We're having some non-trivial discussions here so let's publish
 > this code review so it's visible to everyone else.

Thanks Tim, good info. I will work on IMPALA-6742.


--
To view, visit http://gerrit.cloudera.org:8080/11555
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4791237a5579f16c9efda8e57876d48980739e13
Gerrit-Change-Number: 11555
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Thu, 04 Oct 2018 01:11:08 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7166: ExecSummary should be a first class object.

2018-10-03 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/11555


Change subject: IMPALA-7166: ExecSummary should be a first class object.
..

IMPALA-7166: ExecSummary should be a first class object.

Impala RuntimeProfile currently contains "ExecSummary" as a string. We should 
make it a
first class thrift object, so that tools can extract these fields (Est rows 
etc..).

Testing:
Modified unit test.

Change-Id: I4791237a5579f16c9efda8e57876d48980739e13
---
M be/src/service/impala-server.cc
M be/src/util/runtime-profile-test.cc
M be/src/util/runtime-profile.cc
M be/src/util/runtime-profile.h
M common/thrift/RuntimeProfile.thrift
5 files changed, 47 insertions(+), 3 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/55/11555/2
--
To view, visit http://gerrit.cloudera.org:8080/11555
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I4791237a5579f16c9efda8e57876d48980739e13
Gerrit-Change-Number: 11555
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-6202. mod and % should be equivalent.

2018-09-27 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11443 )

Change subject: IMPALA-6202. mod and % should be equivalent.
..


Patch Set 10:

Hi Tim, Vuk and Dan, thanks a lot for the help with this jira.

Best.


--
To view, visit http://gerrit.cloudera.org:8080/11443
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
Gerrit-Change-Number: 11443
Gerrit-PatchSet: 10
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Thu, 27 Sep 2018 22:05:43 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6202. mod and % should be equivalent.

2018-09-27 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11443 )

Change subject: IMPALA-6202. mod and % should be equivalent.
..


Patch Set 9:

> (7 comments)

Thanks for the review Vuk, all the comments here are addressed in rev9.


--
To view, visit http://gerrit.cloudera.org:8080/11443
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
Gerrit-Change-Number: 11443
Gerrit-PatchSet: 9
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Thu, 27 Sep 2018 16:38:31 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6202. mod and % should be equivalent.

2018-09-27 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11443 )

Change subject: IMPALA-6202. mod and % should be equivalent.
..


Patch Set 9:

> (2 comments)

Thanks for the review Vuk. These two comments are addressed in rev9. I used 
"this case" instead of "these cases" since it's only one case there. Hope it 
works for you.


--
To view, visit http://gerrit.cloudera.org:8080/11443
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
Gerrit-Change-Number: 11443
Gerrit-PatchSet: 9
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Thu, 27 Sep 2018 16:37:28 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6202. mod and % should be equivalent.

2018-09-27 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11443 )

Change subject: IMPALA-6202. mod and % should be equivalent.
..


Patch Set 9:

> (2 comments)
 >
 > Looks good to me, just had a couple of suggestions

Thanks for the review Tim, all your comments are addressed in newer rev.


--
To view, visit http://gerrit.cloudera.org:8080/11443
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
Gerrit-Change-Number: 11443
Gerrit-PatchSet: 9
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Thu, 27 Sep 2018 16:35:36 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6202. mod and % should be equivalent.

2018-09-27 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded a new patch set (#9). ( 
http://gerrit.cloudera.org:8080/11443 )

Change subject: IMPALA-6202. mod and % should be equivalent.
..

IMPALA-6202. mod and % should be equivalent.

Currently in DECIMAL V2 mode, typeof(9.9 % 3) is DECIMAL(2,1) and 
typeof(mod(9.9, 3)) is
DECIMAL(4,1), while both are expected to be DECIMAL(2,1). This jira fixes V2 
mode by
replacing "mod" with "%" at parser stage thus they share the same code path 
afterwards.

Testing:
Added unit tests and done real cluster testing.

Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
---
M fe/src/main/cup/sql-parser.cup
M fe/src/main/java/org/apache/impala/analysis/FunctionCallExpr.java
M fe/src/main/java/org/apache/impala/catalog/Type.java
M fe/src/main/java/org/apache/impala/catalog/View.java
M fe/src/main/java/org/apache/impala/service/Frontend.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzeExprsTest.java
M fe/src/test/java/org/apache/impala/analysis/ParserTest.java
M fe/src/test/java/org/apache/impala/common/FrontendTestBase.java
M fe/src/test/java/org/apache/impala/service/JdbcTest.java
M 
testdata/workloads/functional-planner/queries/PlannerTest/parquet-filtering-disabled.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/parquet-filtering.test
M testdata/workloads/functional-query/queries/QueryTest/decimal-exprs.test
12 files changed, 126 insertions(+), 32 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/43/11443/9
--
To view, visit http://gerrit.cloudera.org:8080/11443
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
Gerrit-Change-Number: 11443
Gerrit-PatchSet: 9
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-6202. mod and % should be equivalent.

2018-09-26 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded a new patch set (#8). ( 
http://gerrit.cloudera.org:8080/11443 )

Change subject: IMPALA-6202. mod and % should be equivalent.
..

IMPALA-6202. mod and % should be equivalent.

Currently in DECIMAL V2 mode, typeof(9.9 % 3) is DECIMAL(2,1) and 
typeof(mod(9.9, 3)) is
DECIMAL(4,1), while both are expected to be DECIMAL(2,1). This jira fixes V2 
mode by
replacing "mod" with "%" at parser stage thus they share the same code path 
afterwards.

Testing:
Added unit tests and done real cluster testing.

Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
---
M fe/src/main/cup/sql-parser.cup
M fe/src/main/java/org/apache/impala/analysis/FunctionCallExpr.java
M fe/src/main/java/org/apache/impala/catalog/Type.java
M fe/src/main/java/org/apache/impala/catalog/View.java
M fe/src/main/java/org/apache/impala/service/Frontend.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzeExprsTest.java
M fe/src/test/java/org/apache/impala/analysis/ParserTest.java
M fe/src/test/java/org/apache/impala/common/FrontendTestBase.java
M fe/src/test/java/org/apache/impala/service/JdbcTest.java
M 
testdata/workloads/functional-planner/queries/PlannerTest/parquet-filtering-disabled.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/parquet-filtering.test
M testdata/workloads/functional-query/queries/QueryTest/decimal-exprs.test
12 files changed, 126 insertions(+), 32 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/43/11443/8
--
To view, visit http://gerrit.cloudera.org:8080/11443
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
Gerrit-Change-Number: 11443
Gerrit-PatchSet: 8
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-6202. mod and % should be equivalent.

2018-09-25 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11443 )

Change subject: IMPALA-6202. mod and % should be equivalent.
..


Patch Set 6:

Thanks a lot for the review Vuk, just updated new rev to address your comments.


--
To view, visit http://gerrit.cloudera.org:8080/11443
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
Gerrit-Change-Number: 11443
Gerrit-PatchSet: 6
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Tue, 25 Sep 2018 21:54:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6202. mod and % should be equivalent.

2018-09-25 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded a new patch set (#6). ( 
http://gerrit.cloudera.org:8080/11443 )

Change subject: IMPALA-6202. mod and % should be equivalent.
..

IMPALA-6202. mod and % should be equivalent.

Currently in DECIMAL V2 mode, typeof(9.9 % 3) is DECIMAL(2,1) and 
typeof(mod(9.9, 3)) is
DECIMAL(4,1), while both are expected to be DECIMAL(2,1). This jira fixes V2 
mode by
replacing "mod" with "%" at parser stage thus they share the same code path 
afterwards.

Testing:
Added unit tests and done real cluster testing.

Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
---
M fe/src/main/cup/sql-parser.cup
M fe/src/main/java/org/apache/impala/analysis/FunctionCallExpr.java
M fe/src/main/java/org/apache/impala/service/Frontend.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzeExprsTest.java
M fe/src/test/java/org/apache/impala/common/FrontendTestBase.java
M 
testdata/workloads/functional-planner/queries/PlannerTest/parquet-filtering-disabled.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/parquet-filtering.test
M testdata/workloads/functional-query/queries/QueryTest/decimal-exprs.test
8 files changed, 120 insertions(+), 31 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/43/11443/6
--
To view, visit http://gerrit.cloudera.org:8080/11443
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
Gerrit-Change-Number: 11443
Gerrit-PatchSet: 6
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-6202. mod and % should be equivalent.

2018-09-25 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded a new patch set (#5). ( 
http://gerrit.cloudera.org:8080/11443 )

Change subject: IMPALA-6202. mod and % should be equivalent.
..

IMPALA-6202. mod and % should be equivalent.

Currently in DECIMAL V2 mode, typeof(9.9 % 3) is DECIMAL(2,1) and 
typeof(mod(9.9, 3)) is
DECIMAL(4,1), while both are expected to be DECIMAL(2,1). This jira fixes V2 
mode by
replacing "mod" with "%" at parser stage thus they share the same code path 
afterwards.

Testing:
Added unit tests and done real cluster testing.

Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
---
M fe/src/main/cup/sql-parser.cup
M fe/src/main/java/org/apache/impala/analysis/FunctionCallExpr.java
M fe/src/main/java/org/apache/impala/service/Frontend.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzeExprsTest.java
M fe/src/test/java/org/apache/impala/common/FrontendTestBase.java
M 
testdata/workloads/functional-planner/queries/PlannerTest/parquet-filtering-disabled.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/parquet-filtering.test
M testdata/workloads/functional-query/queries/QueryTest/decimal-exprs.test
8 files changed, 125 insertions(+), 30 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/43/11443/5
--
To view, visit http://gerrit.cloudera.org:8080/11443
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
Gerrit-Change-Number: 11443
Gerrit-PatchSet: 5
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-6202. mod and % should be equivalent.

2018-09-20 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded a new patch set (#4). ( 
http://gerrit.cloudera.org:8080/11443 )

Change subject: IMPALA-6202. mod and % should be equivalent.
..

IMPALA-6202. mod and % should be equivalent.

Currently typeof(9.9 % 3) is DECIMAL(2,1) and typeof(mod(9.9, 3)) is 
DECIMAL(4,1), it's
expected that both to be DECIMAL(2,1). This jira fixes it by making mod and % 
equivalent.
The solution is to replace "mod" with "%".

Testing:
Added unit tests and done real cluster testing.

Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
---
M fe/src/main/java/org/apache/impala/analysis/FunctionCallExpr.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzeExprsTest.java
M 
testdata/workloads/functional-planner/queries/PlannerTest/parquet-filtering-disabled.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/parquet-filtering.test
M testdata/workloads/functional-query/queries/QueryTest/decimal-exprs.test
5 files changed, 91 insertions(+), 26 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/43/11443/4
--
To view, visit http://gerrit.cloudera.org:8080/11443
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
Gerrit-Change-Number: 11443
Gerrit-PatchSet: 4
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-6202. mod and % are not equivalent.

2018-09-14 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11443 )

Change subject: IMPALA-6202. mod and % are not equivalent.
..


Patch Set 3:

Uploaded a quick v3 to prove concept, it seems to be a much better solution. 
The reason I'm looking for new solution is, the casting in the previous 
solution can fail for certain cases. That's the hole I was referring to. I will 
craft coding and probably add more tests later. One side note, this will change 
the behavior of "mod" for certain cases, thus may appear incompatible. Thanks.


--
To view, visit http://gerrit.cloudera.org:8080/11443
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
Gerrit-Change-Number: 11443
Gerrit-PatchSet: 3
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Sat, 15 Sep 2018 00:14:11 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6202. mod and % are not equivalent.

2018-09-14 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded a new patch set (#3). ( 
http://gerrit.cloudera.org:8080/11443 )

Change subject: IMPALA-6202. mod and % are not equivalent.
..

IMPALA-6202. mod and % are not equivalent.

Currently typeof(9.9 % 3) is DECIMAL(2,1) and typeof(mod(9.9, 3)) is 
DECIMAL(4,1), it's
expected that both to be DECIMAL(2,1). This jira fixes it by makeing mod and % 
equivalent.
The solution is to replace "mod" with "%".

Testing:
Added unit tests and done real cluster testing.

Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
---
M fe/src/main/java/org/apache/impala/analysis/FunctionCallExpr.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzeExprsTest.java
M testdata/workloads/functional-query/queries/QueryTest/decimal-exprs.test
3 files changed, 79 insertions(+), 15 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/43/11443/3
--
To view, visit http://gerrit.cloudera.org:8080/11443
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
Gerrit-Change-Number: 11443
Gerrit-PatchSet: 3
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-6202. mod and % are not equivalent.

2018-09-14 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11443 )

Change subject: IMPALA-6202. mod and % are not equivalent.
..


Patch Set 2:

I'm thinking about a different approach now, say, we can probably simply 
convert a "mod" to "%" when doing the parsing, this way, all code used by "%" 
would be shared, and thus making them really equivalent. Thanks.


--
To view, visit http://gerrit.cloudera.org:8080/11443
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
Gerrit-Change-Number: 11443
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Fri, 14 Sep 2018 23:37:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6202. mod and % are not equivalent.

2018-09-14 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11443 )

Change subject: IMPALA-6202. mod and % are not equivalent.
..


Patch Set 2:

> (2 comments)

Very good suggestion Vuk, I tried it out, and added some more tests and 
uncovered a hole in the solution. Thanks.


--
To view, visit http://gerrit.cloudera.org:8080/11443
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
Gerrit-Change-Number: 11443
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Fri, 14 Sep 2018 23:20:12 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6202. mod and % are not equivalent.

2018-09-14 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11443 )

Change subject: IMPALA-6202. mod and % are not equivalent.
..


Patch Set 2:

Hi Dan and Vuk, thanks a lot for your review comments. I just uploaded a new 
rev to address some test failures. I will address your comments in next rev. 
Thanks.


--
To view, visit http://gerrit.cloudera.org:8080/11443
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
Gerrit-Change-Number: 11443
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Fri, 14 Sep 2018 20:11:22 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6202. mod and % are not equivalent.

2018-09-14 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded a new patch set (#2). ( 
http://gerrit.cloudera.org:8080/11443 )

Change subject: IMPALA-6202. mod and % are not equivalent.
..

IMPALA-6202. mod and % are not equivalent.

Currently typeof(9.9 % 3) is DECIMAL(2,1) and typeof(mod(9.9, 3)) is 
DECIMAL(4,1), it's
expected that both to be DECIMAL(2,1). This jira fixes it for Decimal V2 mode. 
V1 behavior
is left as is since the V1 mode will be dropped soon per IMPALA-4924.

Testing:
Added unit tests to cover the above mentioned two cases for both Decimal V1 and 
V2 mode,
while the two cases are fixed in V2 mode, they remain different in V1 mode.

Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
---
M fe/src/main/java/org/apache/impala/analysis/Expr.java
M fe/src/main/java/org/apache/impala/analysis/TypesUtil.java
M testdata/workloads/functional-query/queries/QueryTest/decimal-exprs.test
3 files changed, 72 insertions(+), 4 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/43/11443/2
--
To view, visit http://gerrit.cloudera.org:8080/11443
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
Gerrit-Change-Number: 11443
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 


[Impala-ASF-CR] IMPALA-6202. mod and % are not equivalent.

2018-09-14 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/11443


Change subject: IMPALA-6202. mod and % are not equivalent.
..

IMPALA-6202. mod and % are not equivalent.

Currently typeof(9.9 % 3) is DECIMAL(2,1) and typeof(mod(9.9, 3)) is 
DECIMAL(4,1), it's
expected that both to be DECIMAL(2,1). This jira fixes it for Decimal V2 mode. 
V1 behavior
is left as is since the V1 mode will be dropped soon per IMPALA-4924.

Testing:
Added unit tests to cover the above mentioned two cases for both Decimal V1 and 
V2 mode,
while the two cases are fixed in V2 mode, they remain different in V1 mode.

Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
---
M fe/src/main/java/org/apache/impala/analysis/Expr.java
M fe/src/main/java/org/apache/impala/analysis/TypesUtil.java
M testdata/workloads/functional-query/queries/QueryTest/decimal-exprs.test
3 files changed, 75 insertions(+), 1 deletion(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/43/11443/1
--
To view, visit http://gerrit.cloudera.org:8080/11443
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Ib0067da04083859ffbf662a29007154461bea2ba
Gerrit-Change-Number: 11443
Gerrit-PatchSet: 1
Gerrit-Owner: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-6442: Misleading file offset reporting in error messages.

2018-09-07 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded a new patch set (#6). ( 
http://gerrit.cloudera.org:8080/11379 )

Change subject: IMPALA-6442: Misleading file offset reporting in error messages.
..

IMPALA-6442: Misleading file offset reporting in error messages.

The error message described in IMPALA-6442 incorrectly reported the file offset 
where the
Parquet footer starts, as if the offset is counted from the file end instead of 
from the
file beginning. The fix changed the reported file offset to be counted from the 
beginning
of the Parquet file.

Testing:
Create a small table that contains one row of data with a single column that's 
bigint and
store it as Parquet. Manually changed the footer size field to be
  1) smaller than the original footer size by 1, to trigger the error message 
fixed by
this jira to be printed, to verify that the fix functions correctly;
  2) bigger than the file size, thus to trigger another related error message 
to be
printed.

Change-Id: I35235e99ea9ceb0d31961dd3b8069f7194f5a2de
---
M be/src/exec/hdfs-parquet-scanner.cc
M testdata/data/README
A testdata/data/corrupt_footer_len_decr.parquet
A testdata/data/corrupt_footer_len_incr.parquet
A 
testdata/workloads/functional-query/queries/QueryTest/parquet-corrupt-footer-len-decr.test
A 
testdata/workloads/functional-query/queries/QueryTest/parquet-corrupt-footer-len-incr.test
M tests/query_test/test_scanners.py
7 files changed, 86 insertions(+), 18 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/79/11379/6
--
To view, visit http://gerrit.cloudera.org:8080/11379
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I35235e99ea9ceb0d31961dd3b8069f7194f5a2de
Gerrit-Change-Number: 11379
Gerrit-PatchSet: 6
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-6442: Misleading file offset reporting in error messages.

2018-09-06 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11379 )

Change subject: IMPALA-6442: Misleading file offset reporting in error messages.
..


Patch Set 2:

(8 comments)

http://gerrit.cloudera.org:8080/#/c/11379/1/be/src/exec/hdfs-parquet-scanner.cc
File be/src/exec/hdfs-parquet-scanner.cc:

http://gerrit.cloudera.org:8080/#/c/11379/1/be/src/exec/hdfs-parquet-scanner.cc@1165
PS1, Line 1165: sca
> Now that there are two variables storing some kind of length it would be go
Done in new rev.


http://gerrit.cloudera.org:8080/#/c/11379/1/be/src/exec/hdfs-parquet-scanner.cc@1214
PS1, Line 1214: // file_len - 4-byte footer length field - 4-byte version 
number field - metadata size
  :   int64_t metadata_start = file_len - sizeof(int32_t) -
> not your change but can you please fix the line wrapping while you're here?
Done in new rev.


http://gerrit.cloudera.org:8080/#/c/11379/1/be/src/exec/hdfs-parquet-scanner.cc@1228
PS1, Line 1228: const HdfsFileDesc* file_desc = 
scan_node_->GetFileDesc(partition_id, filename());
> If both are NULL we won't catch it now. Are we sure that cannot happen? Sho
Thanks a lot for the review Lars.

The assumption is that stream_->file_desc() is not null, if it's null, the call 
to get file length before this branch would have failed already. If we want, we 
could add a DCHECK in the beginning of the function.

I added the DCHECK here for multiple reasons:
1. only the logic in the existing code here tries to find file_desc.
2. this branch is an "unlikely" branch. Code out side this branch has no need 
to find file_desc.
3. the partition_id variable is only used in this branch.

Another version of my code did move these few lines to before this branch, but 
I chose to move it here because otherwise it's going to be some additional work 
for other code that doesn't need to reach this "unlikely" branch and doesn't 
use the partition_id variable.

What do you think?


http://gerrit.cloudera.org:8080/#/c/11379/1/be/src/exec/hdfs-parquet-scanner.cc@1267
PS1, Line 1267: "at file of
> nit: wrap after file_length
done in new rev.


http://gerrit.cloudera.org:8080/#/c/11379/1/testdata/data/corrupt_footer_len_decr.parquet
File testdata/data/corrupt_footer_len_decr.parquet:

http://gerrit.cloudera.org:8080/#/c/11379/1/testdata/data/corrupt_footer_len_decr.parquet@1
PS1, Line 1: PAR1LÒ,
%c^f&6&6(ÒÒ,,Hschema%c%$%c^f&6&6(ÒÒ,f(Oimpala
 version 2.11.0-SNAPSHOT (build 
26b2cc85d2084e9e464669b2a67a8015ede173e1)½PAR1
> Add a description for both test files to the README file in the same folder
done in new rev.


http://gerrit.cloudera.org:8080/#/c/11379/1/tests/query_test/test_scanners.py
File tests/query_test/test_scanners.py:

http://gerrit.cloudera.org:8080/#/c/11379/1/tests/query_test/test_scanners.py@401
PS1, Line 401:   def corrupt_footer_len_common(self, vector, unique_database, 
testname_postfix):
> The function should have a comment. Please find a more descriptive name for
Done in new rev.


http://gerrit.cloudera.org:8080/#/c/11379/1/tests/query_test/test_scanners.py@412
PS1, Line 412: create_table_and_copy_files(self.client,
> nit: single line
Done in new rev.


http://gerrit.cloudera.org:8080/#/c/11379/1/tests/query_test/test_scanners.py@420
PS1, Line 420:
> nit: capitalization
Done in new rev.



--
To view, visit http://gerrit.cloudera.org:8080/11379
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I35235e99ea9ceb0d31961dd3b8069f7194f5a2de
Gerrit-Change-Number: 11379
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Thu, 06 Sep 2018 08:11:06 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6442: Misleading file offset reporting in error messages.

2018-09-06 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/11379


Change subject: IMPALA-6442: Misleading file offset reporting in error messages.
..

IMPALA-6442: Misleading file offset reporting in error messages.

The error message described in IMPALA-6442 incorrectly reported the file offset 
where the
Parquet footer starts, as if the offset is counted from the file end instead of 
from the
file beginning. The fix changed the reported file offset to be counted from the 
beginning
of the Parquet file.

Testing:
Create a small table that contains one row of data with a single column that's 
bigint and
store it as Parquet. Manually changed the footer size field to be
  1) smaller than the original footer size by 1, to trigger the error message 
fixed by
this jira to be printed, to verify that the fix functions correctly;
  2) bigger than the file size, thus to trigger another related error message 
to be
printed.

Change-Id: I35235e99ea9ceb0d31961dd3b8069f7194f5a2de
---
M be/src/exec/hdfs-parquet-scanner.cc
M testdata/data/README
A testdata/data/corrupt_footer_len_decr.parquet
A testdata/data/corrupt_footer_len_incr.parquet
M tests/query_test/test_scanners.py
5 files changed, 65 insertions(+), 17 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/79/11379/2
--
To view, visit http://gerrit.cloudera.org:8080/11379
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I35235e99ea9ceb0d31961dd3b8069f7194f5a2de
Gerrit-Change-Number: 11379
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-6442: Misleading file offset reporting in error messages.

2018-09-04 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has abandoned this change. ( 
http://gerrit.cloudera.org:8080/11336 )

Change subject: IMPALA-6442: Misleading file offset reporting in error messages.
..


Abandoned

Abandon this one, and will work on a new one. Thanks.
--
To view, visit http://gerrit.cloudera.org:8080/11336
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: abandon
Gerrit-Change-Id: Ie78ef9b0e11b8505ab7bc965aafdb2729b0027c9
Gerrit-Change-Number: 11336
Gerrit-PatchSet: 1
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Brown 
Gerrit-Reviewer: Yongjun Zhang 


[Impala-ASF-CR] IMPALA-6442: Misleading file offset reporting in error messages.

2018-08-27 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11336 )

Change subject: IMPALA-6442: Misleading file offset reporting in error messages.
..


Patch Set 1:

Thanks a lot for the review Joe. After reading the code more carefully, I agree 
with your comment.


--
To view, visit http://gerrit.cloudera.org:8080/11336
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie78ef9b0e11b8505ab7bc965aafdb2729b0027c9
Gerrit-Change-Number: 11336
Gerrit-PatchSet: 1
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Brown 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Tue, 28 Aug 2018 03:23:50 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6442: Misleading file offset reporting in error messages.

2018-08-27 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11336 )

Change subject: IMPALA-6442: Misleading file offset reporting in error messages.
..


Patch Set 1:

> Hello,
 >
 > This is the Apache Impala project. Not everyone who contributes to
 > Apache Impala work at Cloudera, which mean not everyone who
 > contributes can see that link. Please use 
 > https://jenkins.impala.io/job/gerrit-verify-dryrun-external/
 > .

Thanks Michael, I'm looking into whether I can create a unit test. Will run 
test there next time.


--
To view, visit http://gerrit.cloudera.org:8080/11336
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie78ef9b0e11b8505ab7bc965aafdb2729b0027c9
Gerrit-Change-Number: 11336
Gerrit-PatchSet: 1
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Brown 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Mon, 27 Aug 2018 20:12:20 +
Gerrit-HasComments: No