[jira] [Created] (IMPALA-7272) impalad crash when Fatigue test

2018-07-09 Thread yyzzjj (JIRA)
yyzzjj created IMPALA-7272:
--

 Summary: impalad   crash when Fatigue test
 Key: IMPALA-7272
 URL: https://issues.apache.org/jira/browse/IMPALA-7272
 Project: IMPALA
  Issue Type: Bug
  Components: Backend
Affects Versions: Impala 2.12.0, Impala 2.11.0
 Environment: apache  branch  [329979d6fb0caa0dc449d7e0aa75460c30e868f0]

centos 6.5
Reporter: yyzzjj
 Attachments: e4386102-833c-40bb-4eec10b2-827c76be.dmp, 
impalad_node0.ERROR, impalad_node0.WARNING

(gdb) bt
#0 0x003269832635 in raise () from /lib64/libc.so.6
#1 0x003269833e15 in abort () from /lib64/libc.so.6
#2 0x04010f64 in google::DumpStackTraceAndExit() ()
#3 0x040079dd in google::LogMessage::Fail() ()
#4 0x04009282 in google::LogMessage::SendToLog() ()
#5 0x040073b7 in google::LogMessage::Flush() ()
#6 0x0400a97e in google::LogMessageFatal::~LogMessageFatal() ()
#7 0x01a2dfab in impala::MemPool::CheckIntegrity (this=0x5916e1f8, 
check_current_chunk_empty=true)
 at /export/ldb/online/kudu_rpc_branch/be/src/runtime/mem-pool.cc:258
#8 0x01a2cf56 in impala::MemPool::FindChunk (this=0x5916e1f8, 
min_size=10, check_limits=true) at 
/export/ldb/online/kudu_rpc_branch/be/src/runtime/mem-pool.cc:158
#9 0x01a3dd1b in impala::MemPool::Allocate (alignment=8, size=10, 
this=0x5916e1f8) at 
/export/ldb/online/kudu_rpc_branch/be/src/runtime/mem-pool.h:273
#10 impala::MemPool::TryAllocate (this=0x5916e1f8, size=10) at 
/export/ldb/online/kudu_rpc_branch/be/src/runtime/mem-pool.h:109
#11 0x01caefb8 in impala::StringBuffer::GrowBuffer 
(this=0x7f90d9489c28, new_size=10) at 
/export/ldb/online/kudu_rpc_branch/be/src/runtime/string-buffer.h:85
#12 0x01caee18 in impala::StringBuffer::Append (this=0x7f90d9489c28, 
str=0x7f92cda6e039 "1104700843don...@jd.com业务运营部\230\340\246͒\177", str_len=10)
 at /export/ldb/online/kudu_rpc_branch/be/src/runtime/string-buffer.h:53
#13 0x01cac864 in impala::StringMinMaxFilter::CopyToBuffer 
(this=0x7f90d9489c00, buffer=0x7f90d9489c28, value=0x7f90d9489c08, len=10)
 at /export/ldb/online/kudu_rpc_branch/be/src/util/min-max-filter.cc:304
#14 0x01cac2a9 in impala::StringMinMaxFilter::MaterializeValues 
(this=0x7f90d9489c00) at 
/export/ldb/online/kudu_rpc_branch/be/src/util/min-max-filter.cc:229
#15 0x02b9641a in impala::FilterContext::MaterializeValues 
(this=0x61cc0b70) at 
/export/ldb/online/kudu_rpc_branch/be/src/exec/filter-context.cc:97
#16 0x7f93fdb9440e in ?? ()
#17 0x7f90a97f5400 in ?? ()
#18 0x2acd2bba01a2e0f7 in ?? ()
#19 0x5916e140 in ?? ()
#20 0x7f930c34d740 in ?? ()
#21 0x7f90a97f5220 in ?? ()
#22 0x66aa77bb66aa77bb in ?? ()
#23 0x61cc0b70 in ?? ()
#24 0x61cc0b70 in ?? ()
#25 0x61cc0b98 in ?? ()
#26 0x61cc0b70 in ?? ()
#27 0x7f90a97f5300 in ?? ()
#28 0x01ab84ed in 
impala::RuntimeFilterBank::AllocateScratchMinMaxFilter (this=, 
 filter_id=, 
 type=) at 
/export/ldb/online/kudu_rpc_branch/be/src/runtime/runtime-filter-bank.cc:250
Backtrace stopped: previous frame inner to this frame (corrupt stack?)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IMPALA-7272) impalad crash when Fatigue test

2018-07-09 Thread yyzzjj (JIRA)
yyzzjj created IMPALA-7272:
--

 Summary: impalad   crash when Fatigue test
 Key: IMPALA-7272
 URL: https://issues.apache.org/jira/browse/IMPALA-7272
 Project: IMPALA
  Issue Type: Bug
  Components: Backend
Affects Versions: Impala 2.12.0, Impala 2.11.0
 Environment: apache  branch  [329979d6fb0caa0dc449d7e0aa75460c30e868f0]

centos 6.5
Reporter: yyzzjj
 Attachments: e4386102-833c-40bb-4eec10b2-827c76be.dmp, 
impalad_node0.ERROR, impalad_node0.WARNING

(gdb) bt
#0 0x003269832635 in raise () from /lib64/libc.so.6
#1 0x003269833e15 in abort () from /lib64/libc.so.6
#2 0x04010f64 in google::DumpStackTraceAndExit() ()
#3 0x040079dd in google::LogMessage::Fail() ()
#4 0x04009282 in google::LogMessage::SendToLog() ()
#5 0x040073b7 in google::LogMessage::Flush() ()
#6 0x0400a97e in google::LogMessageFatal::~LogMessageFatal() ()
#7 0x01a2dfab in impala::MemPool::CheckIntegrity (this=0x5916e1f8, 
check_current_chunk_empty=true)
 at /export/ldb/online/kudu_rpc_branch/be/src/runtime/mem-pool.cc:258
#8 0x01a2cf56 in impala::MemPool::FindChunk (this=0x5916e1f8, 
min_size=10, check_limits=true) at 
/export/ldb/online/kudu_rpc_branch/be/src/runtime/mem-pool.cc:158
#9 0x01a3dd1b in impala::MemPool::Allocate (alignment=8, size=10, 
this=0x5916e1f8) at 
/export/ldb/online/kudu_rpc_branch/be/src/runtime/mem-pool.h:273
#10 impala::MemPool::TryAllocate (this=0x5916e1f8, size=10) at 
/export/ldb/online/kudu_rpc_branch/be/src/runtime/mem-pool.h:109
#11 0x01caefb8 in impala::StringBuffer::GrowBuffer 
(this=0x7f90d9489c28, new_size=10) at 
/export/ldb/online/kudu_rpc_branch/be/src/runtime/string-buffer.h:85
#12 0x01caee18 in impala::StringBuffer::Append (this=0x7f90d9489c28, 
str=0x7f92cda6e039 "1104700843don...@jd.com业务运营部\230\340\246͒\177", str_len=10)
 at /export/ldb/online/kudu_rpc_branch/be/src/runtime/string-buffer.h:53
#13 0x01cac864 in impala::StringMinMaxFilter::CopyToBuffer 
(this=0x7f90d9489c00, buffer=0x7f90d9489c28, value=0x7f90d9489c08, len=10)
 at /export/ldb/online/kudu_rpc_branch/be/src/util/min-max-filter.cc:304
#14 0x01cac2a9 in impala::StringMinMaxFilter::MaterializeValues 
(this=0x7f90d9489c00) at 
/export/ldb/online/kudu_rpc_branch/be/src/util/min-max-filter.cc:229
#15 0x02b9641a in impala::FilterContext::MaterializeValues 
(this=0x61cc0b70) at 
/export/ldb/online/kudu_rpc_branch/be/src/exec/filter-context.cc:97
#16 0x7f93fdb9440e in ?? ()
#17 0x7f90a97f5400 in ?? ()
#18 0x2acd2bba01a2e0f7 in ?? ()
#19 0x5916e140 in ?? ()
#20 0x7f930c34d740 in ?? ()
#21 0x7f90a97f5220 in ?? ()
#22 0x66aa77bb66aa77bb in ?? ()
#23 0x61cc0b70 in ?? ()
#24 0x61cc0b70 in ?? ()
#25 0x61cc0b98 in ?? ()
#26 0x61cc0b70 in ?? ()
#27 0x7f90a97f5300 in ?? ()
#28 0x01ab84ed in 
impala::RuntimeFilterBank::AllocateScratchMinMaxFilter (this=, 
 filter_id=, 
 type=) at 
/export/ldb/online/kudu_rpc_branch/be/src/runtime/runtime-filter-bank.cc:250
Backtrace stopped: previous frame inner to this frame (corrupt stack?)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-7271) KRPC: cross-port caching of GetLoggedInUser

2018-07-09 Thread Todd Lipcon (JIRA)
Todd Lipcon created IMPALA-7271:
---

 Summary: KRPC: cross-port caching of GetLoggedInUser
 Key: IMPALA-7271
 URL: https://issues.apache.org/jira/browse/IMPALA-7271
 Project: IMPALA
  Issue Type: Bug
  Components: Backend
Affects Versions: Impala 2.12.0
Reporter: Todd Lipcon


In a production cluster I am seeing a high rate of 'mmap' calls under a 
workload which starts and stops a lot of exchanges. These are causing a lot 
contention for the mmap semaphore, which is then causing page faults to be 
rather slow. By running perf record -e syscalls:*mmap I can see that most of 
the calls are coming from KRPC's call to GetLoggedInUser when initting a Proxy 
object. This was fixed by https://gerrit.cloudera.org/#/c/9895/ which needs to 
be cherry-picked over.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Work started] (IMPALA-7186) Docs for kudu_read_mode

2018-07-09 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-7186 started by Alex Rodoni.
---
> Docs for kudu_read_mode
> ---
>
> Key: IMPALA-7186
> URL: https://issues.apache.org/jira/browse/IMPALA-7186
> Project: IMPALA
>  Issue Type: Task
>  Components: Docs
>Affects Versions: Impala 3.1.0
>Reporter: Thomas Tauber-Marshall
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: docs, future_release_doc
>
> IMPALA-6812 added a new query option, KUDU_READ_MODE, which should be 
> documented with something like:
> KUDU_READ_MODE Query Option
> This query option allows users to set a desired consistency level for scans 
> of Kudu tables. Possible values are DEFAULT, READ_LATEST, and 
> READ_AT_SNAPSHOT. If DEFAULT is specified, the value of the startup flag 
> '--kudu_read_mode' will be used.
> READ_LATEST
> Kudu provides no consistency guarantees for this mode, expect that all 
> returned rows were committed at some point, sometimes known as 'Read 
> Committed' isolation.
> READ_AT_SNAPSHOT
> Kudu will take a snapshot of the current state of the data and perform the 
> scan over the snapshot, possibly after briefly waiting for ongoing writes to 
> complete. This provides "Read Your Writes" consistency within a single Impala 
> session, except in the case of a Kudu leader change. See the Kudu 
> documentation for more details.
> Type: string
> Default: DEFAULT
> Added in: Impala 3.1



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-7244) Impala 3.1 Doc: Remove unsupported format write support

2018-07-09 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni reassigned IMPALA-7244:
---

Assignee: Bikramjeet Vig  (was: Alex Rodoni)

> Impala 3.1 Doc: Remove unsupported format write support
> ---
>
> Key: IMPALA-7244
> URL: https://issues.apache.org/jira/browse/IMPALA-7244
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Docs
>Affects Versions: Impala 3.1.0
>Reporter: Bikramjeet Vig
>Assignee: Bikramjeet Vig
>Priority: Major
>  Labels: future_release_doc
>
> the parent task removes write support for unsupported formats like 
> *Sequence,* *Avro and compressed text*. Also, the related query options 
> *ALLOW_UNSUPPORTED_FORMATS* and *SEQ_COMPRESSION_MODE* have been migrated to 
> the 'REMOVED' query options type and therefore are no longer functional.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Comment Edited] (IMPALA-7269) Single quote within double-quoted string not matching

2018-07-09 Thread Fredy Wijaya (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537713#comment-16537713
 ] 

Fredy Wijaya edited comment on IMPALA-7269 at 7/9/18 10:32 PM:
---

There has been couple bug fixes related to single/double quotes in Impala 
shell. I tried to reproduce the issue on the latest Impala and was unable to 
reproduce it.
{noformat}
[localhost:21000] default> create table test_table(publisher string);
Query: create table test_table(publisher string)
+-+
| summary |
+-+
| Table has been created. |
+-+
Fetched 1 row(s) in 0.03s
[localhost:21000] default> insert into test_table values("O'Reilly");
Query: insert into test_table values("O'Reilly")
Modified 1 row(s) in 3.32s

[localhost:21000] default> select * from test_table where publisher = 
"O'Reilly";
Query: select * from test_table where publisher = "O'Reilly"
+---+
| publisher |
+---+
| O'Reilly  |
+---+
Fetched 1 row(s) in 0.11s
[localhost:21000] default> select * from test_table where publisher = 
'O\'Reilly';
Query: select * from test_table where publisher = 'O\'Reilly'
+---+
| publisher |
+---+
| O'Reilly  |
+---+
Fetched 1 row(s) in 0.11s
{noformat}
Are you able to reproduce the issue in the latest Impala?


was (Author: fredyw):
There has been couple bug fixes related to single/double quotes in Impala 
shell. I tried to reproduce on the latest Impala and was unable to reproduce 
the issue.

{noformat}
[localhost:21000] default> create table test_table(publisher string);
Query: create table test_table(publisher string)
+-+
| summary |
+-+
| Table has been created. |
+-+
Fetched 1 row(s) in 0.03s
[localhost:21000] default> insert into test_table values("O'Reilly");
Query: insert into test_table values("O'Reilly")
Modified 1 row(s) in 3.32s

[localhost:21000] default> select * from test_table where publisher = 
"O'Reilly";
Query: select * from test_table where publisher = "O'Reilly"
+---+
| publisher |
+---+
| O'Reilly  |
+---+
Fetched 1 row(s) in 0.11s
[localhost:21000] default> select * from test_table where publisher = 
'O\'Reilly';
Query: select * from test_table where publisher = 'O\'Reilly'
+---+
| publisher |
+---+
| O'Reilly  |
+---+
Fetched 1 row(s) in 0.11s
{noformat}

Are you able to reproduce the issue in the latest Impala?


> Single quote within double-quoted string not matching
> -
>
> Key: IMPALA-7269
> URL: https://issues.apache.org/jira/browse/IMPALA-7269
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 2.10.0
>Reporter: Eric Karnowski
>Priority: Major
>
> I'm using Impala to query data with a string value:
> {{O'Reilly}}
> When I query this using double quotes to handle the single quote, it accepts 
> the query but returns no rows. I have to escape the single quote with a 
> backslash for it to match.
> Similarly, 
> {{SELECT "O'Reilly"}}
> returns simply O
>  
> I believe this is the same issue as 
> https://issues.apache.org/jira/browse/IMPALA-1081 and 
> https://issues.apache.org/jira/browse/IMPALA-3153 but these were closed as 
> "cannot reproduce".



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-7269) Single quote within double-quoted string not matching

2018-07-09 Thread Fredy Wijaya (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537713#comment-16537713
 ] 

Fredy Wijaya commented on IMPALA-7269:
--

There has been couple bug fixes related to single/double quotes in Impala 
shell. I tried to reproduce on the latest Impala and was unable to reproduce 
the issue.

{noformat}
[localhost:21000] default> create table test_table(publisher string);
Query: create table test_table(publisher string)
+-+
| summary |
+-+
| Table has been created. |
+-+
Fetched 1 row(s) in 0.03s
[localhost:21000] default> insert into test_table values("O'Reilly");
Query: insert into test_table values("O'Reilly")
Modified 1 row(s) in 3.32s

[localhost:21000] default> select * from test_table where publisher = 
"O'Reilly";
Query: select * from test_table where publisher = "O'Reilly"
+---+
| publisher |
+---+
| O'Reilly  |
+---+
Fetched 1 row(s) in 0.11s
[localhost:21000] default> select * from test_table where publisher = 
'O\'Reilly';
Query: select * from test_table where publisher = 'O\'Reilly'
+---+
| publisher |
+---+
| O'Reilly  |
+---+
Fetched 1 row(s) in 0.11s
{noformat}

Are you able to reproduce the issue in the latest Impala?


> Single quote within double-quoted string not matching
> -
>
> Key: IMPALA-7269
> URL: https://issues.apache.org/jira/browse/IMPALA-7269
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 2.10.0
>Reporter: Eric Karnowski
>Priority: Major
>
> I'm using Impala to query data with a string value:
> {{O'Reilly}}
> When I query this using double quotes to handle the single quote, it accepts 
> the query but returns no rows. I have to escape the single quote with a 
> backslash for it to match.
> Similarly, 
> {{SELECT "O'Reilly"}}
> returns simply O
>  
> I believe this is the same issue as 
> https://issues.apache.org/jira/browse/IMPALA-1081 and 
> https://issues.apache.org/jira/browse/IMPALA-3153 but these were closed as 
> "cannot reproduce".



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-7270) Improve the decimal fuzz test

2018-07-09 Thread Taras Bobrovytsky (JIRA)
Taras Bobrovytsky created IMPALA-7270:
-

 Summary: Improve the decimal fuzz test
 Key: IMPALA-7270
 URL: https://issues.apache.org/jira/browse/IMPALA-7270
 Project: IMPALA
  Issue Type: Bug
  Components: Infrastructure
Reporter: Taras Bobrovytsky


The decimal fuzz test has been helpful in finding bugs in the decimal data 
type. However, currently it only supports arithmetic operations. Here are some 
ideas about how it can be expanded:
 * Add binary predicates (greater than, less than, etc)
 * Add operations with other data types for example, (decimal + bigint), 
(bigint <= int).
 * More complex expressions (d1 + d2 / d3 % d4)
 * Add other decimal functions, such as round(), ceil()



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IMPALA-7270) Improve the decimal fuzz test

2018-07-09 Thread Taras Bobrovytsky (JIRA)
Taras Bobrovytsky created IMPALA-7270:
-

 Summary: Improve the decimal fuzz test
 Key: IMPALA-7270
 URL: https://issues.apache.org/jira/browse/IMPALA-7270
 Project: IMPALA
  Issue Type: Bug
  Components: Infrastructure
Reporter: Taras Bobrovytsky


The decimal fuzz test has been helpful in finding bugs in the decimal data 
type. However, currently it only supports arithmetic operations. Here are some 
ideas about how it can be expanded:
 * Add binary predicates (greater than, less than, etc)
 * Add operations with other data types for example, (decimal + bigint), 
(bigint <= int).
 * More complex expressions (d1 + d2 / d3 % d4)
 * Add other decimal functions, such as round(), ceil()



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Comment Edited] (IMPALA-7269) Single quote within double-quoted string not matching

2018-07-09 Thread Ian Cook (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537609#comment-16537609
 ] 

Ian Cook edited comment on IMPALA-7269 at 7/9/18 9:22 PM:
--

This issue also affects expressions in the {{WHERE}} clause that compare string 
columns with double-quoted literal strings. For example, when querying a table 
with a {{STRING}} column named {{publisher}} with the value {{O'Reilly}} in one 
or more row, the query

{{SELECT * FROM _table_ WHERE publisher = "O'Reilly";}}

returns zero rows, but the query

{{SELECT * FROM _table_ WHERE publisher = "O\'Reilly";}}

(with a backslash added before the apostrophe) will return the rows as expected.


was (Author: icook):
This issue also affects expressions in the {{WHERE}} clause that compare string 
columns with double-quoted literal strings. For example, when querying a table 
with a {{STRING}} column named {{publisher}} with the value {{O'Reilly}} in one 
or more row, the query

{{SELECT * FROM _table_ WHERE publisher = "O'Reilly";}}

returns zero rows, but the query

{{ SELECT * FROM _table_ WHERE publisher = "O\'Reilly";}}

(with a backslash added before the apostrophe) will return the rows as expected.

> Single quote within double-quoted string not matching
> -
>
> Key: IMPALA-7269
> URL: https://issues.apache.org/jira/browse/IMPALA-7269
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 2.10.0
>Reporter: Eric Karnowski
>Priority: Major
>
> I'm using Impala to query data with a string value:
> {{O'Reilly}}
> When I query this using double quotes to handle the single quote, it accepts 
> the query but returns no rows. I have to escape the single quote with a 
> backslash for it to match.
> Similarly, 
> {{SELECT "O'Reilly"}}
> returns simply O
>  
> I believe this is the same issue as 
> https://issues.apache.org/jira/browse/IMPALA-1081 and 
> https://issues.apache.org/jira/browse/IMPALA-3153 but these were closed as 
> "cannot reproduce".



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-7269) Single quote within double-quoted string not matching

2018-07-09 Thread Ian Cook (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537609#comment-16537609
 ] 

Ian Cook commented on IMPALA-7269:
--

This issue also affects expressions in the {{WHERE}} clause that compare string 
columns with double-quoted literal strings. For example, when querying a table 
with a {{STRING}} column named {{publisher}} with the value {{O'Reilly}} in one 
or more row, the query

{{SELECT * FROM _table_ WHERE publisher = "O'Reilly";}}

returns zero rows, but the query

{{ SELECT * FROM _table_ WHERE publisher = "O\'Reilly";}}

(with a backslash added before the apostrophe) will return the rows as expected.

> Single quote within double-quoted string not matching
> -
>
> Key: IMPALA-7269
> URL: https://issues.apache.org/jira/browse/IMPALA-7269
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 2.10.0
>Reporter: Eric Karnowski
>Priority: Major
>
> I'm using Impala to query data with a string value:
> {{O'Reilly}}
> When I query this using double quotes to handle the single quote, it accepts 
> the query but returns no rows. I have to escape the single quote with a 
> backslash for it to match.
> Similarly, 
> {{SELECT "O'Reilly"}}
> returns simply O
>  
> I believe this is the same issue as 
> https://issues.apache.org/jira/browse/IMPALA-1081 and 
> https://issues.apache.org/jira/browse/IMPALA-3153 but these were closed as 
> "cannot reproduce".



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-7269) Single quote within double-quoted string not matching

2018-07-09 Thread Eric Karnowski (JIRA)
Eric Karnowski created IMPALA-7269:
--

 Summary: Single quote within double-quoted string not matching
 Key: IMPALA-7269
 URL: https://issues.apache.org/jira/browse/IMPALA-7269
 Project: IMPALA
  Issue Type: Bug
Affects Versions: Impala 2.10.0
Reporter: Eric Karnowski


I'm using Impala to query data with a string value:

{{O'Reilly}}

When I query this using double quotes to handle the single quote, it accepts 
the query but returns no rows. I have to escape the single quote with a 
backslash for it to match.

Similarly, 

{{SELECT "O'Reilly"}}

returns simply O

 

I believe this is the same issue as 
https://issues.apache.org/jira/browse/IMPALA-1081 and 
https://issues.apache.org/jira/browse/IMPALA-3153 but these were closed as 
"cannot reproduce".



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-7269) Single quote within double-quoted string not matching

2018-07-09 Thread Eric Karnowski (JIRA)
Eric Karnowski created IMPALA-7269:
--

 Summary: Single quote within double-quoted string not matching
 Key: IMPALA-7269
 URL: https://issues.apache.org/jira/browse/IMPALA-7269
 Project: IMPALA
  Issue Type: Bug
Affects Versions: Impala 2.10.0
Reporter: Eric Karnowski


I'm using Impala to query data with a string value:

{{O'Reilly}}

When I query this using double quotes to handle the single quote, it accepts 
the query but returns no rows. I have to escape the single quote with a 
backslash for it to match.

Similarly, 

{{SELECT "O'Reilly"}}

returns simply O

 

I believe this is the same issue as 
https://issues.apache.org/jira/browse/IMPALA-1081 and 
https://issues.apache.org/jira/browse/IMPALA-3153 but these were closed as 
"cannot reproduce".



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IMPALA-4690) conv() needs substantially more documentation

2018-07-09 Thread Alex Rodoni (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-4690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537562#comment-16537562
 ] 

Alex Rodoni commented on IMPALA-4690:
-

[~jbapple] Is this something you are saving for newbies as the label indicates?

> conv() needs substantially more documentation
> -
>
> Key: IMPALA-4690
> URL: https://issues.apache.org/jira/browse/IMPALA-4690
> Project: IMPALA
>  Issue Type: Bug
>  Components: Docs
>Affects Versions: Impala 2.8.0
>Reporter: Jim Apple
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: newbie
>
> The documentation for {{conv()}} doesn't say what the limits are on the 
> bases, what the output format is for digits greater than 9, or what negative 
> bases do. grep for {{MathConversionFunctions}} to see some examples.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-7214) Lots of misleading/incorrect use of DataNode in Impala docs

2018-07-09 Thread Alex Rodoni (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537542#comment-16537542
 ] 

Alex Rodoni commented on IMPALA-7214:
-

[~tarmstrong] Besides, the example you gave, how do you recommend we fix the 
doc errors in a systematic way?

I ran "grep -i datanode", and a good number of doc entries were returned, 
including many "datanodes running impala daemons".

> Lots of misleading/incorrect use of DataNode in Impala docs
> ---
>
> Key: IMPALA-7214
> URL: https://issues.apache.org/jira/browse/IMPALA-7214
> Project: IMPALA
>  Issue Type: Bug
>  Components: Docs
>Affects Versions: Impala 2.12.0
>Reporter: Tim Armstrong
>Assignee: Alex Rodoni
>Priority: Major
>
> The docs tend to conflate DataNodes (a HDFS service) and Impala daemons. I 
> think this stems from the original deployment practice of always colocating 
> Impala daemons with HDFS datanodes so that HDFS data could always be read 
> from a local DataNode. 
> I'm a bit pedantic so the conflation feels wrong to me regardless, but I 
> think this will become increasingly confusing as alternative deployments 
> without colocated HDFS DataNodes become more common (e.g. running against S3, 
> running with a separate HDFS service).
> E.g. picking an example at random:
> {noformat}
> In Impala 1.4.0 and higher, the LIMIT clause is now 
> optional (rather than required) for
> queries that use the ORDER BY clause. Impala 
> automatically uses a temporary disk work area
> to perform the sort if the sort operation would otherwise exceed the 
> Impala memory limit for a particular
> DataNode.
> {noformat}
> This is wrong because the memory limit is for an Impala daemon, which is the 
> process that does the actual sorting. So here I think it should be "Impala 
> daemon" instead of "DataNode".



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-7214) Lots of misleading/incorrect use of DataNode in Impala docs

2018-07-09 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-7214 started by Alex Rodoni.
---
> Lots of misleading/incorrect use of DataNode in Impala docs
> ---
>
> Key: IMPALA-7214
> URL: https://issues.apache.org/jira/browse/IMPALA-7214
> Project: IMPALA
>  Issue Type: Bug
>  Components: Docs
>Affects Versions: Impala 2.12.0
>Reporter: Tim Armstrong
>Assignee: Alex Rodoni
>Priority: Major
>
> The docs tend to conflate DataNodes (a HDFS service) and Impala daemons. I 
> think this stems from the original deployment practice of always colocating 
> Impala daemons with HDFS datanodes so that HDFS data could always be read 
> from a local DataNode. 
> I'm a bit pedantic so the conflation feels wrong to me regardless, but I 
> think this will become increasingly confusing as alternative deployments 
> without colocated HDFS DataNodes become more common (e.g. running against S3, 
> running with a separate HDFS service).
> E.g. picking an example at random:
> {noformat}
> In Impala 1.4.0 and higher, the LIMIT clause is now 
> optional (rather than required) for
> queries that use the ORDER BY clause. Impala 
> automatically uses a temporary disk work area
> to perform the sort if the sort operation would otherwise exceed the 
> Impala memory limit for a particular
> DataNode.
> {noformat}
> This is wrong because the memory limit is for an Impala daemon, which is the 
> process that does the actual sorting. So here I think it should be "Impala 
> daemon" instead of "DataNode".



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-7214) Lots of misleading/incorrect use of DataNode in Impala docs

2018-07-09 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni updated IMPALA-7214:

Issue Type: Bug  (was: Documentation)

> Lots of misleading/incorrect use of DataNode in Impala docs
> ---
>
> Key: IMPALA-7214
> URL: https://issues.apache.org/jira/browse/IMPALA-7214
> Project: IMPALA
>  Issue Type: Bug
>  Components: Docs
>Affects Versions: Impala 2.12.0
>Reporter: Tim Armstrong
>Assignee: Alex Rodoni
>Priority: Major
>
> The docs tend to conflate DataNodes (a HDFS service) and Impala daemons. I 
> think this stems from the original deployment practice of always colocating 
> Impala daemons with HDFS datanodes so that HDFS data could always be read 
> from a local DataNode. 
> I'm a bit pedantic so the conflation feels wrong to me regardless, but I 
> think this will become increasingly confusing as alternative deployments 
> without colocated HDFS DataNodes become more common (e.g. running against S3, 
> running with a separate HDFS service).
> E.g. picking an example at random:
> {noformat}
> In Impala 1.4.0 and higher, the LIMIT clause is now 
> optional (rather than required) for
> queries that use the ORDER BY clause. Impala 
> automatically uses a temporary disk work area
> to perform the sort if the sort operation would otherwise exceed the 
> Impala memory limit for a particular
> DataNode.
> {noformat}
> This is wrong because the memory limit is for an Impala daemon, which is the 
> process that does the actual sorting. So here I think it should be "Impala 
> daemon" instead of "DataNode".



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-6223) Gracefully handle malformed 'with' queries in impala-shell

2018-07-09 Thread Pooja Nilangekar (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-6223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pooja Nilangekar resolved IMPALA-6223.
--
Resolution: Fixed

> Gracefully handle malformed 'with' queries in impala-shell
> --
>
> Key: IMPALA-6223
> URL: https://issues.apache.org/jira/browse/IMPALA-6223
> Project: IMPALA
>  Issue Type: Bug
>  Components: Clients
>Affects Versions: Impala 2.10.0
>Reporter: bharath v
>Assignee: Pooja Nilangekar
>Priority: Minor
>  Labels: newbie
>
> Impala shell can throw a lexer error if it encounters a malformed "with" 
> query.
> {noformat}
> impala-shell.sh -q "with foo as (select bar from temp where temp.a='"
> Starting Impala Shell without Kerberos authentication
> Connected to localhost:21000
> Server version: impalad version 2.11.0-SNAPSHOT DEBUG (build 
> 0ee1765f38082bc5c10aa37b23cb8e57caa57d4e)
> Traceback (most recent call last):
>   File "/home/bharath/Impala/shell/impala_shell.py", line 1463, in 
> execute_queries_non_interactive_mode(options, query_options)
>   File "/home/bharath/Impala/shell/impala_shell.py", line 1338, in 
> execute_queries_non_interactive_mode
> shell.execute_query_list(queries)):
>   File "/home/bharath/Impala/shell/impala_shell.py", line 1218, in 
> execute_query_list
> if self.onecmd(q) is CmdStatus.ERROR:
>   File "/home/bharath/Impala/shell/impala_shell.py", line 505, in onecmd
> return cmd.Cmd.onecmd(self, line)
>   File "/usr/lib/python2.7/cmd.py", line 221, in onecmd
> return func(arg)
>   File "/home/bharath/Impala/shell/impala_shell.py", line 1024, in do_with
> tokens = list(lexer)
>   File "/usr/lib/python2.7/shlex.py", line 269, in next
> token = self.get_token()
>   File "/usr/lib/python2.7/shlex.py", line 96, in get_token
> raw = self.read_token()
>   File "/usr/lib/python2.7/shlex.py", line 172, in read_token
> raise ValueError, "No closing quotation"
> ValueError: No closing quotation
> {noformat}
> This happens because we use shlex to parse the input query to determine if 
> its a DML and it can throw if the input doesn't have balanced quotes.
> {noformat}
> def do_with(self, args):
> """Executes a query with a WITH clause, fetching all rows"""
> query = self.imp_client.create_beeswax_query("with %s" % args,
>  self.set_query_options)
> # Set posix=True and add "'" to escaped quotes
> # to deal with escaped quotes in string literals
> lexer = shlex.shlex(query.query.lstrip(), posix=True)
> lexer.escapedquotes += "'"
> # Because the WITH clause may precede DML or SELECT queries,
> # just checking the first token is insufficient.
> is_dml = False
> tokens = list(lexer)  <
> {noformat}
> A simple shlex repro of that is as follows,
> {noformat}
> >>> lexer = shlex.shlex("with foo as (select bar from temp where temp.a='", 
> >>> posix=True);
> >>> list(lexer)
> Traceback (most recent call last):
>   File "", line 1, in 
>   File "/usr/lib/python2.7/shlex.py", line 269, in next
> token = self.get_token()
>   File "/usr/lib/python2.7/shlex.py", line 96, in get_token
> raw = self.read_token()
>   File "/usr/lib/python2.7/shlex.py", line 172, in read_token
> raise ValueError, "No closing quotation"
> ValueError: No closing quotation
> {noformat}
> Fix: Either catch the exception and handle it gracefully or have a better way 
> to figure out the query type, using a SQL parser (more involved).
> This query also repros it:
> {code}
> with v as (select 1)
> select foo('\\'), ('bar
> ;
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-6223) Gracefully handle malformed 'with' queries in impala-shell

2018-07-09 Thread Pooja Nilangekar (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-6223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pooja Nilangekar resolved IMPALA-6223.
--
Resolution: Fixed

> Gracefully handle malformed 'with' queries in impala-shell
> --
>
> Key: IMPALA-6223
> URL: https://issues.apache.org/jira/browse/IMPALA-6223
> Project: IMPALA
>  Issue Type: Bug
>  Components: Clients
>Affects Versions: Impala 2.10.0
>Reporter: bharath v
>Assignee: Pooja Nilangekar
>Priority: Minor
>  Labels: newbie
>
> Impala shell can throw a lexer error if it encounters a malformed "with" 
> query.
> {noformat}
> impala-shell.sh -q "with foo as (select bar from temp where temp.a='"
> Starting Impala Shell without Kerberos authentication
> Connected to localhost:21000
> Server version: impalad version 2.11.0-SNAPSHOT DEBUG (build 
> 0ee1765f38082bc5c10aa37b23cb8e57caa57d4e)
> Traceback (most recent call last):
>   File "/home/bharath/Impala/shell/impala_shell.py", line 1463, in 
> execute_queries_non_interactive_mode(options, query_options)
>   File "/home/bharath/Impala/shell/impala_shell.py", line 1338, in 
> execute_queries_non_interactive_mode
> shell.execute_query_list(queries)):
>   File "/home/bharath/Impala/shell/impala_shell.py", line 1218, in 
> execute_query_list
> if self.onecmd(q) is CmdStatus.ERROR:
>   File "/home/bharath/Impala/shell/impala_shell.py", line 505, in onecmd
> return cmd.Cmd.onecmd(self, line)
>   File "/usr/lib/python2.7/cmd.py", line 221, in onecmd
> return func(arg)
>   File "/home/bharath/Impala/shell/impala_shell.py", line 1024, in do_with
> tokens = list(lexer)
>   File "/usr/lib/python2.7/shlex.py", line 269, in next
> token = self.get_token()
>   File "/usr/lib/python2.7/shlex.py", line 96, in get_token
> raw = self.read_token()
>   File "/usr/lib/python2.7/shlex.py", line 172, in read_token
> raise ValueError, "No closing quotation"
> ValueError: No closing quotation
> {noformat}
> This happens because we use shlex to parse the input query to determine if 
> its a DML and it can throw if the input doesn't have balanced quotes.
> {noformat}
> def do_with(self, args):
> """Executes a query with a WITH clause, fetching all rows"""
> query = self.imp_client.create_beeswax_query("with %s" % args,
>  self.set_query_options)
> # Set posix=True and add "'" to escaped quotes
> # to deal with escaped quotes in string literals
> lexer = shlex.shlex(query.query.lstrip(), posix=True)
> lexer.escapedquotes += "'"
> # Because the WITH clause may precede DML or SELECT queries,
> # just checking the first token is insufficient.
> is_dml = False
> tokens = list(lexer)  <
> {noformat}
> A simple shlex repro of that is as follows,
> {noformat}
> >>> lexer = shlex.shlex("with foo as (select bar from temp where temp.a='", 
> >>> posix=True);
> >>> list(lexer)
> Traceback (most recent call last):
>   File "", line 1, in 
>   File "/usr/lib/python2.7/shlex.py", line 269, in next
> token = self.get_token()
>   File "/usr/lib/python2.7/shlex.py", line 96, in get_token
> raw = self.read_token()
>   File "/usr/lib/python2.7/shlex.py", line 172, in read_token
> raise ValueError, "No closing quotation"
> ValueError: No closing quotation
> {noformat}
> Fix: Either catch the exception and handle it gracefully or have a better way 
> to figure out the query type, using a SQL parser (more involved).
> This query also repros it:
> {code}
> with v as (select 1)
> select foo('\\'), ('bar
> ;
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (IMPALA-6031) Distributed plan describes coordinator-only nodes as scanning

2018-07-09 Thread Pooja Nilangekar (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-6031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pooja Nilangekar resolved IMPALA-6031.
--
Resolution: Fixed

> Distributed plan describes coordinator-only nodes as scanning
> -
>
> Key: IMPALA-6031
> URL: https://issues.apache.org/jira/browse/IMPALA-6031
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Affects Versions: Impala 2.11.0
>Reporter: Jim Apple
>Assignee: Pooja Nilangekar
>Priority: Major
>
> In a cluster with one coordinator-only node and three executor-only nodes:
> {noformat}
> Query: explain select count(*) from web_sales a, web_sales b where 
> a.ws_order_number = b.ws_order_number and a.ws_item_sk = b.ws_item_sk
> +--+
> | Explain String  
>  |
> +--+
> | Per-Host Resource Reservation: Memory=136.00MB  
>  |
> | Per-Host Resource Estimates: Memory=3.04GB  
>  |
> | 
>  |
> | F03:PLAN FRAGMENT [UNPARTITIONED] hosts=1 instances=1   
>  |
> |   PLAN-ROOT SINK
>  |
> |   |  mem-estimate=0B mem-reservation=0B 
>  |
> |   | 
>  |
> |   07:AGGREGATE [FINALIZE]   
>  |
> |   |  output: count:merge(*) 
>  |
> |   |  mem-estimate=10.00MB mem-reservation=0B
>  |
> |   |  tuple-ids=2 row-size=8B cardinality=1  
>  |
> |   | 
>  |
> |   06:EXCHANGE [UNPARTITIONED]   
>  |
> |  mem-estimate=0B mem-reservation=0B 
>  |
> |  tuple-ids=2 row-size=8B cardinality=1  
>  |
> | 
>  |
> | F02:PLAN FRAGMENT [HASH(a.ws_item_sk,a.ws_order_number)] hosts=4 
> instances=4 |
> |   DATASTREAM SINK [FRAGMENT=F03, EXCHANGE=06, UNPARTITIONED]
>  |
> |   |  mem-estimate=0B mem-reservation=0B 
>  |
> |   03:AGGREGATE  
>  |
> |   |  output: count(*)   
>  |
> |   |  mem-estimate=10.00MB mem-reservation=0B
>  |
> |   |  tuple-ids=2 row-size=8B cardinality=1  
>  |
> |   | 
>  |
> |   02:HASH JOIN [INNER JOIN, PARTITIONED]
>  |
> |   |  hash predicates: a.ws_item_sk = b.ws_item_sk, a.ws_order_number = 
> b.ws_order_number |
> |   |  runtime filters: RF000 <- b.ws_item_sk, RF001 <- b.ws_order_number 
>  |
> |   |  mem-estimate=2.95GB mem-reservation=136.00MB   
>  |
> |   |  tuple-ids=0,1 row-size=32B cardinality=72376   
>  |
> |   | 
>  |
> |   |--05:EXCHANGE [HASH(b.ws_item_sk,b.ws_order_number)] 
>  |
> |   | mem-estimate=0B mem-reservation=0B  
>  |
> |   | tuple-ids=1 row-size=16B cardinality=72376  
>  |
> |   | 
>  |
> |   04:EXCHANGE [HASH(a.ws_item_sk,a.ws_order_number)]
>  |
> |  mem-estimate=0B mem-reservation=0B 
>  |
> |  tuple-ids=0 row-size=16B cardinality=72376 
>  |
> | 
>  |
> | F00:PLAN FRAGMENT [RANDOM] hosts=4 instances=4  

[jira] [Resolved] (IMPALA-6031) Distributed plan describes coordinator-only nodes as scanning

2018-07-09 Thread Pooja Nilangekar (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-6031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pooja Nilangekar resolved IMPALA-6031.
--
Resolution: Fixed

> Distributed plan describes coordinator-only nodes as scanning
> -
>
> Key: IMPALA-6031
> URL: https://issues.apache.org/jira/browse/IMPALA-6031
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Affects Versions: Impala 2.11.0
>Reporter: Jim Apple
>Assignee: Pooja Nilangekar
>Priority: Major
>
> In a cluster with one coordinator-only node and three executor-only nodes:
> {noformat}
> Query: explain select count(*) from web_sales a, web_sales b where 
> a.ws_order_number = b.ws_order_number and a.ws_item_sk = b.ws_item_sk
> +--+
> | Explain String  
>  |
> +--+
> | Per-Host Resource Reservation: Memory=136.00MB  
>  |
> | Per-Host Resource Estimates: Memory=3.04GB  
>  |
> | 
>  |
> | F03:PLAN FRAGMENT [UNPARTITIONED] hosts=1 instances=1   
>  |
> |   PLAN-ROOT SINK
>  |
> |   |  mem-estimate=0B mem-reservation=0B 
>  |
> |   | 
>  |
> |   07:AGGREGATE [FINALIZE]   
>  |
> |   |  output: count:merge(*) 
>  |
> |   |  mem-estimate=10.00MB mem-reservation=0B
>  |
> |   |  tuple-ids=2 row-size=8B cardinality=1  
>  |
> |   | 
>  |
> |   06:EXCHANGE [UNPARTITIONED]   
>  |
> |  mem-estimate=0B mem-reservation=0B 
>  |
> |  tuple-ids=2 row-size=8B cardinality=1  
>  |
> | 
>  |
> | F02:PLAN FRAGMENT [HASH(a.ws_item_sk,a.ws_order_number)] hosts=4 
> instances=4 |
> |   DATASTREAM SINK [FRAGMENT=F03, EXCHANGE=06, UNPARTITIONED]
>  |
> |   |  mem-estimate=0B mem-reservation=0B 
>  |
> |   03:AGGREGATE  
>  |
> |   |  output: count(*)   
>  |
> |   |  mem-estimate=10.00MB mem-reservation=0B
>  |
> |   |  tuple-ids=2 row-size=8B cardinality=1  
>  |
> |   | 
>  |
> |   02:HASH JOIN [INNER JOIN, PARTITIONED]
>  |
> |   |  hash predicates: a.ws_item_sk = b.ws_item_sk, a.ws_order_number = 
> b.ws_order_number |
> |   |  runtime filters: RF000 <- b.ws_item_sk, RF001 <- b.ws_order_number 
>  |
> |   |  mem-estimate=2.95GB mem-reservation=136.00MB   
>  |
> |   |  tuple-ids=0,1 row-size=32B cardinality=72376   
>  |
> |   | 
>  |
> |   |--05:EXCHANGE [HASH(b.ws_item_sk,b.ws_order_number)] 
>  |
> |   | mem-estimate=0B mem-reservation=0B  
>  |
> |   | tuple-ids=1 row-size=16B cardinality=72376  
>  |
> |   | 
>  |
> |   04:EXCHANGE [HASH(a.ws_item_sk,a.ws_order_number)]
>  |
> |  mem-estimate=0B mem-reservation=0B 
>  |
> |  tuple-ids=0 row-size=16B cardinality=72376 
>  |
> | 
>  |
> | F00:PLAN FRAGMENT [RANDOM] hosts=4 instances=4  

[jira] [Commented] (IMPALA-7075) Add support for object ownership for Impala

2018-07-09 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537470#comment-16537470
 ] 

ASF subversion and git services commented on IMPALA-7075:
-

Commit 54b3c607f027c445fc8d52acc4d3b1073e84aa77 in impala's branch 
refs/heads/master from [~fredyw]
[ https://git-wip-us.apache.org/repos/asf?p=impala.git;h=54b3c60 ]

IMPALA-6988: Implement ALTER TABLE/VIEW SET OWNER

Alter the table/view owner to either user or role.

On table/view creation, the table/view owner will be set to
the current user, which can be viewed via DESCRIBE FORMATTED
command. Having an owner information allows implementing a
feature where an owner can be given certain privileges
automatically upon a table/view creation. See IMPALA-7075.
The ALTER TABLE/VIEW SET OWNER will be useful commands for
transferring ownership (a set of owner privileges) from the
current owner to another owner.

Syntax:
ALTER TABLE table SET OWNER USER user
ALTER TABLE table SET OWNER ROLE role

ALTER VIEW view SET OWNER USER user
ALTER VIEW view SET OWNER ROLE role

Testing:
- Added new FE tests
- Added new E2E tests

Change-Id: Ia1b75b1590b16eb0c2ba326d07ee3fd9897c27d1
Reviewed-on: http://gerrit.cloudera.org:8080/10822
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 


> Add support for object ownership for Impala
> ---
>
> Key: IMPALA-7075
> URL: https://issues.apache.org/jira/browse/IMPALA-7075
> Project: IMPALA
>  Issue Type: Epic
>  Components: Frontend
>Reporter: Fredy Wijaya
>Priority: Major
>  Labels: security
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-6918) Implement COMMENT ON COLUMN

2018-07-09 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-6918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537474#comment-16537474
 ] 

ASF subversion and git services commented on IMPALA-6918:
-

Commit 2fddd39f0eb8cb4d476f35d1fe68f481444eb19b in impala's branch 
refs/heads/master from [~fredyw]
[ https://git-wip-us.apache.org/repos/asf?p=impala.git;h=2fddd39 ]

IMPALA-6918: [DOCS] COMMENT ON COLUMN privileges

Change-Id: Ib2b3621ecc8113a165ec36beaf82cbc1d480bc54
Reviewed-on: http://gerrit.cloudera.org:8080/10890
Reviewed-by: Alex Rodoni 
Tested-by: Impala Public Jenkins 


> Implement COMMENT ON COLUMN
> ---
>
> Key: IMPALA-6918
> URL: https://issues.apache.org/jira/browse/IMPALA-6918
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Frontend
>Reporter: Fredy Wijaya
>Assignee: Fredy Wijaya
>Priority: Minor
> Fix For: Impala 3.1.0
>
>
> Syntax:
> {noformat}
> COMMENT ON COLUMN my_table.my_column IS 'Employee ID Number';{noformat}
>  
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-3956) Impala shell variable substitution should ignore comments embedded in query.

2018-07-09 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-3956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537473#comment-16537473
 ] 

ASF subversion and git services commented on IMPALA-3956:
-

Commit a70c05ee2d295669707a2ab69d632f5b9cfab1cf in impala's branch 
refs/heads/master from [~arodoni_cloudera]
[ https://git-wip-us.apache.org/repos/asf?p=impala.git;h=a70c05e ]

IMPALA-3956: [DOCS]  Escape variables with '\' in impala-shell

Change-Id: Ifb95785a143939a94d55d3565364afe1e26c1f3d
Reviewed-on: http://gerrit.cloudera.org:8080/10861
Reviewed-by: Adam Holley 
Reviewed-by: Fredy Wijaya 
Tested-by: Impala Public Jenkins 


> Impala shell variable substitution should ignore comments embedded in query.
> 
>
> Key: IMPALA-3956
> URL: https://issues.apache.org/jira/browse/IMPALA-3956
> Project: IMPALA
>  Issue Type: Bug
>  Components: Clients, Docs
>Affects Versions: Impala 2.6.0
>Reporter: Huaisi Xu
>Assignee: Alex Rodoni
>Priority: Minor
>  Labels: regression, usability
> Fix For: Impala 3.1.0
>
>
> {code:java}
> -- WHERE a >= "${start_date}"
> select
> *
> from --fda;
> test
> -- WHEREa >= "${start_date}"
> {code}
> {code:java}
> Starting Impala Shell without Kerberos authentication
> Connected to localhost:21000
> Server version: impalad version 2.7.0-cdh5-INTERNAL DEBUG (build 
> ebd65a142d3f3f4087eb1c9aaf25d53d2045a4cb)
> Error: Unknown substitution syntax (START_DATE). Use ${VAR:var_name}.
> Could not execute command: -- WHERE a >= "${start_date}"
> select
> *
> from --fda;
> test
> -- WHERE a >= "${start_date}"
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-6223) Gracefully handle malformed 'with' queries in impala-shell

2018-07-09 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-6223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537471#comment-16537471
 ] 

ASF subversion and git services commented on IMPALA-6223:
-

Commit 28162117ad462dfe5fd608d4c09ba02ad213285b in impala's branch 
refs/heads/master from poojanilangekar
[ https://git-wip-us.apache.org/repos/asf?p=impala.git;h=2816211 ]

IMPALA-6223: Gracefully handle malformed 'with' queries in impala-shell

The change handles the exception thrown by shlex while parsing a
malformed query.

This patch was tested by adding both commandline and interactive
shell tests.

Change-Id: Ibb1e9238ac67b8ad3b2caa1748a18b04f384802d
Reviewed-on: http://gerrit.cloudera.org:8080/10876
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 


> Gracefully handle malformed 'with' queries in impala-shell
> --
>
> Key: IMPALA-6223
> URL: https://issues.apache.org/jira/browse/IMPALA-6223
> Project: IMPALA
>  Issue Type: Bug
>  Components: Clients
>Affects Versions: Impala 2.10.0
>Reporter: bharath v
>Assignee: Pooja Nilangekar
>Priority: Minor
>  Labels: newbie
>
> Impala shell can throw a lexer error if it encounters a malformed "with" 
> query.
> {noformat}
> impala-shell.sh -q "with foo as (select bar from temp where temp.a='"
> Starting Impala Shell without Kerberos authentication
> Connected to localhost:21000
> Server version: impalad version 2.11.0-SNAPSHOT DEBUG (build 
> 0ee1765f38082bc5c10aa37b23cb8e57caa57d4e)
> Traceback (most recent call last):
>   File "/home/bharath/Impala/shell/impala_shell.py", line 1463, in 
> execute_queries_non_interactive_mode(options, query_options)
>   File "/home/bharath/Impala/shell/impala_shell.py", line 1338, in 
> execute_queries_non_interactive_mode
> shell.execute_query_list(queries)):
>   File "/home/bharath/Impala/shell/impala_shell.py", line 1218, in 
> execute_query_list
> if self.onecmd(q) is CmdStatus.ERROR:
>   File "/home/bharath/Impala/shell/impala_shell.py", line 505, in onecmd
> return cmd.Cmd.onecmd(self, line)
>   File "/usr/lib/python2.7/cmd.py", line 221, in onecmd
> return func(arg)
>   File "/home/bharath/Impala/shell/impala_shell.py", line 1024, in do_with
> tokens = list(lexer)
>   File "/usr/lib/python2.7/shlex.py", line 269, in next
> token = self.get_token()
>   File "/usr/lib/python2.7/shlex.py", line 96, in get_token
> raw = self.read_token()
>   File "/usr/lib/python2.7/shlex.py", line 172, in read_token
> raise ValueError, "No closing quotation"
> ValueError: No closing quotation
> {noformat}
> This happens because we use shlex to parse the input query to determine if 
> its a DML and it can throw if the input doesn't have balanced quotes.
> {noformat}
> def do_with(self, args):
> """Executes a query with a WITH clause, fetching all rows"""
> query = self.imp_client.create_beeswax_query("with %s" % args,
>  self.set_query_options)
> # Set posix=True and add "'" to escaped quotes
> # to deal with escaped quotes in string literals
> lexer = shlex.shlex(query.query.lstrip(), posix=True)
> lexer.escapedquotes += "'"
> # Because the WITH clause may precede DML or SELECT queries,
> # just checking the first token is insufficient.
> is_dml = False
> tokens = list(lexer)  <
> {noformat}
> A simple shlex repro of that is as follows,
> {noformat}
> >>> lexer = shlex.shlex("with foo as (select bar from temp where temp.a='", 
> >>> posix=True);
> >>> list(lexer)
> Traceback (most recent call last):
>   File "", line 1, in 
>   File "/usr/lib/python2.7/shlex.py", line 269, in next
> token = self.get_token()
>   File "/usr/lib/python2.7/shlex.py", line 96, in get_token
> raw = self.read_token()
>   File "/usr/lib/python2.7/shlex.py", line 172, in read_token
> raise ValueError, "No closing quotation"
> ValueError: No closing quotation
> {noformat}
> Fix: Either catch the exception and handle it gracefully or have a better way 
> to figure out the query type, using a SQL parser (more involved).
> This query also repros it:
> {code}
> with v as (select 1)
> select foo('\\'), ('bar
> ;
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-6031) Distributed plan describes coordinator-only nodes as scanning

2018-07-09 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-6031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537472#comment-16537472
 ] 

ASF subversion and git services commented on IMPALA-6031:
-

Commit 880011fa1f2fc48f9972ad3c673a4bff838cc5ce in impala's branch 
refs/heads/master from poojanilangekar
[ https://git-wip-us.apache.org/repos/asf?p=impala.git;h=880011f ]

IMPALA-6031: Fix executor node count in distributed plans

Prior to this change, the planner also considered coordinator-only
nodes as executors while estimating the number of scan nodes to be
used in the distributed plan. This change ensures that only
executor nodes are considered for that estimation.

Testing:
Added a new custom cluster test to verify the same.

Change-Id: I44af6b40099a495e13a0a5dc72c491d486d23aa8
Reviewed-on: http://gerrit.cloudera.org:8080/10873
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 


> Distributed plan describes coordinator-only nodes as scanning
> -
>
> Key: IMPALA-6031
> URL: https://issues.apache.org/jira/browse/IMPALA-6031
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Affects Versions: Impala 2.11.0
>Reporter: Jim Apple
>Assignee: Pooja Nilangekar
>Priority: Major
>
> In a cluster with one coordinator-only node and three executor-only nodes:
> {noformat}
> Query: explain select count(*) from web_sales a, web_sales b where 
> a.ws_order_number = b.ws_order_number and a.ws_item_sk = b.ws_item_sk
> +--+
> | Explain String  
>  |
> +--+
> | Per-Host Resource Reservation: Memory=136.00MB  
>  |
> | Per-Host Resource Estimates: Memory=3.04GB  
>  |
> | 
>  |
> | F03:PLAN FRAGMENT [UNPARTITIONED] hosts=1 instances=1   
>  |
> |   PLAN-ROOT SINK
>  |
> |   |  mem-estimate=0B mem-reservation=0B 
>  |
> |   | 
>  |
> |   07:AGGREGATE [FINALIZE]   
>  |
> |   |  output: count:merge(*) 
>  |
> |   |  mem-estimate=10.00MB mem-reservation=0B
>  |
> |   |  tuple-ids=2 row-size=8B cardinality=1  
>  |
> |   | 
>  |
> |   06:EXCHANGE [UNPARTITIONED]   
>  |
> |  mem-estimate=0B mem-reservation=0B 
>  |
> |  tuple-ids=2 row-size=8B cardinality=1  
>  |
> | 
>  |
> | F02:PLAN FRAGMENT [HASH(a.ws_item_sk,a.ws_order_number)] hosts=4 
> instances=4 |
> |   DATASTREAM SINK [FRAGMENT=F03, EXCHANGE=06, UNPARTITIONED]
>  |
> |   |  mem-estimate=0B mem-reservation=0B 
>  |
> |   03:AGGREGATE  
>  |
> |   |  output: count(*)   
>  |
> |   |  mem-estimate=10.00MB mem-reservation=0B
>  |
> |   |  tuple-ids=2 row-size=8B cardinality=1  
>  |
> |   | 
>  |
> |   02:HASH JOIN [INNER JOIN, PARTITIONED]
>  |
> |   |  hash predicates: a.ws_item_sk = b.ws_item_sk, a.ws_order_number = 
> b.ws_order_number |
> |   |  runtime filters: RF000 <- b.ws_item_sk, RF001 <- b.ws_order_number 
>  |
> |   |  mem-estimate=2.95GB mem-reservation=136.00MB   
>  |
> |   |  tuple-ids=0,1 row-size=32B cardinality=72376   
>  |
> |   | 
>  |
> |   |--05:EXCHANGE [HASH(b.ws_item_sk,b.ws_order_number)] 

[jira] [Commented] (IMPALA-7178) Reduce logging for common data errors

2018-07-09 Thread Csaba Ringhofer (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537429#comment-16537429
 ] 

Csaba Ringhofer commented on IMPALA-7178:
-

If the log message is the same for every line, then it could be solved by a 
similar solution. Currently the offset is always (?) the same, but this seems 
to be a bug. If the offset would be replaced with column name, then it could be 
aggregated in a similar way as "out-of-range parquet timestamps".

> Reduce logging for common data errors
> -
>
> Key: IMPALA-7178
> URL: https://issues.apache.org/jira/browse/IMPALA-7178
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Backend
>Reporter: Csaba Ringhofer
>Assignee: Csaba Ringhofer
>Priority: Major
>
> Some data errors (for example out-of-range parquet timestamps) can dominate 
> logs if a table contains a large number of rows with invalid data. If an 
> error has its own error code (see common/thrift/generate_error_codes.py), 
> then these errors are already aggregated to the user 
> (RuntimeState::LogError()) for every query, but the logs will contain a new 
> line for every occurrence. This is not too useful most of times, as the log 
> lines will repeat the same information (the corrupt data itself is not logged 
> as it can be sensitive information).
> The best would to reduce logging without loosing information:
> - the first occurrence of an error should be logged (per 
> query/fragment/table/file/column) to help the investigation of cases where 
> the data error leads to other errors and to avoid breaking log analyzer tools 
> that search for the current format
> - other occurrences can be aggregated, like "in query Q table T column C XY 
> error occurred N times"
> An extra goal is to avoid calling RuntimeState::LogError() for other 
> occurrences than the first one, as RuntimeState::LogError() uses a (per 
> fragment) lock.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-7268) Rebase gutil on top of Kudu's latest version

2018-07-09 Thread Lars Volker (JIRA)
Lars Volker created IMPALA-7268:
---

 Summary: Rebase gutil on top of Kudu's latest version
 Key: IMPALA-7268
 URL: https://issues.apache.org/jira/browse/IMPALA-7268
 Project: IMPALA
  Issue Type: Improvement
  Components: Backend
Affects Versions: Impala 3.1.0
Reporter: Lars Volker






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-7268) Rebase gutil on top of Kudu's latest version

2018-07-09 Thread Lars Volker (JIRA)
Lars Volker created IMPALA-7268:
---

 Summary: Rebase gutil on top of Kudu's latest version
 Key: IMPALA-7268
 URL: https://issues.apache.org/jira/browse/IMPALA-7268
 Project: IMPALA
  Issue Type: Improvement
  Components: Backend
Affects Versions: Impala 3.1.0
Reporter: Lars Volker






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IMPALA-7267) Handle EINTR for syscalls before dumping threads

2018-07-09 Thread Lars Volker (JIRA)
Lars Volker created IMPALA-7267:
---

 Summary: Handle EINTR for syscalls before dumping threads
 Key: IMPALA-7267
 URL: https://issues.apache.org/jira/browse/IMPALA-7267
 Project: IMPALA
  Issue Type: Improvement
  Components: Backend
Affects Versions: Impala 3.1.0
Reporter: Lars Volker


Before using Kudu's support to dump threads, we need to wrap all of our 
syscalls in RETRY_ON_EINTR. See these changes for more context:

https://gerrit.cloudera.org/#/q/RETRY_ON_EINTR+status:merged+project:kudu+branch:master



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IMPALA-7267) Handle EINTR for syscalls before dumping threads

2018-07-09 Thread Lars Volker (JIRA)
Lars Volker created IMPALA-7267:
---

 Summary: Handle EINTR for syscalls before dumping threads
 Key: IMPALA-7267
 URL: https://issues.apache.org/jira/browse/IMPALA-7267
 Project: IMPALA
  Issue Type: Improvement
  Components: Backend
Affects Versions: Impala 3.1.0
Reporter: Lars Volker


Before using Kudu's support to dump threads, we need to wrap all of our 
syscalls in RETRY_ON_EINTR. See these changes for more context:

https://gerrit.cloudera.org/#/q/RETRY_ON_EINTR+status:merged+project:kudu+branch:master



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-6677) Impala Doc: Doc next_day function

2018-07-09 Thread Alex Rodoni (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-6677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537399#comment-16537399
 ] 

Alex Rodoni commented on IMPALA-6677:
-

https://gerrit.cloudera.org/#/c/10893/

> Impala Doc: Doc next_day function
> -
>
> Key: IMPALA-6677
> URL: https://issues.apache.org/jira/browse/IMPALA-6677
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Docs
>Affects Versions: Impala 2.11.0
>Reporter: Alex Rodoni
>Assignee: Alex Rodoni
>Priority: Major
>
>  +[http://gerrit.cloudera.org:8080/1943]+



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-7266) test_insert failure: Unable to drop partition on s3

2018-07-09 Thread Bikramjeet Vig (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bikramjeet Vig reassigned IMPALA-7266:
--

Assignee: Sailesh Mukil

> test_insert failure: Unable to drop partition on s3
> ---
>
> Key: IMPALA-7266
> URL: https://issues.apache.org/jira/browse/IMPALA-7266
> Project: IMPALA
>  Issue Type: Bug
>  Components: Infrastructure
>Affects Versions: Impala 3.1.0
>Reporter: Bikramjeet Vig
>Assignee: Sailesh Mukil
>Priority: Major
>  Labels: broken-build
>
> {noformat}
> 06:01:28 === FAILURES 
> ===
> 06:01:28  TestInsertQueries.test_insert[exec_option: {'sync_ddl': 0, 
> 'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 0, 
> 'disable_codegen': False, 'abort_on_error': 1, 'debug_action': None, 
> 'exec_single_node_rows_threshold': 0} | table_format: text/none] 
> 06:01:28 query_test/test_insert.py:122: in test_insert
> 06:01:28 multiple_impalad=vector.get_value('exec_option')['sync_ddl'] == 
> 1)
> 06:01:28 .../Impala/tests/common/impala_test_suite.py:366: in run_test_case
> 06:01:28 self.execute_test_case_setup(test_section['SETUP'], 
> table_format_info)
> 06:01:28 .../Impala/tests/common/impala_test_suite.py:489: in 
> execute_test_case_setup
> 06:01:28 self.__drop_partitions(db_name, table_name)
> 06:01:28 .../Impala/tests/common/impala_test_suite.py:614: in 
> __drop_partitions
> 06:01:28 partition, True), 'Could not drop partition: %s' % partition
> 06:01:28 .../Impala/shell/gen-py/hive_metastore/ThriftHiveMetastore.py:2862: 
> in drop_partition_by_name
> 06:01:28 return self.recv_drop_partition_by_name()
> 06:01:28 .../Impala/shell/gen-py/hive_metastore/ThriftHiveMetastore.py:2891: 
> in recv_drop_partition_by_name
> 06:01:28 raise result.o2
> 06:01:28 E   MetaException: MetaException(_message='No such file or 
> directory: 
> s3a://impala-cdh5-s3-test/test-warehouse/functional.db/alltypesinsert/year=2009/month=4')
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-6971) Log flooding with unhelpful parsing error message

2018-07-09 Thread Csaba Ringhofer (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-6971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537391#comment-16537391
 ] 

Csaba Ringhofer commented on IMPALA-6971:
-

IMPALA-5993 seems to deal with the same error message. There is a fix on review 
( https://gerrit.cloudera.org/#/c/8747/ ), but there was no update since some 
time.

> Log flooding with unhelpful parsing error message
> -
>
> Key: IMPALA-6971
> URL: https://issues.apache.org/jira/browse/IMPALA-6971
> Project: IMPALA
>  Issue Type: Improvement
>Affects Versions: Impala 3.0, Impala 2.12.0
>Reporter: Zoram Thanga
>Priority: Minor
>  Labels: supportability
>
> Once in a while I've seen Impala flood the log file with messages similar to 
> the following:
> {noformat}
> I0318 08:20:28.566603 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error parsing row: file: 
> hdfs://nameservice/user/hive/warehouse/mydb/mytable/part-m-0, before 
> offset: 2122317824
> I0318 08:20:28.566643 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error parsing row: file: 
> hdfs://nameservice/user/hive/warehouse/mydb/mytable/part-m-0, before 
> offset: 2122317824
> I0318 08:20:28.566676 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error parsing row: file: 
> hdfs://nameservice/user/hive/warehouse/mydb/mytable/part-m-0, before 
> offset: 2122317824
> I0318 08:20:28.566709 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error parsing row: file: 
> hdfs://nameservice/user/hive/warehouse/mydb/mytable/part-m-0, before 
> offset: 2122317824
> I0318 08:20:28.566742 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error parsing row: file: 
> hdfs://nameservice/user/hive/warehouse/mydb/mytable/part-m-0, before 
> offset: 2122317824
> I0318 08:20:28.566787 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error parsing row: file: 
> hdfs://nameservice/user/hive/warehouse/mydb/mytable/part-m-0, before 
> offset: 2122317824
> I0318 08:20:28.566820 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error parsing row: file: 
> hdfs://nameservice/user/hive/warehouse/mydb/mytable/part-m-0, before 
> offset: 2122317824
> I0318 08:20:28.566856 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error parsing row: file: 
> hdfs://nameservice/user/hive/warehouse/mydb/mytable/part-m-0, before 
> offset: 2122317824
> I0318 08:20:28.566890 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error parsing row: file: 
> hdfs://nameservice/user/hive/warehouse/mydb/mytable/part-m-0, before 
> offset: 2122317824
> I0318 08:20:28.566923 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error parsing row: file: 
> hdfs://nameservice/user/hive/warehouse/mydb/mytable/part-m-0, before 
> offset: 2122317824
> I0318 08:20:28.566959 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error parsing row: file: 
> hdfs://nameservice/user/hive/warehouse/mydb/mytable/part-m-0, before 
> offset: 2122317824
> I0318 08:20:28.566998 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error parsing row: file: 
> hdfs://nameservice/user/hive/warehouse/mydb/mytable/part-m-0, before 
> offset: 2122317824
> I0318 08:20:28.567039 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error parsing row: file: 
> hdfs://nameservice/user/hive/warehouse/mydb/mytable/part-m-0, before 
> offset: 2122317824
> I0318 08:20:28.567070 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error parsing row: file: 
> hdfs://nameservice/user/hive/warehouse/mydb/mytable/part-m-0, before 
> offset: 2122317824
> I0318 08:20:28.567101 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error parsing row: file: 
> hdfs://nameservice/user/hive/warehouse/mydb/mytable/part-m-0, before 
> offset: 2122317824
> I0318 08:20:28.567132 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error parsing row: file: 
> hdfs://nameservice/user/hive/warehouse/mydb/mytable/part-m-0, before 
> offset: 2122317824
> I0318 08:20:28.567173 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error parsing row: file: 
> hdfs://nameservice/user/hive/warehouse/mydb/mytable/part-m-0, before 
> offset: 2122317824
> I0318 08:20:28.567209 984564 runtime-state.cc:168] Error from query 
> cf41e4ee4f7e1273:bcc41038: Error 

[jira] [Created] (IMPALA-7266) test_insert failure: Unable to drop partition on s3

2018-07-09 Thread Bikramjeet Vig (JIRA)
Bikramjeet Vig created IMPALA-7266:
--

 Summary: test_insert failure: Unable to drop partition on s3
 Key: IMPALA-7266
 URL: https://issues.apache.org/jira/browse/IMPALA-7266
 Project: IMPALA
  Issue Type: Bug
  Components: Infrastructure
Affects Versions: Impala 3.1.0
Reporter: Bikramjeet Vig


{noformat}
06:01:28 === FAILURES 
===
06:01:28  TestInsertQueries.test_insert[exec_option: {'sync_ddl': 0, 
'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 0, 
'disable_codegen': False, 'abort_on_error': 1, 'debug_action': None, 
'exec_single_node_rows_threshold': 0} | table_format: text/none] 
06:01:28 query_test/test_insert.py:122: in test_insert
06:01:28 multiple_impalad=vector.get_value('exec_option')['sync_ddl'] == 1)
06:01:28 .../Impala/tests/common/impala_test_suite.py:366: in run_test_case
06:01:28 self.execute_test_case_setup(test_section['SETUP'], 
table_format_info)
06:01:28 .../Impala/tests/common/impala_test_suite.py:489: in 
execute_test_case_setup
06:01:28 self.__drop_partitions(db_name, table_name)
06:01:28 .../Impala/tests/common/impala_test_suite.py:614: in __drop_partitions
06:01:28 partition, True), 'Could not drop partition: %s' % partition
06:01:28 .../Impala/shell/gen-py/hive_metastore/ThriftHiveMetastore.py:2862: in 
drop_partition_by_name
06:01:28 return self.recv_drop_partition_by_name()
06:01:28 .../Impala/shell/gen-py/hive_metastore/ThriftHiveMetastore.py:2891: in 
recv_drop_partition_by_name
06:01:28 raise result.o2
06:01:28 E   MetaException: MetaException(_message='No such file or directory: 
s3a://impala-cdh5-s3-test/test-warehouse/functional.db/alltypesinsert/year=2009/month=4')
{noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IMPALA-7266) test_insert failure: Unable to drop partition on s3

2018-07-09 Thread Bikramjeet Vig (JIRA)
Bikramjeet Vig created IMPALA-7266:
--

 Summary: test_insert failure: Unable to drop partition on s3
 Key: IMPALA-7266
 URL: https://issues.apache.org/jira/browse/IMPALA-7266
 Project: IMPALA
  Issue Type: Bug
  Components: Infrastructure
Affects Versions: Impala 3.1.0
Reporter: Bikramjeet Vig


{noformat}
06:01:28 === FAILURES 
===
06:01:28  TestInsertQueries.test_insert[exec_option: {'sync_ddl': 0, 
'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 0, 
'disable_codegen': False, 'abort_on_error': 1, 'debug_action': None, 
'exec_single_node_rows_threshold': 0} | table_format: text/none] 
06:01:28 query_test/test_insert.py:122: in test_insert
06:01:28 multiple_impalad=vector.get_value('exec_option')['sync_ddl'] == 1)
06:01:28 .../Impala/tests/common/impala_test_suite.py:366: in run_test_case
06:01:28 self.execute_test_case_setup(test_section['SETUP'], 
table_format_info)
06:01:28 .../Impala/tests/common/impala_test_suite.py:489: in 
execute_test_case_setup
06:01:28 self.__drop_partitions(db_name, table_name)
06:01:28 .../Impala/tests/common/impala_test_suite.py:614: in __drop_partitions
06:01:28 partition, True), 'Could not drop partition: %s' % partition
06:01:28 .../Impala/shell/gen-py/hive_metastore/ThriftHiveMetastore.py:2862: in 
drop_partition_by_name
06:01:28 return self.recv_drop_partition_by_name()
06:01:28 .../Impala/shell/gen-py/hive_metastore/ThriftHiveMetastore.py:2891: in 
recv_drop_partition_by_name
06:01:28 raise result.o2
06:01:28 E   MetaException: MetaException(_message='No such file or directory: 
s3a://impala-cdh5-s3-test/test-warehouse/functional.db/alltypesinsert/year=2009/month=4')
{noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Comment Edited] (IMPALA-3933) Backward compatibility between Hive and Impala TIMESTAMP set of timezone rules across these different libraries

2018-07-09 Thread Csaba Ringhofer (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-3933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537373#comment-16537373
 ] 

Csaba Ringhofer edited comment on IMPALA-3933 at 7/9/18 6:35 PM:
-

[~attilaj] [~zi]
I tried to reproduce this issue hoping that it is already resolved by 
IMPALA-3307 - it turned out that it is now reproducible not just with 
convert_legacy_hive_parquet_utc_timestamps, but also with from_utc_timestamp:

select from_utc_timestamp('1890-09-30 00:00:00', 'Europe/Budapest') 
result: 1890-09-30 01:16:20
select from_utc_timestamp('1890-10-01 00:00:00', 'Europe/Budapest')
result: 1890-10-01 01:00:00

Note that the issue is not reproducible with "CET", so "CET" and 
"Europe/Budapest" are actually different.
It may be related that CET was introduced in Hungary in 1890 (according to 
https://en.wikipedia.org/wiki/Central_European_Time). I tried Europe/Belgrade, 
where CET was introduced in 1884, and the skew starts there accordingly.
Meanwhile in Hive the result is UTC+1 even before 1890. There may be a 
difference between how IANA and Hive's timezone db works, and the new timezone 
db of Impala and glibc may work similarly in this regard (both are IANA based).

What makes this issue a bit worrying is that it breaks ordering:
select from_utc_timestamp('1890-09-30 22:40:00', 'Europe/Budapest') 
result: 1890-09-30 23:56:20
select from_utc_timestamp('1890-09-30 22:45:00', 'Europe/Budapest');
result: 1890-09-30 23:45:00

This may lead to scenarios where Hive thinks that some data is ordered, but it 
turns out to be unordered after Impala converts it to local time.


was (Author: csringhofer):
[~attilaj] [~zi]
I tried to reproduce this issue hoping that it is already resolved by 
IMPALA-3307 - it turned out that it is now reproducible not just with 
convert_legacy_hive_parquet_utc_timestamps, but also with from_utc_timestamp:

select from_utc_timestamp('1890-09-30 00:00:00', 'Europe/Budapest') 
result: 1890-09-30 01:16:20
select from_utc_timestamp('1890-10-01 00:00:00', 'Europe/Budapest')
result: 1890-10-01 01:00:00

Note that the issue is not reproducible with "CET", so "CET" and 
"Europe/Budapest" are actually different.
It may be related that CET was introduced in Hungary in 1890 (according to 
https://en.wikipedia.org/wiki/Central_European_Time). I tried Europe/Belgrade, 
where CET was introduced in 1884, and the skew starts there accordingly.
Meanwhile in Hive the result is UTC+1 even before 1890. There may be a 
difference between how IANA and Hive's timezone db works, and the new timezone 
db of Impala and glibc may works similarly in this regard (both are IANA based).

What makes this issue a bit worrying is that it breaks ordering:
select from_utc_timestamp('1890-09-30 22:40:00', 'Europe/Budapest') 
result: 1890-09-30 23:56:20
select from_utc_timestamp('1890-09-30 22:45:00', 'Europe/Budapest');
result: 1890-09-30 23:45:00

This may lead to scenarios where Hive thinks that some data is ordered, but it 
turns out to be unordered after Impala converts it to local time.

> Backward compatibility between Hive and Impala TIMESTAMP set of timezone 
> rules across these different libraries
> ---
>
> Key: IMPALA-3933
> URL: https://issues.apache.org/jira/browse/IMPALA-3933
> Project: IMPALA
>  Issue Type: New Feature
>  Components: Backend
>Affects Versions: impala 2.3
>Reporter: Adriano Simone
>Priority: Minor
>
> How the TIMESTAMP skew with convert_legacy_hive_parquet_utc_timestamps=true
> Enabling --convert_legacy_hive_parquet_utc_timestamps=true seems to cause 
> data skew (improper converting) upon the reading for dates earlier than 1900 
> (not sure about the exact date).
> The following example was run on a server which is in CEST timezone, thus the 
> time difference is GMT+1 for dates before 1900 (I'm not sure, I haven't 
> checked the exact starting date of DST computation), and GMT+2 when summer 
> daylight saving time was applied.
> create table itst (col1 int, myts timestamp) stored as parquet;
> From impala:
> {code:java}
> insert into itst values (1,'2016-04-15 12:34:45');
> insert into itst values (2,'1949-04-15 12:34:45');
> insert into itst values (3,'1753-04-15 12:34:45');
> insert into itst values (4,'1752-04-15 12:34:45');
> {code}
> from hive
> {code:java}
> insert into itst values (5,'2016-04-15 12:34:45');
> insert into itst values (6,'1949-04-15 12:34:45');
> insert into itst values (7,'1753-04-15 12:34:45');
> insert into itst values (8,'1752-04-15 12:34:45');
> {code}
> From impala
> {code:java}
> select * from itst order by col1;
> {code}
> Result:
> {code:java}
> Query: select * from itst
> +--+-+
> | col1 | myts|
> 

[jira] [Commented] (IMPALA-3933) Backward compatibility between Hive and Impala TIMESTAMP set of timezone rules across these different libraries

2018-07-09 Thread Csaba Ringhofer (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-3933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537373#comment-16537373
 ] 

Csaba Ringhofer commented on IMPALA-3933:
-

[~attilaj] [~zi]
I tried to reproduce this issue hoping that it is already resolved by 
IMPALA-3307 - it turned out that it is now reproducible not just with 
convert_legacy_hive_parquet_utc_timestamps, but also with from_utc_timestamp:

select from_utc_timestamp('1890-09-30 00:00:00', 'Europe/Budapest') 
result: 1890-09-30 01:16:20
select from_utc_timestamp('1890-10-01 00:00:00', 'Europe/Budapest')
result: 1890-10-01 01:00:00

Note that the issue is not reproducible with "CET", so "CET" and 
"Europe/Budapest" are actually different.
It may be related that CET was introduced in Hungary in 1890 (according to 
https://en.wikipedia.org/wiki/Central_European_Time). I tried Europe/Belgrade, 
where CET was introduced in 1884, and the skew starts there accordingly.
Meanwhile in Hive the result is UTC+1 even before 1890. There may be a 
difference between how IANA and Hive's timezone db works, and the new timezone 
db of Impala and glibc may works similarly in this regard (both are IANA based).

What makes this issue a bit worrying is that it breaks ordering:
select from_utc_timestamp('1890-09-30 22:40:00', 'Europe/Budapest') 
result: 1890-09-30 23:56:20
select from_utc_timestamp('1890-09-30 22:45:00', 'Europe/Budapest');
result: 1890-09-30 23:45:00

This may lead to scenarios where Hive thinks that some data is ordered, but it 
turns out to be unordered after Impala converts it to local time.

> Backward compatibility between Hive and Impala TIMESTAMP set of timezone 
> rules across these different libraries
> ---
>
> Key: IMPALA-3933
> URL: https://issues.apache.org/jira/browse/IMPALA-3933
> Project: IMPALA
>  Issue Type: New Feature
>  Components: Backend
>Affects Versions: impala 2.3
>Reporter: Adriano Simone
>Priority: Minor
>
> How the TIMESTAMP skew with convert_legacy_hive_parquet_utc_timestamps=true
> Enabling --convert_legacy_hive_parquet_utc_timestamps=true seems to cause 
> data skew (improper converting) upon the reading for dates earlier than 1900 
> (not sure about the exact date).
> The following example was run on a server which is in CEST timezone, thus the 
> time difference is GMT+1 for dates before 1900 (I'm not sure, I haven't 
> checked the exact starting date of DST computation), and GMT+2 when summer 
> daylight saving time was applied.
> create table itst (col1 int, myts timestamp) stored as parquet;
> From impala:
> {code:java}
> insert into itst values (1,'2016-04-15 12:34:45');
> insert into itst values (2,'1949-04-15 12:34:45');
> insert into itst values (3,'1753-04-15 12:34:45');
> insert into itst values (4,'1752-04-15 12:34:45');
> {code}
> from hive
> {code:java}
> insert into itst values (5,'2016-04-15 12:34:45');
> insert into itst values (6,'1949-04-15 12:34:45');
> insert into itst values (7,'1753-04-15 12:34:45');
> insert into itst values (8,'1752-04-15 12:34:45');
> {code}
> From impala
> {code:java}
> select * from itst order by col1;
> {code}
> Result:
> {code:java}
> Query: select * from itst
> +--+-+
> | col1 | myts|
> +--+-+
> | 1| 2016-04-15 12:34:45 |
> | 2| 1949-04-15 12:34:45 |
> | 3| 1753-04-15 12:34:45 |
> | 4| 1752-04-15 12:34:45 |
> | 5| 2016-04-15 10:34:45 |
> | 6| 1949-04-15 10:34:45 |
> | 7| 1753-04-15 11:34:45 |
> | 8| 1752-04-15 11:34:45 |
> +--+-+
> {code}
> The timestamps are looking good, the DST differences can be seen (hive 
> inserted it in local time, but impala shows it in UTC)
> From impala after setting the command line argument 
> "--convert_legacy_hive_parquet_utc_timestamps=true"
> {code:java}
> select * from itst order by col1;
> {code}
> The result in this case:
> {code:java}
> Query: select * from itst order by col1
> +--+-+
> | col1 | myts|
> +--+-+
> | 1| 2016-04-15 12:34:45 |
> | 2| 1949-04-15 12:34:45 |
> | 3| 1753-04-15 12:34:45 |
> | 4| 1752-04-15 12:34:45 |
> | 5| 2016-04-15 12:34:45 |
> | 6| 1949-04-15 12:34:45 |
> | 7| 1753-04-15 12:51:05 |
> | 8| 1752-04-15 12:51:05 |
> +--+-+
> {code}
> It seems that instead of 11:34:45 it is showing 12:51:05.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-6065) Document IMPALA-5743

2018-07-09 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-6065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni updated IMPALA-6065:

Issue Type: Bug  (was: Documentation)

> Document IMPALA-5743
> 
>
> Key: IMPALA-6065
> URL: https://issues.apache.org/jira/browse/IMPALA-6065
> Project: IMPALA
>  Issue Type: Bug
>  Components: Docs
>Affects Versions: Impala 2.10.0
>Reporter: Tim Armstrong
>Assignee: Alex Rodoni
>Priority: Major
>
> It looks like --ssl_minimum_version went into Impala 2.10 but didn't have any 
> accompanying docs. I believe a common use case is to disallow TLS1.0 and 1.1 
> with --ssl_minimum_version=tlsv1.2



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-7262) Impala Doc: Quiesce impalad without shutting down

2018-07-09 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni updated IMPALA-7262:

Labels: future_release_doc  (was: )

> Impala Doc: Quiesce impalad without shutting down
> -
>
> Key: IMPALA-7262
> URL: https://issues.apache.org/jira/browse/IMPALA-7262
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Docs
>Reporter: Alex Rodoni
>Assignee: Alex Rodoni
>Priority: Major
>  Labels: future_release_doc
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Comment Edited] (IMPALA-7178) Reduce logging for common data errors

2018-07-09 Thread Zoram Thanga (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537325#comment-16537325
 ] 

Zoram Thanga edited comment on IMPALA-7178 at 7/9/18 5:47 PM:
--

I wonder if IMPALA-6971 is an instance of this log flooding?


was (Author: zoram):
I wonder if IMPALA-6996 is an instance of this log flooding?

> Reduce logging for common data errors
> -
>
> Key: IMPALA-7178
> URL: https://issues.apache.org/jira/browse/IMPALA-7178
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Backend
>Reporter: Csaba Ringhofer
>Assignee: Csaba Ringhofer
>Priority: Major
>
> Some data errors (for example out-of-range parquet timestamps) can dominate 
> logs if a table contains a large number of rows with invalid data. If an 
> error has its own error code (see common/thrift/generate_error_codes.py), 
> then these errors are already aggregated to the user 
> (RuntimeState::LogError()) for every query, but the logs will contain a new 
> line for every occurrence. This is not too useful most of times, as the log 
> lines will repeat the same information (the corrupt data itself is not logged 
> as it can be sensitive information).
> The best would to reduce logging without loosing information:
> - the first occurrence of an error should be logged (per 
> query/fragment/table/file/column) to help the investigation of cases where 
> the data error leads to other errors and to avoid breaking log analyzer tools 
> that search for the current format
> - other occurrences can be aggregated, like "in query Q table T column C XY 
> error occurred N times"
> An extra goal is to avoid calling RuntimeState::LogError() for other 
> occurrences than the first one, as RuntimeState::LogError() uses a (per 
> fragment) lock.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-7178) Reduce logging for common data errors

2018-07-09 Thread Zoram Thanga (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537325#comment-16537325
 ] 

Zoram Thanga commented on IMPALA-7178:
--

I wonder if IMPALA-6996 is an instance of this log flooding?

> Reduce logging for common data errors
> -
>
> Key: IMPALA-7178
> URL: https://issues.apache.org/jira/browse/IMPALA-7178
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Backend
>Reporter: Csaba Ringhofer
>Assignee: Csaba Ringhofer
>Priority: Major
>
> Some data errors (for example out-of-range parquet timestamps) can dominate 
> logs if a table contains a large number of rows with invalid data. If an 
> error has its own error code (see common/thrift/generate_error_codes.py), 
> then these errors are already aggregated to the user 
> (RuntimeState::LogError()) for every query, but the logs will contain a new 
> line for every occurrence. This is not too useful most of times, as the log 
> lines will repeat the same information (the corrupt data itself is not logged 
> as it can be sensitive information).
> The best would to reduce logging without loosing information:
> - the first occurrence of an error should be logged (per 
> query/fragment/table/file/column) to help the investigation of cases where 
> the data error leads to other errors and to avoid breaking log analyzer tools 
> that search for the current format
> - other occurrences can be aggregated, like "in query Q table T column C XY 
> error occurred N times"
> An extra goal is to avoid calling RuntimeState::LogError() for other 
> occurrences than the first one, as RuntimeState::LogError() uses a (per 
> fragment) lock.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-7265) Cache remote file handles

2018-07-09 Thread Joe McDonnell (JIRA)
Joe McDonnell created IMPALA-7265:
-

 Summary: Cache remote file handles
 Key: IMPALA-7265
 URL: https://issues.apache.org/jira/browse/IMPALA-7265
 Project: IMPALA
  Issue Type: Improvement
  Components: Backend
Affects Versions: Impala 3.1.0
Reporter: Joe McDonnell
Assignee: Joe McDonnell


The file handle cache currently does not allow caching remote file handles. 
This means that clusters that have a lot of remote reads can suffer from 
overloading the NameNode. Impala should be able to cache remote file handles.

There are some open questions about remote file handles and whether they behave 
differently from local file handles. In particular:
 # Is there any resource constraint on the number of remote file handles open? 
(e.g. do they maintain a network connection?)
 # Are there any semantic differences in how remote file handles behave when 
files are deleted, overwritten, or appended?
 # Are there any extra failure cases for remote file handles? (i.e. if a 
machine goes down or a remote file handle is left open for an extended period 
of time)

The form of caching will depend on the answers, but at the very least, it 
should be possible to cache a remote file handle at the level of a query so 
that a Parquet file with multiple columns can share file handles.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-5203) from_utc_timestamp inconsistent how it handles daily savings time

2018-07-09 Thread Csaba Ringhofer (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-5203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Csaba Ringhofer resolved IMPALA-5203.
-
Resolution: Resolved

> from_utc_timestamp inconsistent how it handles daily savings time
> -
>
> Key: IMPALA-5203
> URL: https://issues.apache.org/jira/browse/IMPALA-5203
> Project: IMPALA
>  Issue Type: Bug
> Environment: Impala Shell v2.6.0-cdh5.8.3 (9872875) built on Fri Dec  
> 9 14:31:00 PST 2016
>Reporter: Lou Bershad
>Priority: Major
>
> from_utc_timestamp(ts, EDT) as the timezone adjusts the time correctly 
> whether or not the timestamp being translated was during daylight savings 
> time.  That is, it adjusts 4 hours for dates during daylight savings time and 
> 5 hours for dates during standard time. from_utc_timetamp(ts, EST) always 
> adjusts by 5 hours.
> In 2017, daylight savings time started on March 12th.  This query shows that 
> from_utc_timestamp using EDT adjusts 5 hours on March 11th and 4 hours on 
> March 13th.  When using EST, it adjusts 5 hours no matter what.
> {noformat}
> [i-pvd1c1mgr-vip.vldb-bo.secureworkslab.com:21001] > select 'EST', 
> from_utc_timestamp('2017-03-11', 'EST'), from_utc_timestamp('2017-03-13', 
> 'EST') ;
> Query: select 'EST', from_utc_timestamp('2017-03-11', 'EST'), 
> from_utc_timestamp('2017-03-13', 'EST')
> +---+-+-+
> | 'est' | from_utc_timestamp('2017-03-11', 'est') | 
> from_utc_timestamp('2017-03-13', 'est') |
> +---+-+-+
> | EST   | 2017-03-10 19:00:00 | 2017-03-12 19:00:00   
>   |
> +---+-+-+
> Fetched 1 row(s) in 0.01s
> [i-pvd1c1mgr-vip.vldb-bo.secureworkslab.com:21001] > select 'EST', 
> from_utc_timestamp('2017-03-11', 'EDT'), from_utc_timestamp('2017-03-13', 
> 'EDT') ;
> Query: select 'EST', from_utc_timestamp('2017-03-11', 'EDT'), 
> from_utc_timestamp('2017-03-13', 'EDT')
> +---+-+-+
> | 'est' | from_utc_timestamp('2017-03-11', 'edt') | 
> from_utc_timestamp('2017-03-13', 'edt') |
> +---+-+-+
> | EST   | 2017-03-10 19:00:00 | 2017-03-12 20:00:00   
>   |
> +---+-+-+
> Fetched 1 row(s) in 0.01s
> {noformat}
> The inconsistency could be fixed either by:
> # EST acts the same as EDT and adjusts the timestamp based on whether the 
> timestamp is during daylight savings time.  (I feel quite strongly that this 
> would be the correct choice)
> # EDT always adjusts by 4 hours
> Note: The same dichotomy exists in other US timezones: PST/PDT, CST/CDT and 
> MST/MDT.  The dichotomy does not exist in France (CET/CEST).
> {noformat}
> Query: select tz, from_utc_timestamp('2017-03-01', tz), 
> from_utc_timestamp('2017-05-01', tz) from (
> select 'EST' tz union all
> select 'EDT' tz union all
> select 'CST' tz union all
> select 'CDT' tz union all
> select 'PST' tz union all
> select 'PDT' tz union all
> select 'CET' tz union all
> select 'CEST' tz
> ) x
> +--+--+--+
> | tz   | from_utc_timestamp('2017-03-01', tz) | 
> from_utc_timestamp('2017-05-01', tz) |
> +--+--+--+
> | EST  | 2017-02-28 19:00:00  | 2017-04-30 19:00:00   
>|
> | EDT  | 2017-02-28 19:00:00  | 2017-04-30 20:00:00   
>|
> | CST  | 2017-02-28 18:00:00  | 2017-04-30 19:00:00   
>|
> | CDT  | 2017-02-28 18:00:00  | 2017-04-30 19:00:00   
>|
> | PST  | 2017-02-28 16:00:00  | 2017-04-30 17:00:00   
>|
> | PDT  | 2017-02-28 16:00:00  | 2017-04-30 17:00:00   
>|
> | CET  | 2017-03-01 01:00:00  | 2017-05-01 02:00:00   
>|
> | CEST | 2017-03-01 01:00:00  | 2017-05-01 02:00:00   
>|
> +--+--+--+
> Fetched 8 row(s) in 0.02s
> {noformat}
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: 

[jira] [Commented] (IMPALA-5203) from_utc_timestamp inconsistent how it handles daily savings time

2018-07-09 Thread Csaba Ringhofer (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-5203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537261#comment-16537261
 ] 

Csaba Ringhofer commented on IMPALA-5203:
-

IMPALA-3307 replaced the timezone implementation and removed several 
non-canonical timezone names - this includes EDT/CDT/PDT/CEST. It is possible 
to add aliases in the new implementation, so these names could be re-added by 
setting them as aliases for EST/CST/PAST/CET.

> from_utc_timestamp inconsistent how it handles daily savings time
> -
>
> Key: IMPALA-5203
> URL: https://issues.apache.org/jira/browse/IMPALA-5203
> Project: IMPALA
>  Issue Type: Bug
> Environment: Impala Shell v2.6.0-cdh5.8.3 (9872875) built on Fri Dec  
> 9 14:31:00 PST 2016
>Reporter: Lou Bershad
>Priority: Major
>
> from_utc_timestamp(ts, EDT) as the timezone adjusts the time correctly 
> whether or not the timestamp being translated was during daylight savings 
> time.  That is, it adjusts 4 hours for dates during daylight savings time and 
> 5 hours for dates during standard time. from_utc_timetamp(ts, EST) always 
> adjusts by 5 hours.
> In 2017, daylight savings time started on March 12th.  This query shows that 
> from_utc_timestamp using EDT adjusts 5 hours on March 11th and 4 hours on 
> March 13th.  When using EST, it adjusts 5 hours no matter what.
> {noformat}
> [i-pvd1c1mgr-vip.vldb-bo.secureworkslab.com:21001] > select 'EST', 
> from_utc_timestamp('2017-03-11', 'EST'), from_utc_timestamp('2017-03-13', 
> 'EST') ;
> Query: select 'EST', from_utc_timestamp('2017-03-11', 'EST'), 
> from_utc_timestamp('2017-03-13', 'EST')
> +---+-+-+
> | 'est' | from_utc_timestamp('2017-03-11', 'est') | 
> from_utc_timestamp('2017-03-13', 'est') |
> +---+-+-+
> | EST   | 2017-03-10 19:00:00 | 2017-03-12 19:00:00   
>   |
> +---+-+-+
> Fetched 1 row(s) in 0.01s
> [i-pvd1c1mgr-vip.vldb-bo.secureworkslab.com:21001] > select 'EST', 
> from_utc_timestamp('2017-03-11', 'EDT'), from_utc_timestamp('2017-03-13', 
> 'EDT') ;
> Query: select 'EST', from_utc_timestamp('2017-03-11', 'EDT'), 
> from_utc_timestamp('2017-03-13', 'EDT')
> +---+-+-+
> | 'est' | from_utc_timestamp('2017-03-11', 'edt') | 
> from_utc_timestamp('2017-03-13', 'edt') |
> +---+-+-+
> | EST   | 2017-03-10 19:00:00 | 2017-03-12 20:00:00   
>   |
> +---+-+-+
> Fetched 1 row(s) in 0.01s
> {noformat}
> The inconsistency could be fixed either by:
> # EST acts the same as EDT and adjusts the timestamp based on whether the 
> timestamp is during daylight savings time.  (I feel quite strongly that this 
> would be the correct choice)
> # EDT always adjusts by 4 hours
> Note: The same dichotomy exists in other US timezones: PST/PDT, CST/CDT and 
> MST/MDT.  The dichotomy does not exist in France (CET/CEST).
> {noformat}
> Query: select tz, from_utc_timestamp('2017-03-01', tz), 
> from_utc_timestamp('2017-05-01', tz) from (
> select 'EST' tz union all
> select 'EDT' tz union all
> select 'CST' tz union all
> select 'CDT' tz union all
> select 'PST' tz union all
> select 'PDT' tz union all
> select 'CET' tz union all
> select 'CEST' tz
> ) x
> +--+--+--+
> | tz   | from_utc_timestamp('2017-03-01', tz) | 
> from_utc_timestamp('2017-05-01', tz) |
> +--+--+--+
> | EST  | 2017-02-28 19:00:00  | 2017-04-30 19:00:00   
>|
> | EDT  | 2017-02-28 19:00:00  | 2017-04-30 20:00:00   
>|
> | CST  | 2017-02-28 18:00:00  | 2017-04-30 19:00:00   
>|
> | CDT  | 2017-02-28 18:00:00  | 2017-04-30 19:00:00   
>|
> | PST  | 2017-02-28 16:00:00  | 2017-04-30 17:00:00   
>|
> | PDT  | 2017-02-28 16:00:00  | 2017-04-30 17:00:00   
>|
> | CET  | 2017-03-01 01:00:00  | 2017-05-01 02:00:00   
>|
> | CEST | 2017-03-01 01:00:00  | 2017-05-01 02:00:00   
>|
> 

[jira] [Resolved] (IMPALA-1577) Improve from_utc_timestamp perf which is unpredictable and slow

2018-07-09 Thread Csaba Ringhofer (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-1577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Csaba Ringhofer resolved IMPALA-1577.
-
Resolution: Resolved

> Improve from_utc_timestamp perf which is unpredictable and slow
> ---
>
> Key: IMPALA-1577
> URL: https://issues.apache.org/jira/browse/IMPALA-1577
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 2.0
>Reporter: Skye Wanderman-Milne
>Priority: Minor
>  Labels: performance
>
> The performance of from_utc_timestamp, besides being extremely poor compared 
> to that of other builtin functions such as from_unixtime, depends greatly on 
> the input timezone. Given that evaluating this function can dominate the 
> total query time, it would be good to at least make its performance more 
> consistent, and even better to improve its performance overall.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-1577) Improve from_utc_timestamp perf which is unpredictable and slow

2018-07-09 Thread Csaba Ringhofer (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-1577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Csaba Ringhofer resolved IMPALA-1577.
-
Resolution: Resolved

> Improve from_utc_timestamp perf which is unpredictable and slow
> ---
>
> Key: IMPALA-1577
> URL: https://issues.apache.org/jira/browse/IMPALA-1577
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 2.0
>Reporter: Skye Wanderman-Milne
>Priority: Minor
>  Labels: performance
>
> The performance of from_utc_timestamp, besides being extremely poor compared 
> to that of other builtin functions such as from_unixtime, depends greatly on 
> the input timezone. Given that evaluating this function can dominate the 
> total query time, it would be good to at least make its performance more 
> consistent, and even better to improve its performance overall.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IMPALA-1577) Improve from_utc_timestamp perf which is unpredictable and slow

2018-07-09 Thread Csaba Ringhofer (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-1577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537222#comment-16537222
 ] 

Csaba Ringhofer commented on IMPALA-1577:
-

IMPALA-3307 replaced the timezone implementation, which became generally faster 
and should be more consistent. The old implementation looked up timezone 
aliases much slower than canonical timezone names, while there shouldn't be any 
difference in the new one (many aliases were removed, while some were added to 
a map for fast lookups).

> Improve from_utc_timestamp perf which is unpredictable and slow
> ---
>
> Key: IMPALA-1577
> URL: https://issues.apache.org/jira/browse/IMPALA-1577
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 2.0
>Reporter: Skye Wanderman-Milne
>Priority: Minor
>  Labels: performance
>
> The performance of from_utc_timestamp, besides being extremely poor compared 
> to that of other builtin functions such as from_unixtime, depends greatly on 
> the input timezone. Given that evaluating this function can dominate the 
> total query time, it would be good to at least make its performance more 
> consistent, and even better to improve its performance overall.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-7241) progress-updater.cc:43] Check failed: delta >= 0 (-3 vs. 0)

2018-07-09 Thread Michael Brown (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-7241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16537206#comment-16537206
 ] 

Michael Brown commented on IMPALA-7241:
---

I've done several other stress runs and not seen this again. For now, that 
means I'll not label this a broken build.

> progress-updater.cc:43] Check failed: delta >= 0 (-3 vs. 0)
> ---
>
> Key: IMPALA-7241
> URL: https://issues.apache.org/jira/browse/IMPALA-7241
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 3.1.0
>Reporter: Michael Brown
>Priority: Blocker
>  Labels: crash, stress
>
> During a stress test with 8 nodes running an insecure debug build based off 
> master, an impalad hit a DCHECK. The concurrency level at the time was 
> between 150-180 queries.
> The DCHECK was {{progress-updater.cc:43] Check failed: delta >= 0 (-3 vs. 
> 0)}}.
> The stack was:
> {noformat}
> #0  0x7f6a73e811f7 in raise () from /lib64/libc.so.6
> #1  0x7f6a73e828e8 in abort () from /lib64/libc.so.6
> #2  0x04300e34 in google::DumpStackTraceAndExit() ()
> #3  0x042f78ad in google::LogMessage::Fail() ()
> #4  0x042f9152 in google::LogMessage::SendToLog() ()
> #5  0x042f7287 in google::LogMessage::Flush() ()
> #6  0x042fa84e in google::LogMessageFatal::~LogMessageFatal() ()
> #7  0x01f7fedd in impala::ProgressUpdater::Update(long) ()
> #8  0x0313912b in 
> impala::Coordinator::BackendState::InstanceStats::Update(impala::TFragmentInstanceExecStatus
>  const&, impala::Coordinator::ExecSummary*, impala::ProgressUpdater*) ()
> #9  0x031369e6 in 
> impala::Coordinator::BackendState::ApplyExecStatusReport(impala::TReportExecStatusParams
>  const&, impala::Coordinator::ExecSummary*, impala::ProgressUpdater*) ()
> #10 0x031250b4 in 
> impala::Coordinator::UpdateBackendExecStatus(impala::TReportExecStatusParams 
> const&) ()
> #11 0x01e86395 in 
> impala::ClientRequestState::UpdateBackendExecStatus(impala::TReportExecStatusParams
>  const&) ()
> #12 0x01e27594 in 
> impala::ImpalaServer::ReportExecStatus(impala::TReportExecStatusResult&, 
> impala::TReportExecStatusParams const&) ()
> #13 0x01ebb8a0 in 
> impala::ImpalaInternalService::ReportExecStatus(impala::TReportExecStatusResult&,
>  impala::TReportExecStatusParams const&) ()
> #14 0x02fa6f62 in 
> impala::ImpalaInternalServiceProcessor::process_ReportExecStatus(int, 
> apache::thrift::protocol::TProtocol*, apache::thrift::protocol::TProtocol*, 
> void*) ()
> #15 0x02fa6540 in 
> impala::ImpalaInternalServiceProcessor::dispatchCall(apache::thrift::protocol::TProtocol*,
>  apache::thrift::protocol::TProtocol*, std::string const&, int, void*) ()
> #16 0x018892b0 in 
> apache::thrift::TDispatchProcessor::process(boost::shared_ptr,
>  boost::shared_ptr, void*) ()
> #17 0x01c80d1b in 
> apache::thrift::server::TAcceptQueueServer::Task::run() ()
> #18 0x01c78ffb in 
> impala::ThriftThread::RunRunnable(boost::shared_ptr,
>  impala::Promise*) ()
> #19 0x01c7a721 in boost::_mfi::mf2 boost::shared_ptr, 
> impala::Promise (impala::PromiseMode)0>*>::operator()(impala::ThriftThread*, 
> boost::shared_ptr, 
> impala::Promise*) const ()
> #20 0x01c7a5b7 in void 
> boost::_bi::list3, 
> boost::_bi::value >, 
> boost::_bi::value*> 
> >::operator() boost::shared_ptr, 
> impala::Promise*>, 
> boost::_bi::list0>(boost::_bi::type, boost::_mfi::mf2 impala::ThriftThread, 
> boost::shared_ptr, 
> impala::Promise*>&, 
> boost::_bi::list0&, int) ()
> #21 0x01c7a303 in boost::_bi::bind_t impala::ThriftThread, 
> boost::shared_ptr, 
> impala::Promise*>, 
> boost::_bi::list3, 
> boost::_bi::value >, 
> boost::_bi::value*> > 
> >::operator()() ()
> #22 0x01c7a216 in 
> boost::detail::function::void_function_obj_invoker0 boost::_mfi::mf2 boost::shared_ptr, 
> impala::Promise*>, 
> boost::_bi::list3, 
> boost::_bi::value >, 
> boost::_bi::value*> > 
> >, void>::invoke(boost::detail::function::function_buffer&) ()
> #23 0x01bbb81c in boost::function0::operator()() const ()
> #24 0x01fb6eaf in impala::Thread::SuperviseThread(std::string const&, 
> std::string const&, boost::function, impala::ThreadDebugInfo const*, 
> impala::Promise*) ()
> #25 0x01fbef87 in void 
> boost::_bi::list5, 
> boost::_bi::value, boost::_bi::value >, 
> boost::_bi::value, 
> boost::_bi::value*> 
> >::operator() boost::function, impala::ThreadDebugInfo const*, 
> impala::Promise*), 
> boost::_bi::list0>(boost::_bi::type, void (*&)(std::string const&, 
> std::string const&, boost::function, impala::ThreadDebugInfo const*, 
> impala::Promise*), boost::_bi::list0&, int) ()
> #26 0x01fbeeab in 

[jira] [Closed] (IMPALA-3956) Impala shell variable substitution should ignore comments embedded in query.

2018-07-09 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-3956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni closed IMPALA-3956.
---
   Resolution: Fixed
Fix Version/s: Impala 3.1.0

> Impala shell variable substitution should ignore comments embedded in query.
> 
>
> Key: IMPALA-3956
> URL: https://issues.apache.org/jira/browse/IMPALA-3956
> Project: IMPALA
>  Issue Type: Bug
>  Components: Clients, Docs
>Affects Versions: Impala 2.6.0
>Reporter: Huaisi Xu
>Assignee: Alex Rodoni
>Priority: Minor
>  Labels: regression, usability
> Fix For: Impala 3.1.0
>
>
> {code:java}
> -- WHERE a >= "${start_date}"
> select
> *
> from --fda;
> test
> -- WHEREa >= "${start_date}"
> {code}
> {code:java}
> Starting Impala Shell without Kerberos authentication
> Connected to localhost:21000
> Server version: impalad version 2.7.0-cdh5-INTERNAL DEBUG (build 
> ebd65a142d3f3f4087eb1c9aaf25d53d2045a4cb)
> Error: Unknown substitution syntax (START_DATE). Use ${VAR:var_name}.
> Could not execute command: -- WHERE a >= "${start_date}"
> select
> *
> from --fda;
> test
> -- WHERE a >= "${start_date}"
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Closed] (IMPALA-3956) Impala shell variable substitution should ignore comments embedded in query.

2018-07-09 Thread Alex Rodoni (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-3956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Rodoni closed IMPALA-3956.
---
   Resolution: Fixed
Fix Version/s: Impala 3.1.0

> Impala shell variable substitution should ignore comments embedded in query.
> 
>
> Key: IMPALA-3956
> URL: https://issues.apache.org/jira/browse/IMPALA-3956
> Project: IMPALA
>  Issue Type: Bug
>  Components: Clients, Docs
>Affects Versions: Impala 2.6.0
>Reporter: Huaisi Xu
>Assignee: Alex Rodoni
>Priority: Minor
>  Labels: regression, usability
> Fix For: Impala 3.1.0
>
>
> {code:java}
> -- WHERE a >= "${start_date}"
> select
> *
> from --fda;
> test
> -- WHEREa >= "${start_date}"
> {code}
> {code:java}
> Starting Impala Shell without Kerberos authentication
> Connected to localhost:21000
> Server version: impalad version 2.7.0-cdh5-INTERNAL DEBUG (build 
> ebd65a142d3f3f4087eb1c9aaf25d53d2045a4cb)
> Error: Unknown substitution syntax (START_DATE). Use ${VAR:var_name}.
> Could not execute command: -- WHERE a >= "${start_date}"
> select
> *
> from --fda;
> test
> -- WHERE a >= "${start_date}"
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (IMPALA-6988) Statement to allow setting ownership for table/view

2018-07-09 Thread Fredy Wijaya (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-6988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Fredy Wijaya resolved IMPALA-6988.
--
   Resolution: Fixed
Fix Version/s: Impala 3.1.0

> Statement to allow setting ownership for table/view
> ---
>
> Key: IMPALA-6988
> URL: https://issues.apache.org/jira/browse/IMPALA-6988
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Frontend
>Affects Versions: Impala 3.0, Impala 2.13.0
>Reporter: Adam Holley
>Assignee: Fredy Wijaya
>Priority: Major
>  Labels: security
> Fix For: Impala 3.1.0
>
>
> Create statement to allow setting owner.
> {noformat}
> ALTER TABLE database_name.table_name SET OWNER [USER|ROLE] user_or_role;
> ALTER VIEW database_name.view_name SET OWNER [USER|ROLE] 
> user_or_role;{noformat}
> Examples:
> {noformat}
> ALTER TABLE  SET OWNER USER 
> ALTER TABLE  SET OWNER ROLE 
> ALTER VIEW  SET OWNER USER 
> ALTER VIEW  SET OWNER ROLE 
> {noformat}
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (IMPALA-6988) Statement to allow setting ownership for table/view

2018-07-09 Thread Fredy Wijaya (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-6988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Fredy Wijaya resolved IMPALA-6988.
--
   Resolution: Fixed
Fix Version/s: Impala 3.1.0

> Statement to allow setting ownership for table/view
> ---
>
> Key: IMPALA-6988
> URL: https://issues.apache.org/jira/browse/IMPALA-6988
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Frontend
>Affects Versions: Impala 3.0, Impala 2.13.0
>Reporter: Adam Holley
>Assignee: Fredy Wijaya
>Priority: Major
>  Labels: security
> Fix For: Impala 3.1.0
>
>
> Create statement to allow setting owner.
> {noformat}
> ALTER TABLE database_name.table_name SET OWNER [USER|ROLE] user_or_role;
> ALTER VIEW database_name.view_name SET OWNER [USER|ROLE] 
> user_or_role;{noformat}
> Examples:
> {noformat}
> ALTER TABLE  SET OWNER USER 
> ALTER TABLE  SET OWNER ROLE 
> ALTER VIEW  SET OWNER USER 
> ALTER VIEW  SET OWNER ROLE 
> {noformat}
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-1624) Allow toggling --quiet and -B settings interactively in impala-shell

2018-07-09 Thread Nghia Le (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-1624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-1624 started by Nghia Le.

> Allow toggling --quiet and -B settings interactively in impala-shell
> 
>
> Key: IMPALA-1624
> URL: https://issues.apache.org/jira/browse/IMPALA-1624
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Clients
>Affects Versions: Impala 2.1
>Reporter: John Russell
>Assignee: Nghia Le
>Priority: Minor
>  Labels: ramp-up, shell, usability
>
> I find it is quite common to start impala-shell with the --quiet option, then 
> within the same session want to turn on the extra output like the timing 
> numbers. Or to start with the normal ASCII boxes for query output, then want 
> to switch to comma-delimited output. Currently, this involves quitting out of 
> the shell and restarting it, plus doing an extra USE or -d to get back to the 
> same database as before.
> I suggest having some command in the shell to switch between quiet and 
> verbose mode, boxed and delimited output, and set the delimiter character. 
> Perhaps there are other similar command-line settings that would be easy to 
> toggle while the interpreter is running.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-1624) Allow toggling --quiet and -B settings interactively in impala-shell

2018-07-09 Thread Csaba Ringhofer (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-1624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Csaba Ringhofer reassigned IMPALA-1624:
---

Assignee: Nghia Le

> Allow toggling --quiet and -B settings interactively in impala-shell
> 
>
> Key: IMPALA-1624
> URL: https://issues.apache.org/jira/browse/IMPALA-1624
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Clients
>Affects Versions: Impala 2.1
>Reporter: John Russell
>Assignee: Nghia Le
>Priority: Minor
>  Labels: ramp-up, shell, usability
>
> I find it is quite common to start impala-shell with the --quiet option, then 
> within the same session want to turn on the extra output like the timing 
> numbers. Or to start with the normal ASCII boxes for query output, then want 
> to switch to comma-delimited output. Currently, this involves quitting out of 
> the shell and restarting it, plus doing an extra USE or -d to get back to the 
> same database as before.
> I suggest having some command in the shell to switch between quiet and 
> verbose mode, boxed and delimited output, and set the delimiter character. 
> Perhaps there are other similar command-line settings that would be easy to 
> toggle while the interpreter is running.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-7264) Set owner type upon table creation

2018-07-09 Thread Fredy Wijaya (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Fredy Wijaya updated IMPALA-7264:
-
Labels: security  (was: )

> Set owner type upon table creation
> --
>
> Key: IMPALA-7264
> URL: https://issues.apache.org/jira/browse/IMPALA-7264
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Frontend
>Affects Versions: Impala 3.0
>Reporter: Fredy Wijaya
>Assignee: Fredy Wijaya
>Priority: Major
>  Labels: security
>
> Currently only owner name is set upon table creation. We need to also set 
> owner type upon table creation.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-7264) Set owner type upon table creation

2018-07-09 Thread Fredy Wijaya (JIRA)


 [ 
https://issues.apache.org/jira/browse/IMPALA-7264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-7264 started by Fredy Wijaya.

> Set owner type upon table creation
> --
>
> Key: IMPALA-7264
> URL: https://issues.apache.org/jira/browse/IMPALA-7264
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Frontend
>Affects Versions: Impala 3.0
>Reporter: Fredy Wijaya
>Assignee: Fredy Wijaya
>Priority: Major
>  Labels: security
>
> Currently only owner name is set upon table creation. We need to also set 
> owner type upon table creation.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-7264) Set owner type upon table creation

2018-07-09 Thread Fredy Wijaya (JIRA)
Fredy Wijaya created IMPALA-7264:


 Summary: Set owner type upon table creation
 Key: IMPALA-7264
 URL: https://issues.apache.org/jira/browse/IMPALA-7264
 Project: IMPALA
  Issue Type: Sub-task
  Components: Frontend
Affects Versions: Impala 3.0
Reporter: Fredy Wijaya
Assignee: Fredy Wijaya


Currently only owner name is set upon table creation. We need to also set owner 
type upon table creation.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-7264) Set owner type upon table creation

2018-07-09 Thread Fredy Wijaya (JIRA)
Fredy Wijaya created IMPALA-7264:


 Summary: Set owner type upon table creation
 Key: IMPALA-7264
 URL: https://issues.apache.org/jira/browse/IMPALA-7264
 Project: IMPALA
  Issue Type: Sub-task
  Components: Frontend
Affects Versions: Impala 3.0
Reporter: Fredy Wijaya
Assignee: Fredy Wijaya


Currently only owner name is set upon table creation. We need to also set owner 
type upon table creation.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IMPALA-7263) Impala Doc: Cancel Shutdown of impalad

2018-07-09 Thread Alex Rodoni (JIRA)
Alex Rodoni created IMPALA-7263:
---

 Summary: Impala Doc: Cancel Shutdown of impalad
 Key: IMPALA-7263
 URL: https://issues.apache.org/jira/browse/IMPALA-7263
 Project: IMPALA
  Issue Type: Sub-task
  Components: Docs
Reporter: Alex Rodoni
Assignee: Alex Rodoni






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-7263) Impala Doc: Cancel Shutdown of impalad

2018-07-09 Thread Alex Rodoni (JIRA)
Alex Rodoni created IMPALA-7263:
---

 Summary: Impala Doc: Cancel Shutdown of impalad
 Key: IMPALA-7263
 URL: https://issues.apache.org/jira/browse/IMPALA-7263
 Project: IMPALA
  Issue Type: Sub-task
  Components: Docs
Reporter: Alex Rodoni
Assignee: Alex Rodoni






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IMPALA-7262) Impala Doc: Quiesce impalad without shutting down

2018-07-09 Thread Alex Rodoni (JIRA)
Alex Rodoni created IMPALA-7262:
---

 Summary: Impala Doc: Quiesce impalad without shutting down
 Key: IMPALA-7262
 URL: https://issues.apache.org/jira/browse/IMPALA-7262
 Project: IMPALA
  Issue Type: Sub-task
  Components: Docs
Reporter: Alex Rodoni
Assignee: Alex Rodoni






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (IMPALA-7262) Impala Doc: Quiesce impalad without shutting down

2018-07-09 Thread Alex Rodoni (JIRA)
Alex Rodoni created IMPALA-7262:
---

 Summary: Impala Doc: Quiesce impalad without shutting down
 Key: IMPALA-7262
 URL: https://issues.apache.org/jira/browse/IMPALA-7262
 Project: IMPALA
  Issue Type: Sub-task
  Components: Docs
Reporter: Alex Rodoni
Assignee: Alex Rodoni






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-1624) Allow toggling --quiet and -B settings interactively in impala-shell

2018-07-09 Thread Csaba Ringhofer (JIRA)


[ 
https://issues.apache.org/jira/browse/IMPALA-1624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536877#comment-16536877
 ] 

Csaba Ringhofer commented on IMPALA-1624:
-

I would add --output_file as a good candidate to set interactively. I often run 
into situations when I want to write the results of some  queries into file, 
while read others in the shell.

> Allow toggling --quiet and -B settings interactively in impala-shell
> 
>
> Key: IMPALA-1624
> URL: https://issues.apache.org/jira/browse/IMPALA-1624
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Clients
>Affects Versions: Impala 2.1
>Reporter: John Russell
>Priority: Minor
>  Labels: ramp-up, shell, usability
>
> I find it is quite common to start impala-shell with the --quiet option, then 
> within the same session want to turn on the extra output like the timing 
> numbers. Or to start with the normal ASCII boxes for query output, then want 
> to switch to comma-delimited output. Currently, this involves quitting out of 
> the shell and restarting it, plus doing an extra USE or -d to get back to the 
> same database as before.
> I suggest having some command in the shell to switch between quiet and 
> verbose mode, boxed and delimited output, and set the delimiter character. 
> Perhaps there are other similar command-line settings that would be easy to 
> toggle while the interpreter is running.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-5542) Impala cannot scan Parquet decimal stored as int64_t/int32_t

2018-07-09 Thread JIRA


 [ 
https://issues.apache.org/jira/browse/IMPALA-5542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zoltán Borók-Nagy reassigned IMPALA-5542:
-

Assignee: Zoltán Borók-Nagy

> Impala cannot scan Parquet decimal stored as int64_t/int32_t
> 
>
> Key: IMPALA-5542
> URL: https://issues.apache.org/jira/browse/IMPALA-5542
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Backend
>Affects Versions: Impala 2.10.0
>Reporter: Tim Armstrong
>Assignee: Zoltán Borók-Nagy
>Priority: Major
>  Labels: parquet
>
> This is supported according to the Parquet spec 
> (https://github.com/Parquet/parquet-format/blob/master/LogicalTypes.md#decimal)
>  but wasn't widely used. For some reason Spark decided to start writing this 
> out as the default (see SPARK-20297) so we will likely start seeing this at 
> some point.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org