[jira] [Resolved] (IMPALA-9736) "MT_DOP not supported for plans with base table joins or table sinks" error is out of date

2020-05-10 Thread Tim Armstrong (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Armstrong resolved IMPALA-9736.
---
Fix Version/s: Impala 4.0
   Resolution: Fixed

> "MT_DOP not supported for plans with base table joins or table sinks" error 
> is out of date
> --
>
> Key: IMPALA-9736
> URL: https://issues.apache.org/jira/browse/IMPALA-9736
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Major
> Fix For: Impala 4.0
>
>
> We do support base table joins now.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-8755) Implement Z-ordering for Impala

2020-05-10 Thread Jim Apple (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-8755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17104050#comment-17104050
 ] 

Jim Apple commented on IMPALA-8755:
---

Hi all! What's work is left to complete this feature?

> Implement Z-ordering for Impala
> ---
>
> Key: IMPALA-8755
> URL: https://issues.apache.org/jira/browse/IMPALA-8755
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Zoltán Borók-Nagy
>Assignee: Norbert Luksa
>Priority: Major
>
> Implement Z-ordering for Impala: [https://en.wikipedia.org/wiki/Z-order_curve]
> A Z-order curve defines an ordering on multi-dimensional data. Data sorted 
> that way can be efficiently filtered by min/max statistics regarding to the 
> columns participating in the ordering.
> Impala currently only supports lexicographic ordering via the SORT BY clause. 
> This strongly prefers the first column, i.e. given the "SORT BY A, B, C" 
> clause => A will be totally ordered (hence filtering on A will be very 
> efficient), but values belonging to B and C will be scattered throughout the 
> data set (hence filtering on B or C will barely do any good).
> We could add a new clause, e.g. a "ZSORT BY" clause to Impala that writes the 
> data in Z-order.
> "ZSORT BY A, B C" would cluster the rows in a way that filtering on A, B, or 
> C would be equally efficient.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-9741) Support query iceberg table by impala

2020-05-10 Thread WangSheng (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-9741 started by WangSheng.
-
> Support query iceberg table by impala
> -
>
> Key: IMPALA-9741
> URL: https://issues.apache.org/jira/browse/IMPALA-9741
> Project: IMPALA
>  Issue Type: Sub-task
>Reporter: WangSheng
>Assignee: WangSheng
>Priority: Major
>
> Since we have submit an patch of supporting create iceberg table by impala in 
> IMPALA-9688, we are preparing to implement iceberg table query by impala. But 
> we need to read the impala and iceberg code  deeply to determine how to do 
> this.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-9741) Support query iceberg table by impala

2020-05-10 Thread WangSheng (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangSheng updated IMPALA-9741:
--
Description: Since we have submit an patch of supporting create iceberg 
table by impala in IMPALA-9688, we are preparing to implement iceberg table 
query by impala. But we need to read the impala and iceberg code  deeply to 
determine how to do this.

> Support query iceberg table by impala
> -
>
> Key: IMPALA-9741
> URL: https://issues.apache.org/jira/browse/IMPALA-9741
> Project: IMPALA
>  Issue Type: Sub-task
>Reporter: WangSheng
>Assignee: WangSheng
>Priority: Major
>
> Since we have submit an patch of supporting create iceberg table by impala in 
> IMPALA-9688, we are preparing to implement iceberg table query by impala. But 
> we need to read the impala and iceberg code  deeply to determine how to do 
> this.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-9741) Support query iceberg table by impala

2020-05-10 Thread WangSheng (Jira)
WangSheng created IMPALA-9741:
-

 Summary: Support query iceberg table by impala
 Key: IMPALA-9741
 URL: https://issues.apache.org/jira/browse/IMPALA-9741
 Project: IMPALA
  Issue Type: Sub-task
Reporter: WangSheng
Assignee: WangSheng






--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-9621) Support iceberg on hdfs

2020-05-10 Thread WangSheng (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-9621 started by WangSheng.
-
> Support iceberg on hdfs
> ---
>
> Key: IMPALA-9621
> URL: https://issues.apache.org/jira/browse/IMPALA-9621
> Project: IMPALA
>  Issue Type: Improvement
>Reporter: WangSheng
>Assignee: WangSheng
>Priority: Major
>
> We are investigating iceberg recently, and preparing to implement select 
> iceberg data by impala. Our production use hdfs, so we will try to support 
> iceberg on hdfs.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9382) Revisit representation of runtime profile objects to be denser

2020-05-10 Thread Tim Armstrong (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17103981#comment-17103981
 ] 

Tim Armstrong commented on IMPALA-9382:
---

Here is the latest iteration, showing profiles with the new format enabled and 
disabled.

 [^tpcds_q10_profile_v1.txt]  [^tpcds_q10_profile_v2.txt]  



> Revisit representation of runtime profile objects to be denser
> --
>
> Key: IMPALA-9382
> URL: https://issues.apache.org/jira/browse/IMPALA-9382
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Backend
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Major
> Attachments: profile_504b379400cba9f2_2d2cf007, 
> tpcds_q10_profile_v1.txt, tpcds_q10_profile_v2.txt
>
>
> RuntimeProfile trees can potentially stress the memory allocator and use up a 
> lot more memory and cache than is really necessary:
> * std::map is used throughout, and allocates a node per map entry. We do 
> depend on the counters being displayed in-order, but we would probably be 
> better of storing the counters in a vector and lazily sorting when needed 
> (since the set of counters is generally static after Prepare()).
> * We store the same counter names redundantly all over the place. We'd 
> probably be best off using a pool of constant counter names (we could just 
> require registering them upfront).
> There may be a small gain from switching thrift to using unordered_map, e.g. 
> for the info strings that appear with some frequency in profiles.
> However, I think we need to restructure the thrift representation and 
> in-memory representation to get significant gains.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-9382) Revisit representation of runtime profile objects to be denser

2020-05-10 Thread Tim Armstrong (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Armstrong updated IMPALA-9382:
--
Attachment: tpcds_q10_profile_v2.txt
tpcds_q10_profile_v1.txt

> Revisit representation of runtime profile objects to be denser
> --
>
> Key: IMPALA-9382
> URL: https://issues.apache.org/jira/browse/IMPALA-9382
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Backend
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Major
> Attachments: profile_504b379400cba9f2_2d2cf007, 
> tpcds_q10_profile_v1.txt, tpcds_q10_profile_v2.txt
>
>
> RuntimeProfile trees can potentially stress the memory allocator and use up a 
> lot more memory and cache than is really necessary:
> * std::map is used throughout, and allocates a node per map entry. We do 
> depend on the counters being displayed in-order, but we would probably be 
> better of storing the counters in a vector and lazily sorting when needed 
> (since the set of counters is generally static after Prepare()).
> * We store the same counter names redundantly all over the place. We'd 
> probably be best off using a pool of constant counter names (we could just 
> require registering them upfront).
> There may be a small gain from switching thrift to using unordered_map, e.g. 
> for the info strings that appear with some frequency in profiles.
> However, I think we need to restructure the thrift representation and 
> in-memory representation to get significant gains.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-9740) TSAN data race in hdfs-bulk-ops

2020-05-10 Thread Sahil Takiar (Jira)
Sahil Takiar created IMPALA-9740:


 Summary: TSAN data race in hdfs-bulk-ops
 Key: IMPALA-9740
 URL: https://issues.apache.org/jira/browse/IMPALA-9740
 Project: IMPALA
  Issue Type: Sub-task
  Components: Backend
Reporter: Sahil Takiar


hdfs-bulk-ops usage of a local connection cache (HdfsFsCache::HdfsFsMap) has a 
data race:
{code:java}
 WARNING: ThreadSanitizer: data race (pid=23205)
  Write of size 8 at 0x7b24005642d8 by thread T47:
#0 
boost::unordered::detail::table_impl >, std::string, hdfs_internal*, 
boost::hash, std::equal_to > 
>::add_node(boost::unordered::detail::node_constructor > > >&, unsigned long) 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/unordered/detail/unique.hpp:329:26
 (impalad+0x1f93832)
#1 
std::pair > >, bool> 
boost::unordered::detail::table_impl >, std::string, hdfs_internal*, 
boost::hash, std::equal_to > 
>::emplace_impl >(std::string 
const&, std::pair&&) 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/unordered/detail/unique.hpp:420:41
 (impalad+0x1f933ed)
#2 
std::pair > >, bool> 
boost::unordered::detail::table_impl >, std::string, hdfs_internal*, 
boost::hash, std::equal_to > 
>::emplace >(std::pair&&) 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/unordered/detail/unique.hpp:384:20
 (impalad+0x1f932d1)
#3 
std::pair > >, bool> boost::unordered::unordered_map, std::equal_to, 
std::allocator > 
>::emplace >(std::pair&&) 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/unordered/unordered_map.hpp:241:27
 (impalad+0x1f93238)
#4 boost::unordered::unordered_map, std::equal_to, 
std::allocator > 
>::insert(std::pair&&) 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/unordered/unordered_map.hpp:390:26
 (impalad+0x1f92038)
#5 impala::HdfsFsCache::GetConnection(std::string const&, hdfs_internal**, 
boost::unordered::unordered_map, std::equal_to, 
std::allocator > >*) 
/data/jenkins/workspace/impala-private-parameterized/repos/Impala/be/src/runtime/hdfs-fs-cache.cc:115:18
 (impalad+0x1f916b3)
#6 impala::HdfsOp::Execute() const 
/data/jenkins/workspace/impala-private-parameterized/repos/Impala/be/src/util/hdfs-bulk-ops.cc:84:55
 (impalad+0x23444d5)
#7 HdfsThreadPoolHelper(int, impala::HdfsOp const&) 
/data/jenkins/workspace/impala-private-parameterized/repos/Impala/be/src/util/hdfs-bulk-ops.cc:137:6
 (impalad+0x2344ea9)
#8 boost::detail::function::void_function_invoker2::invoke(boost::detail::function::function_buffer&, int, impala::HdfsOp 
const&) 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/function/function_template.hpp:118:11
 (impalad+0x2345e80)
#9 boost::function2::operator()(int, 
impala::HdfsOp const&) const 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/function/function_template.hpp:770:14
 (impalad+0x1f883be)
#10 impala::ThreadPool::WorkerThread(int) 
/data/jenkins/workspace/impala-private-parameterized/repos/Impala/be/src/util/thread-pool.h:166:9
 (impalad+0x1f874e5)
#11 boost::_mfi::mf1, 
int>::operator()(impala::ThreadPool*, int) const 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/bind/mem_fn_template.hpp:165:29
 (impalad+0x1f87b7d)
#12 void 
boost::_bi::list2*>, 
boost::_bi::value >::operator(), int>, 
boost::_bi::list0>(boost::_bi::type, boost::_mfi::mf1, int>&, boost::_bi::list0&, int) 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/bind/bind.hpp:319:9
 (impalad+0x1f87abc)
#13 boost::_bi::bind_t, int>, 
boost::_bi::list2*>, 
boost::_bi::value > >::operator()() 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/bind/bind.hpp:1222:16
 (impalad+0x1f87a23)
#14 
boost::detail::function::void_function_obj_invoker0, int>, 
boost::_bi::list2*>, 
boost::_bi::value > >, 
void>::invoke(boost::detail::function::function_buffer&) 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/function/function_template.hpp:159:11
 (impalad+0x1f877c1)
#15 boost::function0::operator()() const 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/function/function_template.hpp:770:14
 (impalad+0x1e192b1)
#16 impala::Thread::SuperviseThread(std::string const&, std::string const&, 
boost::function, impala::ThreadDebugInfo const*, impala::Promise*) 
/data/jenkins/workspace/impala-private-parameterized/repos/Impala/be/src/util/thread.cc:360:3
 (impalad+0x23df196)
#17 void 

[jira] [Assigned] (IMPALA-9739) TSAN data races during impalad shutdown

2020-05-10 Thread Sahil Takiar (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-9739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sahil Takiar reassigned IMPALA-9739:


Assignee: Bikramjeet Vig

Not really sure what is going on here. Bikram, assigning to you since you 
touched this code at some point. At the very least, it would be good to 
understanding why the data race is occurring.

> TSAN data races during impalad shutdown
> ---
>
> Key: IMPALA-9739
> URL: https://issues.apache.org/jira/browse/IMPALA-9739
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Backend
>Reporter: Sahil Takiar
>Assignee: Bikramjeet Vig
>Priority: Major
>
> A TSAN run of the custom cluster tests shows several instances of the 
> following data race during impalad shutdown:
> {code:java}
> WARNING: ThreadSanitizer: data race (pid=12660)
>   Read of size 8 at 0x07786f60 by thread T338:
> #0 std::unique_ptr 
> >::~unique_ptr() 
> /data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/gcc-4.9.2/lib/gcc/x86_64-unknown-linux-gnu/4.9.2/../../../../include/c++/4.9.2/bits/unique_ptr.h:235:6
>  (impalad+0x19bd895)
> #1 at_exit_wrapper(void*) 
> /mnt/source/llvm/llvm-5.0.1.src-p2/projects/compiler-rt/lib/tsan/rtl/tsan_interceptors.cc:361
>  (impalad+0x191cf13)
> #2 impala::ImpalaServer::StartShutdown(long, 
> impala::ShutdownStatusPB*)::$_2::operator()() const 
> /data/jenkins/workspace/impala-private-parameterized/repos/Impala/be/src/service/impala-server.cc:2622:57
>  (impalad+0x21bd871)
> #3 
> boost::detail::function::void_function_obj_invoker0  impala::ShutdownStatusPB*)::$_2, 
> void>::invoke(boost::detail::function::function_buffer&) 
> /data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/function/function_template.hpp:159:11
>  (impalad+0x21bd6d9)
> #4 boost::function0::operator()() const 
> /data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/function/function_template.hpp:770:14
>  (impalad+0x1e192b1)
> #5 impala::Thread::SuperviseThread(std::string const&, std::string 
> const&, boost::function, impala::ThreadDebugInfo const*, 
> impala::Promise*) 
> /data/jenkins/workspace/impala-private-parameterized/repos/Impala/be/src/util/thread.cc:360:3
>  (impalad+0x23df196)
> #6 void boost::_bi::list5, 
> boost::_bi::value, boost::_bi::value >, 
> boost::_bi::value, 
> boost::_bi::value*> 
> >::operator() boost::function, impala::ThreadDebugInfo const*, 
> impala::Promise*), 
> boost::_bi::list0>(boost::_bi::type, void (*&)(std::string const&, 
> std::string const&, boost::function, impala::ThreadDebugInfo const*, 
> impala::Promise*), boost::_bi::list0&, int) 
> /data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/bind/bind.hpp:531:9
>  (impalad+0x23e735c)
> #7 boost::_bi::bind_t const&, boost::function, impala::ThreadDebugInfo const*, 
> impala::Promise*), 
> boost::_bi::list5, 
> boost::_bi::value, boost::_bi::value >, 
> boost::_bi::value, 
> boost::_bi::value*> > 
> >::operator()() 
> /data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/bind/bind.hpp:1222:16
>  (impalad+0x23e7273)
> #8 boost::detail::thread_data (*)(std::string const&, std::string const&, boost::function, 
> impala::ThreadDebugInfo const*, impala::Promise (impala::PromiseMode)0>*), boost::_bi::list5, 
> boost::_bi::value, boost::_bi::value >, 
> boost::_bi::value, 
> boost::_bi::value*> > > 
> >::run() 
> /data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/thread/detail/thread.hpp:116:17
>  (impalad+0x23e6f60)
> #9 thread_proxy  (impalad+0x30e44f9)
>   Previous write of size 8 at 0x07786f60 by main thread:
> #0 void std::swap(impala::Thread*&, impala::Thread*&) 
> /data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/gcc-4.9.2/lib/gcc/x86_64-unknown-linux-gnu/4.9.2/../../../../include/c++/4.9.2/bits/move.h:176:11
>  (impalad+0x221f370)
> #1 std::unique_ptr 
> >::reset(impala::Thread*) 
> /data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/gcc-4.9.2/lib/gcc/x86_64-unknown-linux-gnu/4.9.2/../../../../include/c++/4.9.2/bits/unique_ptr.h:342:2
>  (impalad+0x221a5ab)
> #2 std::unique_ptr 
> >::operator=(std::unique_ptr std::default_delete >&&) 
> /data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/gcc-4.9.2/lib/gcc/x86_64-unknown-linux-gnu/4.9.2/../../../../include/c++/4.9.2/bits/unique_ptr.h:251:2
>  (impalad+0x23e1ad8)
> #3 impala::Thread::StartThread(std::string const&, std::string const&, 
> boost::function const&, std::unique_ptr std::default_delete >*, bool) 
> 

[jira] [Created] (IMPALA-9739) TSAN data races during impalad shutdown

2020-05-10 Thread Sahil Takiar (Jira)
Sahil Takiar created IMPALA-9739:


 Summary: TSAN data races during impalad shutdown
 Key: IMPALA-9739
 URL: https://issues.apache.org/jira/browse/IMPALA-9739
 Project: IMPALA
  Issue Type: Sub-task
  Components: Backend
Reporter: Sahil Takiar


A TSAN run of the custom cluster tests shows several instances of the following 
data race during impalad shutdown:
{code:java}
WARNING: ThreadSanitizer: data race (pid=12660)
  Read of size 8 at 0x07786f60 by thread T338:
#0 std::unique_ptr 
>::~unique_ptr() 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/gcc-4.9.2/lib/gcc/x86_64-unknown-linux-gnu/4.9.2/../../../../include/c++/4.9.2/bits/unique_ptr.h:235:6
 (impalad+0x19bd895)
#1 at_exit_wrapper(void*) 
/mnt/source/llvm/llvm-5.0.1.src-p2/projects/compiler-rt/lib/tsan/rtl/tsan_interceptors.cc:361
 (impalad+0x191cf13)
#2 impala::ImpalaServer::StartShutdown(long, 
impala::ShutdownStatusPB*)::$_2::operator()() const 
/data/jenkins/workspace/impala-private-parameterized/repos/Impala/be/src/service/impala-server.cc:2622:57
 (impalad+0x21bd871)
#3 
boost::detail::function::void_function_obj_invoker0::invoke(boost::detail::function::function_buffer&) 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/function/function_template.hpp:159:11
 (impalad+0x21bd6d9)
#4 boost::function0::operator()() const 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/function/function_template.hpp:770:14
 (impalad+0x1e192b1)
#5 impala::Thread::SuperviseThread(std::string const&, std::string const&, 
boost::function, impala::ThreadDebugInfo const*, impala::Promise*) 
/data/jenkins/workspace/impala-private-parameterized/repos/Impala/be/src/util/thread.cc:360:3
 (impalad+0x23df196)
#6 void boost::_bi::list5, 
boost::_bi::value, boost::_bi::value >, 
boost::_bi::value, 
boost::_bi::value*> 
>::operator(), impala::ThreadDebugInfo const*, impala::Promise*), boost::_bi::list0>(boost::_bi::type, void 
(*&)(std::string const&, std::string const&, boost::function, 
impala::ThreadDebugInfo const*, impala::Promise*), boost::_bi::list0&, int) 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/bind/bind.hpp:531:9
 (impalad+0x23e735c)
#7 boost::_bi::bind_t, impala::ThreadDebugInfo const*, 
impala::Promise*), 
boost::_bi::list5, 
boost::_bi::value, boost::_bi::value >, 
boost::_bi::value, 
boost::_bi::value*> > 
>::operator()() 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/bind/bind.hpp:1222:16
 (impalad+0x23e7273)
#8 boost::detail::thread_data, impala::ThreadDebugInfo 
const*, impala::Promise*), 
boost::_bi::list5, 
boost::_bi::value, boost::_bi::value >, 
boost::_bi::value, 
boost::_bi::value*> > > >::run() 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/boost-1.61.0-p2/include/boost/thread/detail/thread.hpp:116:17
 (impalad+0x23e6f60)
#9 thread_proxy  (impalad+0x30e44f9)

  Previous write of size 8 at 0x07786f60 by main thread:
#0 void std::swap(impala::Thread*&, impala::Thread*&) 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/gcc-4.9.2/lib/gcc/x86_64-unknown-linux-gnu/4.9.2/../../../../include/c++/4.9.2/bits/move.h:176:11
 (impalad+0x221f370)
#1 std::unique_ptr 
>::reset(impala::Thread*) 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/gcc-4.9.2/lib/gcc/x86_64-unknown-linux-gnu/4.9.2/../../../../include/c++/4.9.2/bits/unique_ptr.h:342:2
 (impalad+0x221a5ab)
#2 std::unique_ptr 
>::operator=(std::unique_ptr >&&) 
/data/jenkins/workspace/impala-private-parameterized/Impala-Toolchain/gcc-4.9.2/lib/gcc/x86_64-unknown-linux-gnu/4.9.2/../../../../include/c++/4.9.2/bits/unique_ptr.h:251:2
 (impalad+0x23e1ad8)
#3 impala::Thread::StartThread(std::string const&, std::string const&, 
boost::function const&, std::unique_ptr >*, bool) 
/data/jenkins/workspace/impala-private-parameterized/repos/Impala/be/src/util/thread.cc:329:11
 (impalad+0x23decac)
#4 impala::Status impala::Thread::Create(std::string const&, 
std::string const&, void (* const&)(), std::unique_ptr >*, bool) 
/data/jenkins/workspace/impala-private-parameterized/repos/Impala/be/src/util/thread.h:74:12
 (impalad+0x1a1cd8c)
#5 impala::StartImpalaShutdownSignalHandlerThread() 
/data/jenkins/workspace/impala-private-parameterized/repos/Impala/be/src/common/init.cc:407:10
 (impalad+0x1a1c1e8)
#6 ImpaladMain(int, char**) 
/data/jenkins/workspace/impala-private-parameterized/repos/Impala/be/src/service/impalad-main.cc:96:3
 (impalad+0x21a525a)
#7 main 
/data/jenkins/workspace/impala-private-parameterized/repos/Impala/be/src/service/daemon-main.cc:37:12
 (impalad+0x19b860a)

  As if synchronized via sleep:
#0 

[jira] [Commented] (IMPALA-9716) Add jitter to the exponential backoff in status reporting

2020-05-10 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17103906#comment-17103906
 ] 

ASF subversion and git services commented on IMPALA-9716:
-

Commit e8d17948fb94e9a5cf8b6cfce9e05d51ebb65492 in impala's branch 
refs/heads/master from Thomas Tauber-Marshall
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=e8d1794 ]

IMPALA-9716: Add jitter to the exponential backoff in status reporting

When status reports fail, we use exponential backoff when retrying
sending them. However, currently the backoff is deterministic, leading
to a thundering herd problem where all of the backends for a
particular query may try to report at the same time, the coordinator
is overwhelmed and rejects some of the rpcs, then the backends all
backoff by the same amount and retry sending at the same time, leading
the coordinator to be overwhelmed again.

This patch alleviates this problem by adding some random jitter to the
exponential backoff used when a status report fails.

Testing:
- Passed a full run of existing tests.
- Code path is covered by test_reportexecstatus_retries

Change-Id: Id05c224517aa606057117328f480dfa98676b923
Reviewed-on: http://gerrit.cloudera.org:8080/15860
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 


> Add jitter to the exponential backoff in status reporting
> -
>
> Key: IMPALA-9716
> URL: https://issues.apache.org/jira/browse/IMPALA-9716
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Distributed Exec
>Affects Versions: Impala 4.0
>Reporter: Thomas Tauber-Marshall
>Assignee: Thomas Tauber-Marshall
>Priority: Major
>
> When status reports fail, we use exponential backoff when retrying sending 
> them. However, currently the backoff is deterministic, leading to a 
> thundering herd problem where all of the backends for a particular query may 
> try to report at the same time, the coordinator is overwhelmed and rejects 
> some of the rpcs, then the backends all backoff by the same amount and retry 
> sending at the same time, leading the coordinator to be overwhelmed again.
> We can help solve this by adding some random jitter to the exponential 
> backoff time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9570) Document about memory management for writing UDAFs

2020-05-10 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17103905#comment-17103905
 ] 

ASF subversion and git services commented on IMPALA-9570:
-

Commit 4846751c84b70673ea25cd88e8c9d2085f7ae55e in impala's branch 
refs/heads/master from Shajini Thayasingh
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=4846751 ]

IMPALA-9570: [DOCS] add memory management

add memory management and fix broken links. Incorporated review changes.

Change-Id: I6e8b6d0c3fe2e1746831665b3d3ae98a0beaa1e7
Reviewed-on: http://gerrit.cloudera.org:8080/15836
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 


> Document about memory management for writing UDAFs
> --
>
> Key: IMPALA-9570
> URL: https://issues.apache.org/jira/browse/IMPALA-9570
> Project: IMPALA
>  Issue Type: Documentation
>Reporter: Quanlong Huang
>Assignee: shajini thayasingh
>Priority: Major
> Attachments: apache doc.png, cloudera doc.png, 
> image-2020-04-24-14-11-18-449.png
>
>
> In the current document, we have few words about memory management for 
> writing UDAFs: 
> [https://impala.apache.org/docs/build/html/topics/impala_udf.html]
> The only related sentense is "A serialize function that ... frees any memory 
> allocated during the init, update, and merge phases."
> We can add more for clarifying it or add this link for further reading: 
> [https://github.com/apache/impala/blob/2f53783b14ddf215a4c96bc52a511d3a83896128/be/src/udf/udf.h#L358-L365]
> BTW, the links of uda-sample.h and uda-sample.cc are not shown:
> [http://impala.apache.org/docs/build/html/topics/impala_udf.html#uda_functions]
> !apache doc.png|width=326,height=109!
> They are normal in Cloudera's doc: 
> [https://docs.cloudera.com/documentation/enterprise/latest/topics/impala_udf.html#uda_functions]
> !cloudera doc.png|width=725,height=226!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-9736) "MT_DOP not supported for plans with base table joins or table sinks" error is out of date

2020-05-10 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-9736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17103907#comment-17103907
 ] 

ASF subversion and git services commented on IMPALA-9736:
-

Commit dcf497910a8db4796985fa4b01a37e747b12029a in impala's branch 
refs/heads/master from Tim Armstrong
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=dcf4979 ]

IMPALA-9736: fix mt_dop not supported error

The error was not accurate, because joins are now
supported. Also updated it to refer to DML statements
instead of table sinks to be more user-appropriate.

Change-Id: I8eb8106f86c47a14cc951c4a77966fe51b5c30e3
Reviewed-on: http://gerrit.cloudera.org:8080/15884
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 


> "MT_DOP not supported for plans with base table joins or table sinks" error 
> is out of date
> --
>
> Key: IMPALA-9736
> URL: https://issues.apache.org/jira/browse/IMPALA-9736
> Project: IMPALA
>  Issue Type: Bug
>  Components: Frontend
>Reporter: Tim Armstrong
>Assignee: Tim Armstrong
>Priority: Major
>
> We do support base table joins now.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org