Hi Zoli! Thanks for pointing this out + the details. As we discussed in private, in the future, due to some tez api changes (used by LLAP), there is going to be a new Tez release anyway (it also depends on whether a particular tez branch reached the target hadoop version afaik). So it's crucial to find out if there's something in 0.9.1 -> 0.9.2 which could block hive unit tests in order to move forward...I'm about to get back to this in a couple of weeks (depending on workload :) ).
Regards, Laci On Fri, 19 Jun 2020 at 14:43, Zoltan Haindrich <[email protected]> wrote: > Hey all! > > We've tried to upgrade to Tez to 0.9.2 in Hive - but we have hit some > non-deterministic hang issue. > So we've rolled back to 0.9.1 for now (see:[1]). > > I've collected some jstacks from the running tests which have were stuck > for more than 20 hours - there are 2-3 tests which were able to trigger it: > > https://termbin.com/z1eoc > https://termbin.com/2m0j > https://termbin.com/027t > https://termbin.com/1dbe > > There is a job which could be used to check/reproduce the issue: > http://130.211.9.232/job/hive-flaky-check/51/ > > Let me know if I could help! > > [1] > https://markmail.org/search/?q=hive+dev#query:hive%20dev%20list%3Aorg.apache.hadoop.hive-dev%20order%3Adate-backward+page:1+mid:vjax2iylwjumo5pv+state:results > > cheers, > Zoltan >
