Re: Jenkins build failures?

2016-05-25 Thread Josh Elser
+1 Great digging, Sergey! Sergey Soldatov wrote: James, Sure. I will file a JIRA and check about non zero thread pool size (not sure that would help since it's initialized in getDefaultExecutor and always used if no other pool is provided in HTable constructor). Thanks, Sergey On Mon, May 23, 2

Re: Jenkins build failures?

2016-05-24 Thread Sergey Soldatov
James, Sure. I will file a JIRA and check about non zero thread pool size (not sure that would help since it's initialized in getDefaultExecutor and always used if no other pool is provided in HTable constructor). Thanks, Sergey On Mon, May 23, 2016 at 8:11 PM, James Taylor wrote: > Thanks, Serg

Re: Jenkins build failures?

2016-05-23 Thread James Taylor
Thanks, Sergey. Sounds like you're on to it. We could try configuring those tests with a non zero thread pool size so they don't use SynchronousQueue. Want to file a JIRA with this info so we don't lose track of it? James On Tue, May 17, 2016 at 11:21 PM, Sergey Soldatov wrote: > Getting ba

Re: Jenkins build failures?

2016-05-17 Thread Sergey Soldatov
Getting back to the failures with OOM/unable to create a native thread. Those files have around 100 tests inside each that are running on top of the phoenix. In total they generate over 2500 scans. (system.catalog, sequences and regular scans over table). The problem that on HBase side all scans a

Re: Jenkins build failures?

2016-05-05 Thread James Taylor
Looks like all Jenkins builds are failing, but it seems environmental? Do we need to exclude some particular kind of host(s)? On Wed, May 4, 2016 at 5:25 PM, James Taylor wrote: > Thanks, Sergey! > > On Wed, May 4, 2016 at 5:22 PM, Sergey Soldatov > wrote: > >> James, >> Ah, didn't notice that

Re: Jenkins build failures?

2016-05-04 Thread James Taylor
Thanks, Sergey! On Wed, May 4, 2016 at 5:22 PM, Sergey Soldatov wrote: > James, > Ah, didn't notice that timeouts are not shown in the final report as > failures. It seems that the build is using JDK 1.7 and test run OOM > with PermGen space. Fixed in PHOENIX-2879 > > Thanks, > Sergey > > On Wed

Re: Jenkins build failures?

2016-05-04 Thread Sergey Soldatov
James, Ah, didn't notice that timeouts are not shown in the final report as failures. It seems that the build is using JDK 1.7 and test run OOM with PermGen space. Fixed in PHOENIX-2879 Thanks, Sergey On Wed, May 4, 2016 at 1:48 PM, James Taylor wrote: > Sergey, on master branch (which is HBase

Re: Jenkins build failures?

2016-05-04 Thread James Taylor
Sergey, on master branch (which is HBase 1.2): https://builds.apache.org/job/Phoenix-master/1214/console On Wed, May 4, 2016 at 1:31 PM, Sergey Soldatov wrote: > James, > Regarding HivePhoenixStoreIT. Are you talking about > Phoenix-4.x-HBase-1.0 job? Last build passed it successfully. > > > On

Re: Jenkins build failures?

2016-05-04 Thread Sergey Soldatov
James, Regarding HivePhoenixStoreIT. Are you talking about Phoenix-4.x-HBase-1.0 job? Last build passed it successfully. On Wed, May 4, 2016 at 10:15 AM, James Taylor wrote: > Our Jenkins builds have improved, but we're seeing some issues: > - timeouts with the new org.apache.phoenix.hive.HiveP

Re: Jenkins build failures?

2016-05-04 Thread James Taylor
Our Jenkins builds have improved, but we're seeing some issues: - timeouts with the new org.apache.phoenix.hive.HivePhoenixStoreIT test. - consistent failure with 4.x-HBase-1.1 build. I suspect that Jenkins build is out-of-date, as we haven't had a 4.x-HBase-1.1 branch for quite a while. There's li

Re: Jenkins build failures?

2016-04-30 Thread James Taylor
The defaults when tests are running are much lower than the standard Phoenix defaults (see QueryServicesTestImpl and BaseTest.setUpConfigForMiniCluster()). It's unclear to me why the HashJoinIT and SortMergeJoinIT tests (I think these are the culprits) do not seem to adhere to these (or maybe overr

Re: Jenkins build failures?

2016-04-30 Thread Nick Dimiduk
The default thread pool sizes for HDFS, HBase, ZK, and the Phoenix client are all contributing to this huge thread count. A good starting point would be to take a jstack of the IT process and count, group by threads with similar name. Reconfigure to reduce all those groups to something like 10 eac

Re: Jenkins build failures?

2016-04-29 Thread Sergey Soldatov
but the way, we need to do something with those OOMs and "unable to create new native thread" in ITs. It's quite strange to see in 10 lines test such kind of failures. Especially when queries for table with less than 10 rows generate over 2500 threads. Does anybody know whether it's zk related issu

Re: Jenkins build failures?

2016-04-29 Thread James Taylor
A patch would be much appreciated, Sergey. On Fri, Apr 29, 2016 at 3:26 AM, Sergey Soldatov wrote: > As for flume module - flume-ng is coming with commons-io 2.1 while > hadoop & hbase require org.apache.commons.io.Charsets which was > introduced in 2.3. Easy way is to move dependency on flume-n

Re: Jenkins build failures?

2016-04-29 Thread Sergey Soldatov
As for flume module - flume-ng is coming with commons-io 2.1 while hadoop & hbase require org.apache.commons.io.Charsets which was introduced in 2.3. Easy way is to move dependency on flume-ng after the dependencies on hbase/hadoop. The last thing about ConcurrentHashMap - it definitely means that

Re: Jenkins build failures?

2016-04-28 Thread James Taylor
Thanks for looking into this, Josh - really appreciate it. The NewerTableAlreadyExistsException occurs if a test derived from BaseClientManagedTimeIT does not set the CURRENT_SCN_ATTRIB property on the connection when creating a table. This will end up occurring in a *different* test than the test

Re: Jenkins build failures?

2016-04-28 Thread Josh Elser
Also, didn't mean to ignore you Sergey. I haven't seen anything at the moment which pointed to getting more info from -X (general flakiness from the ASF jenkins boxes is... the norm). We can definitely do this if we think there is something concrete that we're missing. Sergey Soldatov wrote:

Re: Jenkins build failures?

2016-04-28 Thread Josh Elser
*tl;dr* * I'm removing ubuntu-us1 from all pools * Phoenix-Flume ITs look busted * UpsertValuesIT looks busted * Something is weirdly wrong with Phoenix-4.x-HBase-1.1 in its entirety. Details below... It looks like we have a bunch of different reasons for the failures. Starting with Phoenix-ma

Re: Jenkins build failures?

2016-04-28 Thread Sergey Soldatov
Can we add -X to the mvn? All recent failures happen right before integration tests started without any reasonable error message. Possible an additional debug info would help. Thanks, Sergey On Tue, Apr 26, 2016 at 3:47 PM, James Taylor wrote: > Anyone know why our Jenkins builds keep failing? I

Jenkins build failures?

2016-04-26 Thread James Taylor
Anyone know why our Jenkins builds keep failing? Is it environmental and is there anything we can do about it? Thanks, James