awesome work Appy! That's certainly good news to hear.
On Mon, Sep 12, 2016 at 2:14 PM, Apekshit Sharma <a...@cloudera.com> wrote: > On a separate note: > Trunk had 8 green runs in last 3 days! ( > https://builds.apache.org/job/HBase-Trunk_matrix/) > This was due to fixing just the mass failures on trunk and no change in > flaky infra. Which made me to conclude two things: > 1. Flaky infra works. > 2. It relies heavily on the post-commit build's stability (which every > project should anyways strive for). If the build fails catastrophically > once in a while, we can just exclude that one run using a flag and > everything will work, but if it happens frequently, then it won't work > right. > > I have re-enabled Flaky tests job ( > https://builds.apache.org/view/H-L/view/HBase/job/HBASE-Flaky-Tests/) which > was disabled for almost a month due to trunk being on fire. > I will keep an eye on how things are going. > > > On Mon, Sep 12, 2016 at 2:02 PM, Apekshit Sharma <a...@cloudera.com> wrote: > >> @Sean, Mikhail: I found the alternate solution. Using user defined axis, >> tool environment and env variable injection. >> See latest diff to https://builds.apache.org/job/HBase-Trunk_matrix/ job >> for reference. >> >> >> On Tue, Aug 30, 2016 at 7:39 PM, Mikhail Antonov <olorinb...@gmail.com> >> wrote: >> >>> FYI, I did the same for branch-1.3 builds. I've disabled hbase-1.3 and >>> hbase-1.3-IT jobs and instead created >>> >>> https://builds.apache.org/job/HBase-1.3-JDK8 and >>> https://builds.apache.org/job/HBase-1.3-JDK7 >>> >>> This should work for now until we figure out how to move forward. >>> >>> Thanks, >>> Mikhail >>> >>> On Wed, Aug 17, 2016 at 1:41 PM, Sean Busbey <bus...@cloudera.com> wrote: >>> >>> > /me smacks forehead >>> > >>> > these replacement jobs, of course, also have special characters in >>> > their names which then show up in the working path. >>> > >>> > renaming them to skip spaces and parens. >>> > >>> > On Wed, Aug 17, 2016 at 1:34 PM, Sean Busbey <sean.bus...@gmail.com> >>> > wrote: >>> > > FYI, it looks like essentially our entire CI suite is red, probably >>> due >>> > to >>> > > parts of our codebase not tolerating spaces or other special >>> characters >>> > in >>> > > the working directory. >>> > > >>> > > I've made a stop-gap non-multi-configuration set of jobs for running >>> unit >>> > > tests for the 1.2 branch against JDK 7 and JDK 8: >>> > > >>> > > https://builds.apache.org/view/H-L/view/HBase/job/HBase% >>> > 201.2%20(JDK%201.7)/ >>> > > >>> > > https://builds.apache.org/view/H-L/view/HBase/job/HBase% >>> > 201.2%20(JDK%201.8)/ >>> > > >>> > > Due to the lack of response from infra@ I suspect our only options >>> for >>> > > continuing on ASF infra is to fix whatever part of our build doesn't >>> > > tolerate the new paths, or stop using multiconfiguration deployments. >>> I >>> > am >>> > > obviously less than thrilled at the idea of having several multiples >>> of >>> > > current jobs. >>> > > >>> > > >>> > > On Wed, Aug 10, 2016 at 6:28 PM, Sean Busbey <bus...@cloudera.com> >>> > wrote: >>> > > >>> > >> Ugh. >>> > >> >>> > >> I sent a reply to Gav on builds@ about maybe getting names that >>> don't >>> > >> have spaces in them: >>> > >> >>> > >> https://lists.apache.org/thread.html/8ac03dc62f9d6862d4f3d5eb37119c >>> > >> 9c73b4059aaa3ebba52fc63bb6@%3Cbuilds.apache.org%3E >>> > >> >>> > >> In the mean time, is this an issue we need file with Hadoop or >>> > >> something we need to fix in our own code? >>> > >> >>> > >> On Wed, Aug 10, 2016 at 6:04 PM, Matteo Bertozzi >>> > >> <theo.berto...@gmail.com> wrote: >>> > >> > There are a bunch of builds that have most of the test failing. >>> > >> > >>> > >> > Example: >>> > >> > https://builds.apache.org/job/HBase-Trunk_matrix/1392/jdk= >>> > >> JDK%201.7%20(latest),label=yahoo-not-h2/testReport/junit/ >>> > >> org.apache.hadoop.hbase/TestLocalHBaseCluster/testLocalHBaseCluster/ >>> > >> > >>> > >> > from the stack trace looks like the problem is with the jdk name >>> that >>> > has >>> > >> > spaces: >>> > >> > the hadoop FsVolumeImpl calls setNameFormat(... + >>> fileName.toString() >>> > + >>> > >> ...) >>> > >> > and this seems to not be escaped >>> > >> > so we end up with JDK%25201.7%2520(latest) in the string format >>> and we >>> > >> get >>> > >> > a IllegalFormatPrecisionException: 7 >>> > >> > >>> > >> > 2016-08-10 22:07:46,108 WARN [DataNode: >>> > >> > [[[DISK]file:/home/jenkins/jenkins-slave/workspace/HBase- >>> > >> Trunk_matrix/jdk/JDK%25201.7%2520(latest)/label/yahoo-not- >>> > >> h2/hbase-server/target/test-data/e7099624-ecfa-4674-87de- >>> > >> a8733d13b582/dfscluster_10fdcfc3-cd1b-45be-9b5a- >>> > >> 9c88f385e6f1/dfs/data/data1/, >>> > >> > [DISK]file:/home/jenkins/jenkins-slave/workspace/HBase- >>> > >> Trunk_matrix/jdk/JDK%25201.7%2520(latest)/label/yahoo-not- >>> > >> h2/hbase-server/target/test-data/e7099624-ecfa-4674-87de- >>> > >> a8733d13b582/dfscluster_10fdcfc3-cd1b-45be-9b5a- >>> > >> 9c88f385e6f1/dfs/data/data2/]] >>> > >> > heartbeating to localhost/127.0.0.1:34629] >>> > >> > datanode.BPServiceActor(831): Unexpected exception in block pool >>> Block >>> > >> > pool <registering> (Datanode Uuid unassigned) service to >>> > >> > localhost/127.0.0.1:34629 >>> > >> > java.util.IllegalFormatPrecisionException: 7 >>> > >> > at java.util.Formatter$FormatSpecifier.checkText( >>> > >> Formatter.java:2984) >>> > >> > at java.util.Formatter$FormatSpecifier.<init>( >>> > >> Formatter.java:2688) >>> > >> > at java.util.Formatter.parse(Formatter.java:2528) >>> > >> > at java.util.Formatter.format(Formatter.java:2469) >>> > >> > at java.util.Formatter.format(Formatter.java:2423) >>> > >> > at java.lang.String.format(String.java:2792) >>> > >> > at com.google.common.util.concurrent.ThreadFactoryBuilder. >>> > >> setNameFormat(ThreadFactoryBuilder.java:68) >>> > >> > at org.apache.hadoop.hdfs.server.datanode.fsdataset.impl. >>> > >> FsVolumeImpl.initializeCacheExecutor(FsVolumeImpl.java:140) >>> > >> > >>> > >> > >>> > >> > >>> > >> > Matteo >>> > >> > >>> > >> > >>> > >> > On Tue, Aug 9, 2016 at 9:55 AM, Stack <st...@duboce.net> wrote: >>> > >> > >>> > >> >> Good on you Sean. >>> > >> >> S >>> > >> >> >>> > >> >> On Mon, Aug 8, 2016 at 9:43 PM, Sean Busbey <bus...@apache.org> >>> > wrote: >>> > >> >> >>> > >> >> > I updated all of our jobs to use the updated JDK versions from >>> > infra. >>> > >> >> > These have spaces in the names, and those names end up in our >>> > >> >> > workspace path, so try to keep an eye out. >>> > >> >> > >>> > >> >> > >>> > >> >> > >>> > >> >> > On Mon, Aug 8, 2016 at 10:42 AM, Sean Busbey < >>> bus...@cloudera.com> >>> > >> >> wrote: >>> > >> >> > > running in docker is the default now. relying on the default >>> > docker >>> > >> >> > > image that comes with Yetus means that our protoc checks are >>> > >> >> > > failing[1]. >>> > >> >> > > >>> > >> >> > > >>> > >> >> > > [1]: https://issues.apache.org/jira/browse/HBASE-16373 >>> > >> >> > > >>> > >> >> > > On Sat, Aug 6, 2016 at 5:03 PM, Sean Busbey < >>> bus...@apache.org> >>> > >> wrote: >>> > >> >> > >> Hi folks! >>> > >> >> > >> >>> > >> >> > >> this morning I merged the patch that updates us to Yetus >>> > 0.3.0[1] >>> > >> and >>> > >> >> > updated the precommit job appropriately. I also changed it to >>> use >>> > one >>> > >> of >>> > >> >> > the Java versions post the puppet changes to asf build. >>> > >> >> > >> >>> > >> >> > >> The last three builds look normal (#2975 - #2977). I'm gonna >>> try >>> > >> >> > running things in docker next. I'll email again when I make it >>> the >>> > >> >> default. >>> > >> >> > >> >>> > >> >> > >> [1]: https://issues.apache.org/jira/browse/HBASE-15882 >>> > >> >> > >> >>> > >> >> > >> On 2016-06-16 10:43 (-0500), Sean Busbey <bus...@apache.org> >>> > >> wrote: >>> > >> >> > >>> FYI, today our precommit jobs started failing because our >>> > chosen >>> > >> jdk >>> > >> >> > >>> (1.7.0.79) disappeared (mentioned on HBASE-16032). >>> > >> >> > >>> >>> > >> >> > >>> Initially we were doing something wrong, namely directly >>> > >> referencing >>> > >> >> > >>> the jenkins build tools area without telling jenkins to give >>> > us an >>> > >> >> env >>> > >> >> > >>> variable that stated where the jdk is located. However, >>> after >>> > >> >> > >>> attempting to switch to the appropriate tooling variable for >>> > jdk >>> > >> >> > >>> 1.7.0.79, I found that it didn't point to a place that >>> worked. >>> > >> >> > >>> >>> > >> >> > >>> I've now updated the job to rely on the latest 1.7 jdk, >>> which >>> > is >>> > >> >> > >>> currently 1.7.0.80. I don't know how often "latest" updates. >>> > >> >> > >>> >>> > >> >> > >>> Personally, I think this is a sign that we need to >>> prioritize >>> > >> >> > >>> HBASE-15882 so that we can switch back to using Docker. I >>> won't >>> > >> have >>> > >> >> > >>> time this week, so if anyone else does please pick up the >>> > ticket. >>> > >> >> > >>> >>> > >> >> > >>> On Thu, Mar 17, 2016 at 5:19 PM, Stack <st...@duboce.net> >>> > wrote: >>> > >> >> > >>> > Thanks Sean. >>> > >> >> > >>> > St.Ack >>> > >> >> > >>> > >>> > >> >> > >>> > On Wed, Mar 16, 2016 at 12:04 PM, Sean Busbey < >>> > >> bus...@cloudera.com >>> > >> >> > >>> > >> >> > wrote: >>> > >> >> > >>> > >>> > >> >> > >>> >> FYI, I updated the precommit job today to specify that >>> only >>> > >> >> compile >>> > >> >> > time >>> > >> >> > >>> >> checks should be done against jdks other than the primary >>> > jdk7 >>> > >> >> > instance. >>> > >> >> > >>> >> >>> > >> >> > >>> >> On Mon, Mar 7, 2016 at 8:43 PM, Sean Busbey < >>> > >> bus...@cloudera.com> >>> > >> >> > wrote: >>> > >> >> > >>> >> >>> > >> >> > >>> >> > I tested things out, and while YETUS-297[1] is present >>> the >>> > >> >> > default runs >>> > >> >> > >>> >> > all plugins that can do multiple jdks against those >>> > available >>> > >> >> > (jdk7 and >>> > >> >> > >>> >> > jdk8 in our case). >>> > >> >> > >>> >> > >>> > >> >> > >>> >> > We can configure things to only do a single run of unit >>> > >> tests. >>> > >> >> > They'll be >>> > >> >> > >>> >> > against jdk7, since that is our default jdk. That fine >>> by >>> > >> >> > everyone? It'll >>> > >> >> > >>> >> > save ~1.5 hours on any build that hits hbase-server. >>> > >> >> > >>> >> > >>> > >> >> > >>> >> > On Mon, Mar 7, 2016 at 1:22 PM, Stack < >>> st...@duboce.net> >>> > >> wrote: >>> > >> >> > >>> >> > >>> > >> >> > >>> >> >> Hurray! >>> > >> >> > >>> >> >> >>> > >> >> > >>> >> >> It looks like YETUS-96 is in there and we are only >>> > running >>> > >> on >>> > >> >> > jdk build >>> > >> >> > >>> >> >> now, the default (but testing compile against >>> both).... >>> > Will >>> > >> >> > keep an >>> > >> >> > >>> >> eye. >>> > >> >> > >>> >> >> >>> > >> >> > >>> >> >> St.Ack >>> > >> >> > >>> >> >> >>> > >> >> > >>> >> >> >>> > >> >> > >>> >> >> On Mon, Mar 7, 2016 at 10:27 AM, Sean Busbey < >>> > >> >> > bus...@cloudera.com> >>> > >> >> > >>> >> wrote: >>> > >> >> > >>> >> >> >>> > >> >> > >>> >> >> > FYI, I've just updated our precommit jobs to use the >>> > 0.2.0 >>> > >> >> > release of >>> > >> >> > >>> >> >> Yetus >>> > >> >> > >>> >> >> > that came out today. >>> > >> >> > >>> >> >> > >>> > >> >> > >>> >> >> > After keeping an eye out for strangeness today I'll >>> > turn >>> > >> >> > docker mode >>> > >> >> > >>> >> >> back >>> > >> >> > >>> >> >> > on by default tonight. >>> > >> >> > >>> >> >> > >>> > >> >> > >>> >> >> > On Wed, Jan 13, 2016 at 10:14 AM, Sean Busbey < >>> > >> >> > bus...@apache.org> >>> > >> >> > >>> >> >> wrote: >>> > >> >> > >>> >> >> > >>> > >> >> > >>> >> >> > > FYI, I added a new parameter to the precommit job: >>> > >> >> > >>> >> >> > > >>> > >> >> > >>> >> >> > > * USE_YETUS_PRERELEASE - causes us to use the >>> HEAD of >>> > >> the >>> > >> >> > >>> >> apache/yetus >>> > >> >> > >>> >> >> > > repo rather than our chosen release >>> > >> >> > >>> >> >> > > >>> > >> >> > >>> >> >> > > It defaults to inactive, but can be used in >>> > >> >> > manually-triggered runs >>> > >> >> > >>> >> to >>> > >> >> > >>> >> >> > > test a solution to a problem in the yetus >>> library. At >>> > >> the >>> > >> >> > moment, >>> > >> >> > >>> >> I'm >>> > >> >> > >>> >> >> > > using it to test a solution to default module >>> > ordering >>> > >> as >>> > >> >> > seen in >>> > >> >> > >>> >> >> > > HBASE-15075. >>> > >> >> > >>> >> >> > > >>> > >> >> > >>> >> >> > > On Fri, Jan 8, 2016 at 7:58 AM, Sean Busbey < >>> > >> >> > bus...@cloudera.com> >>> > >> >> > >>> >> >> wrote: >>> > >> >> > >>> >> >> > > > FYI, I just pushed HBASE-13525 (switch to Apache >>> > Yetus >>> > >> >> for >>> > >> >> > >>> >> precommit >>> > >> >> > >>> >> >> > > tests) >>> > >> >> > >>> >> >> > > > and updated our jenkins precommit build to use >>> it. >>> > >> >> > >>> >> >> > > > >>> > >> >> > >>> >> >> > > > Jenkins job has some explanation: >>> > >> >> > >>> >> >> > > > >>> > >> >> > >>> >> >> > > >>> > >> >> > >>> >> >> > >>> > >> >> > >>> >> >> >>> > >> >> > >>> >> https://builds.apache.org/view/PreCommit%20Builds/job/ >>> > >> >> > PreCommit-HBASE-Build/ >>> > >> >> > >>> >> >> > > > >>> > >> >> > >>> >> >> > > > Release note from HBASE-13525 does as well. >>> > >> >> > >>> >> >> > > > >>> > >> >> > >>> >> >> > > > The old job will stick around here for a couple >>> of >>> > >> weeks, >>> > >> >> > in case >>> > >> >> > >>> >> we >>> > >> >> > >>> >> >> > need >>> > >> >> > >>> >> >> > > > to refer back to it: >>> > >> >> > >>> >> >> > > > >>> > >> >> > >>> >> >> > > > >>> > >> >> > >>> >> >> > > >>> > >> >> > >>> >> >> > >>> > >> >> > >>> >> >> >>> > >> >> > >>> >> https://builds.apache.org/view/PreCommit%20Builds/job/ >>> > >> >> > PreCommit-HBASE-Build-deprecated/ >>> > >> >> > >>> >> >> > > > >>> > >> >> > >>> >> >> > > > If something looks awry, please drop a note on >>> > >> >> HBASE-13525 >>> > >> >> > while >>> > >> >> > >>> >> it >>> > >> >> > >>> >> >> > > remains >>> > >> >> > >>> >> >> > > > open (and make a new issue after). >>> > >> >> > >>> >> >> > > > >>> > >> >> > >>> >> >> > > > >>> > >> >> > >>> >> >> > > > On Wed, Dec 2, 2015 at 3:22 PM, Stack < >>> > >> st...@duboce.net> >>> > >> >> > wrote: >>> > >> >> > >>> >> >> > > > >>> > >> >> > >>> >> >> > > >> As part of my continuing advocacy of >>> > >> builds.apache.org >>> > >> >> > and that >>> > >> >> > >>> >> >> their >>> > >> >> > >>> >> >> > > >> results are now worthy of our trust and >>> nurture, >>> > here >>> > >> >> are >>> > >> >> > some >>> > >> >> > >>> >> >> > > highlights >>> > >> >> > >>> >> >> > > >> from the last few days of builds: >>> > >> >> > >>> >> >> > > >> >>> > >> >> > >>> >> >> > > >> + hadoopqa is now finding zombies before the >>> > patch is >>> > >> >> > committed. >>> > >> >> > >>> >> >> > > >> HBASE-14888 showed "-1 core tests. The patch >>> > failed >>> > >> >> these >>> > >> >> > unit >>> > >> >> > >>> >> >> tests:" >>> > >> >> > >>> >> >> > > but >>> > >> >> > >>> >> >> > > >> didn't have any failed tests listed (I'm >>> trying to >>> > >> see >>> > >> >> if >>> > >> >> > I can >>> > >> >> > >>> >> do >>> > >> >> > >>> >> >> > > anything >>> > >> >> > >>> >> >> > > >> about this...). Running our little >>> > >> >> > >>> >> ./dev-tools/findHangingTests.py >>> > >> >> > >>> >> >> > > against >>> > >> >> > >>> >> >> > > >> the consoleText, it showed a hanging test. >>> Running >>> > >> >> > locally, I see >>> > >> >> > >>> >> >> same >>> > >> >> > >>> >> >> > > >> hang. This is before the patch landed. >>> > >> >> > >>> >> >> > > >> + Our branch runs are now near totally zombie >>> and >>> > >> flakey >>> > >> >> > free -- >>> > >> >> > >>> >> >> still >>> > >> >> > >>> >> >> > > some >>> > >> >> > >>> >> >> > > >> work to do -- but a recent patch that seemed >>> > harmless >>> > >> >> was >>> > >> >> > >>> >> causing a >>> > >> >> > >>> >> >> > > >> reliable flake fail in the backport to >>> branch-1* >>> > >> >> > confirmed by >>> > >> >> > >>> >> local >>> > >> >> > >>> >> >> > > runs. >>> > >> >> > >>> >> >> > > >> The flakeyness was plain to see up in >>> > >> builds.apache.org >>> > >> >> . >>> > >> >> > >>> >> >> > > >> + In the last few days I've committed a patch >>> that >>> > >> >> > included >>> > >> >> > >>> >> javadoc >>> > >> >> > >>> >> >> > > >> warnings even though hadoopqa said the patch >>> > >> introduced >>> > >> >> > javadoc >>> > >> >> > >>> >> >> issues >>> > >> >> > >>> >> >> > > (I >>> > >> >> > >>> >> >> > > >> missed it). This messed up life for folks >>> > >> subsequently >>> > >> >> as >>> > >> >> > their >>> > >> >> > >>> >> >> > patches >>> > >> >> > >>> >> >> > > now >>> > >> >> > >>> >> >> > > >> reported javadoc issues.... >>> > >> >> > >>> >> >> > > >> >>> > >> >> > >>> >> >> > > >> In short, I suggest that builds.apache.org is >>> > worth >>> > >> >> > keeping an >>> > >> >> > >>> >> eye >>> > >> >> > >>> >> >> > on, >>> > >> >> > >>> >> >> > > >> make >>> > >> >> > >>> >> >> > > >> sure you get a clean build out of hadoopqa >>> before >>> > >> >> > committing >>> > >> >> > >>> >> >> anything, >>> > >> >> > >>> >> >> > > and >>> > >> >> > >>> >> >> > > >> lets all work together to try and keep our >>> builds >>> > >> blue: >>> > >> >> > it'll >>> > >> >> > >>> >> save >>> > >> >> > >>> >> >> us >>> > >> >> > >>> >> >> > > all >>> > >> >> > >>> >> >> > > >> work in the long run. >>> > >> >> > >>> >> >> > > >> >>> > >> >> > >>> >> >> > > >> St.Ack >>> > >> >> > >>> >> >> > > >> >>> > >> >> > >>> >> >> > > >> >>> > >> >> > >>> >> >> > > >> On Tue, Nov 4, 2014 at 9:38 AM, Stack < >>> > >> st...@duboce.net >>> > >> >> > >>> > >> >> > wrote: >>> > >> >> > >>> >> >> > > >> >>> > >> >> > >>> >> >> > > >> > Branch-1 and master have stabilized and now >>> run >>> > >> mostly >>> > >> >> > blue >>> > >> >> > >>> >> >> (give or >>> > >> >> > >>> >> >> > > take >>> > >> >> > >>> >> >> > > >> > the odd failure) [1][2]. Having a mostly blue >>> > >> branch-1 >>> > >> >> > has >>> > >> >> > >>> >> >> helped us >>> > >> >> > >>> >> >> > > >> > identify at least one destabilizing commit in >>> > the >>> > >> last >>> > >> >> > few >>> > >> >> > >>> >> days, >>> > >> >> > >>> >> >> > maybe >>> > >> >> > >>> >> >> > > >> two; >>> > >> >> > >>> >> >> > > >> > this is as it should be (smile). >>> > >> >> > >>> >> >> > > >> > >>> > >> >> > >>> >> >> > > >> > Lets keep our builds blue. If you commit a >>> > patch, >>> > >> make >>> > >> >> > sure >>> > >> >> > >>> >> >> > subsequent >>> > >> >> > >>> >> >> > > >> > builds stay blue. You can subscribe to >>> > >> >> > bui...@hbase.apache.org >>> > >> >> > >>> >> >> to >>> > >> >> > >>> >> >> > get >>> > >> >> > >>> >> >> > > >> > notice of failures if not already subscribed. >>> > >> >> > >>> >> >> > > >> > >>> > >> >> > >>> >> >> > > >> > Thanks, >>> > >> >> > >>> >> >> > > >> > St.Ack >>> > >> >> > >>> >> >> > > >> > >>> > >> >> > >>> >> >> > > >> > 1. >>> > >> >> > >>> >> https://builds.apache.org/view/H-L/view/HBase/job/HBase- >>> > 1.0/ >>> > >> >> > >>> >> >> > > >> > 2. >>> > >> >> > >>> >> >> https://builds.apache.org/view >>> /H-L/view/HBase/job/HBase- >>> > >> TRUNK/ >>> > >> >> > >>> >> >> > > >> > >>> > >> >> > >>> >> >> > > >> > >>> > >> >> > >>> >> >> > > >> > On Mon, Oct 13, 2014 at 4:41 PM, Stack < >>> > >> >> > st...@duboce.net> >>> > >> >> > >>> >> wrote: >>> > >> >> > >>> >> >> > > >> > >>> > >> >> > >>> >> >> > > >> >> A few notes on testing. >>> > >> >> > >>> >> >> > > >> >> >>> > >> >> > >>> >> >> > > >> >> Too long to read, infra is more capable now >>> and >>> > >> after >>> > >> >> > some >>> > >> >> > >>> >> >> work, we >>> > >> >> > >>> >> >> > > are >>> > >> >> > >>> >> >> > > >> >> seeing branch-1 and trunk mostly running >>> blue. >>> > >> Lets >>> > >> >> > try and >>> > >> >> > >>> >> >> keep it >>> > >> >> > >>> >> >> > > this >>> > >> >> > >>> >> >> > > >> >> way going forward. >>> > >> >> > >>> >> >> > > >> >> >>> > >> >> > >>> >> >> > > >> >> Apache Infra has new, more capable hardware. >>> > >> >> > >>> >> >> > > >> >> >>> > >> >> > >>> >> >> > > >> >> A recent spurt of test fixing combined with >>> > more >>> > >> >> > capable >>> > >> >> > >>> >> >> hardware >>> > >> >> > >>> >> >> > > seems >>> > >> >> > >>> >> >> > > >> >> to have gotten us to a new place; tests are >>> > mostly >>> > >> >> > passing now >>> > >> >> > >>> >> >> on >>> > >> >> > >>> >> >> > > >> branch-1 >>> > >> >> > >>> >> >> > > >> >> and master. Lets try and keep it this way >>> and >>> > >> start >>> > >> >> > to trust >>> > >> >> > >>> >> >> our >>> > >> >> > >>> >> >> > > test >>> > >> >> > >>> >> >> > > >> runs >>> > >> >> > >>> >> >> > > >> >> again. Just a few flakies remain. Lets try >>> > and >>> > >> nail >>> > >> >> > them. >>> > >> >> > >>> >> >> > > >> >> >>> > >> >> > >>> >> >> > > >> >> Our tests now run in parallel with other >>> test >>> > >> suites >>> > >> >> > where >>> > >> >> > >>> >> >> previous >>> > >> >> > >>> >> >> > > we >>> > >> >> > >>> >> >> > > >> >> ran alone. You can see this sometimes when >>> our >>> > >> zombie >>> > >> >> > detector >>> > >> >> > >>> >> >> > > reports >>> > >> >> > >>> >> >> > > >> >> tests from another project altogether as >>> > lingerers >>> > >> >> (To >>> > >> >> > be >>> > >> >> > >>> >> >> fixed). >>> > >> >> > >>> >> >> > > Some >>> > >> >> > >>> >> >> > > >> of >>> > >> >> > >>> >> >> > > >> >> our tests are failing because a concurrent >>> > hbase >>> > >> run >>> > >> >> is >>> > >> >> > >>> >> undoing >>> > >> >> > >>> >> >> > > classes >>> > >> >> > >>> >> >> > > >> and >>> > >> >> > >>> >> >> > > >> >> data from under it. Also, lets fix. >>> > >> >> > >>> >> >> > > >> >> >>> > >> >> > >>> >> >> > > >> >> Our tests are brittle. It takes 75minutes >>> for >>> > >> them to >>> > >> >> > >>> >> complete. >>> > >> >> > >>> >> >> > Many >>> > >> >> > >>> >> >> > > >> are >>> > >> >> > >>> >> >> > > >> >> heavy-duty integration tests starting up >>> > multiple >>> > >> >> > clusters and >>> > >> >> > >>> >> >> > > mapreduce >>> > >> >> > >>> >> >> > > >> >> all in the one JVM. It is a miracle they >>> pass >>> > at >>> > >> all. >>> > >> >> > Usually >>> > >> >> > >>> >> >> > > >> integration >>> > >> >> > >>> >> >> > > >> >> tests have been cast as unit tests because >>> > there >>> > >> was >>> > >> >> > no where >>> > >> >> > >>> >> >> else >>> > >> >> > >>> >> >> > > for >>> > >> >> > >>> >> >> > > >> them >>> > >> >> > >>> >> >> > > >> >> to get an airing. We have the hbase-it >>> suite >>> > now >>> > >> >> > which would >>> > >> >> > >>> >> >> be a >>> > >> >> > >>> >> >> > > more >>> > >> >> > >>> >> >> > > >> apt >>> > >> >> > >>> >> >> > > >> >> place but until these are run on a regular >>> > basis >>> > >> in >>> > >> >> > public for >>> > >> >> > >>> >> >> all >>> > >> >> > >>> >> >> > to >>> > >> >> > >>> >> >> > > >> see, >>> > >> >> > >>> >> >> > > >> >> the fat integration tests disguised as unit >>> > tests >>> > >> >> will >>> > >> >> > remain. >>> > >> >> > >>> >> >> A >>> > >> >> > >>> >> >> > > >> review of >>> > >> >> > >>> >> >> > > >> >> our current unit tests weeding the old cruft >>> > and >>> > >> the >>> > >> >> > no longer >>> > >> >> > >>> >> >> > > relevant >>> > >> >> > >>> >> >> > > >> or >>> > >> >> > >>> >> >> > > >> >> duplicates would be a nice undertaking if >>> > someone >>> > >> is >>> > >> >> > looking >>> > >> >> > >>> >> to >>> > >> >> > >>> >> >> > > >> contribute. >>> > >> >> > >>> >> >> > > >> >> >>> > >> >> > >>> >> >> > > >> >> Alex Newman has been working on making our >>> > tests >>> > >> work >>> > >> >> > up on >>> > >> >> > >>> >> >> travis >>> > >> >> > >>> >> >> > > and >>> > >> >> > >>> >> >> > > >> >> circle-ci. That'll be sweet when it goes >>> > >> end-to-end. >>> > >> >> > He also >>> > >> >> > >>> >> >> > added >>> > >> >> > >>> >> >> > > in >>> > >> >> > >>> >> >> > > >> >> some "type" categorizations -- client, >>> filter, >>> > >> >> > mapreduce -- >>> > >> >> > >>> >> >> > alongside >>> > >> >> > >>> >> >> > > >> our >>> > >> >> > >>> >> >> > > >> >> old "sizing" categorizations of >>> > >> small/medium/large. >>> > >> >> > His >>> > >> >> > >>> >> >> thinking >>> > >> >> > >>> >> >> > is >>> > >> >> > >>> >> >> > > >> that >>> > >> >> > >>> >> >> > > >> >> we can run these categorizations in parallel >>> > so we >>> > >> >> > could run >>> > >> >> > >>> >> the >>> > >> >> > >>> >> >> > > total >>> > >> >> > >>> >> >> > > >> >> suite in about the time of the longest test, >>> > say >>> > >> >> > 20-30minutes? >>> > >> >> > >>> >> >> We >>> > >> >> > >>> >> >> > > could >>> > >> >> > >>> >> >> > > >> >> even change Apache to run them this way. >>> > >> >> > >>> >> >> > > >> >> >>> > >> >> > >>> >> >> > > >> >> FYI, >>> > >> >> > >>> >> >> > > >> >> St.Ack >>> > >> >> > >>> >> >> > > >> >> >>> > >> >> > >>> >> >> > > >> >> >>> > >> >> > >>> >> >> > > >> >> >>> > >> >> > >>> >> >> > > >> >> >>> > >> >> > >>> >> >> > > >> >> >>> > >> >> > >>> >> >> > > >> >> >>> > >> >> > >>> >> >> > > >> >> >>> > >> >> > >>> >> >> > > >> > >>> > >> >> > >>> >> >> > > >> >>> > >> >> > >>> >> >> > > > >>> > >> >> > >>> >> >> > > > >>> > >> >> > >>> >> >> > > > >>> > >> >> > >>> >> >> > > > -- >>> > >> >> > >>> >> >> > > > Sean >>> > >> >> > >>> >> >> > > >>> > >> >> > >>> >> >> > >>> > >> >> > >>> >> >> > >>> > >> >> > >>> >> >> > >>> > >> >> > >>> >> >> > -- >>> > >> >> > >>> >> >> > busbey >>> > >> >> > >>> >> >> > >>> > >> >> > >>> >> >> >>> > >> >> > >>> >> > >>> > >> >> > >>> >> > >>> > >> >> > >>> >> > >>> > >> >> > >>> >> > -- >>> > >> >> > >>> >> > busbey >>> > >> >> > >>> >> > >>> > >> >> > >>> >> >>> > >> >> > >>> >> >>> > >> >> > >>> >> >>> > >> >> > >>> >> -- >>> > >> >> > >>> >> busbey >>> > >> >> > >>> >> >>> > >> >> > >>> >>> > >> >> > > >>> > >> >> > > >>> > >> >> > > >>> > >> >> > > -- >>> > >> >> > > busbey >>> > >> >> > >>> > >> >> >>> > >> >>> > >> >>> > >> >>> > >> -- >>> > >> busbey >>> > >> >>> > > >>> > > >>> > > >>> > > -- >>> > > Sean >>> > >>> > >>> > >>> > -- >>> > busbey >>> > >>> >>> >>> >>> -- >>> Thanks, >>> Michael Antonov >>> >> >> >> >> -- >> >> -- Appy >> > > > > -- > > -- Appy -- busbey