Thanks. JIRAs for flaky tests welcome. Better if there are patches too. :-)
> On Feb 5, 2019, at 1:25 AM, Xu Cang <[email protected]> wrote: > > Thanks to Peter's link. I checked my failed tests, they are all in this > flaky tests list. (Going to create some JIRAs for flaky tests if there > aren't now) > > +1 for this release now. > > Best, > Xu > > >> On Tue, Feb 5, 2019 at 12:36 AM Peter Somogyi <[email protected]> wrote: >> >> Just like Xu Cang I ran into similar test failures on Debian and many of >> these are on the flaky list for branch-1 with 100% flakyness. >> >> https://builds.apache.org/view/H-L/view/HBase/job/HBase-Find-Flaky-Tests/job/branch-1/lastSuccessfulBuild/artifact/dashboard.html >> >> These are the failed tests for me: >> regionserver.TestRegionServerMetrics >> client.TestRegionLocationCaching >> client.TestHCM >> client.TestClientOperationInterrupt >> util.TestHBaseFsck >> master.cleaner.TestSnapshotFromMaster >> master.TestMasterShutdown >> replication.TestReplicationDroppedTables >> filter.TestFilterListOrOperatorWithBlkCnt >> mapreduce.TestSecureLoadIncrementalHFilesSplitRecovery >> mapreduce.TestLoadIncrementalHFilesSplitRecovery >> >> On Tue, Feb 5, 2019 at 7:30 AM 张铎(Duo Zhang) <[email protected]> >> wrote: >> >>> I think HBASE-21727 should be partially reverted since it removed a >> public >>> method in HBaseConfiguration which is marked as IA.Public? >>> >>> [email protected] <[email protected]> 于2019年2月5日周二 上午1:54写道: >>> >>>> Based on my own testing I was going to vote +1.I built 1.5.0 from >>> source, >>>> and ran it with the tip of the Phoenix 4.x. >>>> I regularly load a lot of data, execute Phoenix queries, etc. Nothing >>>> undue, nothing undue in the logs either. >>>> I'll try to reproduce the test failures. Since Andy can't reproduce >> them >>>> there is something flaky, most likely it's the tests, but that's, of >>>> course, hard to say. >>>> -- Lars >>>> On Saturday, February 2, 2019, 4:03:53 PM PST, Andrew Purtell < >>>> [email protected]> wrote: >>>> >>>> Thanks. As I am not able to produce those unit test results we will >> need >>>> your help to diagnose the issues. Please file JIRAs as needed, post the >>>> test output detail, etc. Thanks for trying the candidate out! >>>> >>>> The ITBLL results may be a tool usage problem. The numbers in the >> failure >>>> messages you posted are too round. I expect real failures to produce >> more >>>> irregular numbers. ITBLL can a bit hard to use. Contact me offline and >>> I’ll >>>> give you notes on how I ran the tests myself. >>>> >>>> >>>>> On Feb 2, 2019, at 3:45 PM, Xu Cang <[email protected]> >>>> wrote: >>>>> >>>>> 2 jars sha12 verification: pass. >>>>> Basic UI check: pass. >>>>> Unit test. Some failures in hbase - server package. (see details >>> below, >>>>> not sure if these are flaky tests) >>>>> ITBLL tests with slowDeterministic and serverKilling monky. Both got >>> some >>>>> failures. (Not sure if this is my environment issue since I am using >>> *my >>>>> laptop* to conduct this testing) >>>>> Not voting for now since I have some doubts regarding my testing >>> result. >>>>> Will keep looking. >>>>> >>>>> >>>>> - *Unit test failure: (failures are reproducable)* >>>>> >>>>> [INFO] Results: >>>>> [INFO] >>>>> [ERROR] Failures: >>>>> [ERROR] >>>>> >>>> >>> >> TestRegionLocationCaching.testCachingForHTableMultiPut:133->checkRegionLocationIsCached:148 >>>>> Expected non-zero number of cached region locations. Actual: 0 >>>>> [ERROR] >>>>> >>>> >>> >> TestRegionLocationCaching.testCachingForHTableMultiplexerMultiPut:95->checkRegionLocationIsCached:148 >>>>> Expected non-zero number of cached region locations. Actual: 0 >>>>> [ERROR] >>>>> >>>> >>> >> TestRegionLocationCaching.testCachingForHTableMultiplexerSinglePut:73->checkRegionLocationIsCached:148 >>>>> Expected non-zero number of cached region locations. Actual: 0 >>>>> [ERROR] >>>>> >>>> >>> >> TestRegionLocationCaching.testCachingForHTableSinglePut:116->checkRegionLocationIsCached:148 >>>>> Expected non-zero number of cached region locations. Actual: 0 >>>>> [ERROR] TestReplicasClient.testHedgedRead:595 expected:<0> but >> was:<1> >>>>> [ERROR] >>>>> >>>> >>> >> TestFilterListOrOperatorWithBlkCnt.testMultiRowRangeWithFilterListOrOperatorWithBlkCnt:127 >>>>> expected:<4> but was:<5> >>>>> [ERROR] TestRegionServerMetrics.testRequestCount:137 Metrics >> Counters >>>>> should be equal expected:<59> but was:<89> >>>>> [INFO] >>>>> [ERROR] Tests run: 1870, Failures: 7, Errors: 0, Skipped: 17 >>>>> >>>>> >>>>> - >>>>> *ITBLL testing result: (failures are reproducable) * >>>>> >>>>> ./bin/hbase org.apache.hadoop.hbase.test.IntegrationTestBigLinkedList >>>> Loop >>>>> 1 1 25000000 /tmp/itbll 1 -m slowDeterministic >>>>> >>>>> 2019-02-01 23:31:03,746 INFO [main] mapreduce.Job: map 100% reduce >>> 100% >>>>> 2019-02-01 23:31:03,746 INFO [main] mapreduce.Job: Job >>>>> job_local221831554_0003 completed successfully >>>>> 2019-02-01 23:31:03,758 INFO [main] mapreduce.Job: 175000018 >>>>> Input split bytes=679 >>>>> Combine input records=0 >>>>> Combine output records=0 >>>>> Reduce input groups=75000000 >>>>> Reduce shuffle bytes=5175000018 >>>>> Reduce input records=150000000 >>>>> Reduce output records=0 >>>>> Spilled Records=561948580 >>>>> Shuffled Maps =3 >>>>> Failed Shuffles=0 >>>>> Merged Map outputs=3 >>>>> GC time elapsed (ms)=1859 >>>>> Total committed heap usage (bytes)=1846542336 >>>>> HBase Counters >>>>> BYTES_IN_REMOTE_RESULTS=0 >>>>> BYTES_IN_RESULTS=31125001574 >>>>> MILLIS_BETWEEN_NEXTS=934178 >>>>> NOT_SERVING_REGION_EXCEPTION=7 >>>>> NUM_SCANNER_RESTARTS=0 >>>>> NUM_SCAN_RESULTS_STALE=0 >>>>> REGIONS_SCANNED=12 >>>>> REMOTE_RPC_CALLS=0 >>>>> REMOTE_RPC_RETRIES=0 >>>>> ROWS_FILTERED=12 >>>>> ROWS_SCANNED=75000012 >>>>> RPC_CALLS=14607 >>>>> RPC_RETRIES=7 >>>>> Shuffle Errors >>>>> BAD_ID=0 >>>>> CONNECTION=0 >>>>> IO_ERROR=0 >>>>> WRONG_LENGTH=0 >>>>> WRONG_MAP=0 >>>>> WRONG_REDUCE=0 >>>>> >>>> org.apache.hadoop.hbase.test.IntegrationTestBigLinkedList$Verify$Counts >>>>> REFERENCED=75000000 >>>>> File Input Format Counters >>>>> Bytes Read=0 >>>>> File Output Format Counters >>>>> Bytes Written=108 >>>>> 2019-02-01 23:31:03,764 ERROR [main] >>>>> test.IntegrationTestBigLinkedList$Verify: *Expected referenced count >>> does >>>>> not match with actual referenced count. expected referenced=25000000 >>>>> ,actual=75000000* >>>>> >>>>> >>>>> ./bin/hbase org.apache.hadoop.hbase.test.IntegrationTestBigLinkedList >>>> Loop >>>>> 1 4 1000000 /tmp/itbll 4 -m slowDeterministic >>>>> 2019-02-02 00:34:27,009 ERROR [main] >>>>> test.IntegrationTestBigLinkedList$Verify: Expected referenced count >>> does >>>>> not match with actual referenced count. expected referenced=4000000 >>>>> ,actual=79000000 >>>>> >>>>> >>>>>> On Fri, Feb 1, 2019 at 2:17 PM Andrew Purtell <[email protected]> >>>> wrote: >>>>>> >>>>>> The first HBase 1.5.0 release candidate (RC0) is available for >>> download >>>> at >>>>>> https://dist.apache.org/repos/dist/dev/hbase/hbase-1.5.0RC0/ and >>> Maven >>>>>> artifacts are available in the temporary repository >>>>>> >>> https://repository.apache.org/content/repositories/orgapachehbase-1250/ >>>>>> >>>>>> The git tag corresponding to the candidate is '1.5.0RC0' >> (ce6a6014da). >>>>>> >>>>>> A detailed source and binary compatibility report for this release >> is >>>>>> available for your review at >>>>>> >>>>>> >>>> >>> >> https://dist.apache.org/repos/dist/dev/hbase/hbase-1.5.0RC0/compat-check-report.html >>>>>> . I do not believe there are any reported compatibility issues that >>> are >>>> in >>>>>> violation of our compatibility policy for minor releases, but if you >>>> find >>>>>> something and feel differently, please file a JIRA. >>>>>> >>>>>> A list of the 88 issues resolved in this release can be found at >>>>>> https://s.apache.org/K4Wk . The 1.5.0 changelog is derived from the >>>>>> changelog of the last branch-1.4 release, 1.4.9. >>>>>> >>>>>> Please try out the candidate and vote +1/0/-1. >>>>>> >>>>>> The vote will be open for at least 72 hours. Unless objection I will >>>> try to >>>>>> close it Thursday February 28, 2019 if we have sufficient votes. >>>>>> >>>>>> Prior to making this announcement I made the following preflight >>> checks: >>>>>> >>>>>> RAT check passes (7u80) >>>>>> Unit test suite passes (7u80, 8u181) >>>>>> Opened the UI in a browser, poked around >>>>>> LTT load 100M rows with 100% verification and 20% updates (8u181) >>>>>> ITBLL 1B rows with slowDeterministic monkey (8u181) >>>>>> ITBLL 1B rows with serverKilling monkey (8u181) >>>>>> >>>>>> Some of this testing was done with recent 1.5.0-SNAPSHOT versions. >>>> During >>>>>> the month of February I plan to perform a number of additional >> tests, >>>>>> including performance regression checks. As more results become >>>> available I >>>>>> will post them to this thread. >>>>>> >>>>>> -- >>>>>> Best regards, >>>>>> Andrew >>>>>> >>> >>
