[ https://issues.apache.org/jira/browse/HBASE-28640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Viraj Jasani resolved HBASE-28640. ---------------------------------- Resolution: Duplicate Duplicate of HBASE-28641 > Hbase compaction is slow in 2.4.11 compared to hbase 1.x > -------------------------------------------------------- > > Key: HBASE-28640 > URL: https://issues.apache.org/jira/browse/HBASE-28640 > Project: HBase > Issue Type: Improvement > Components: Compaction > Affects Versions: 2.4.11 > Reporter: Divesh Katta > Priority: Major > Attachments: image-2024-06-04-16-17-38-873.png, > image-2024-06-04-16-18-06-474.png, image-2024-06-04-16-18-26-818.png, > image-2024-06-04-16-18-55-672.png > > > Hi Team, > we build an Hbase 2.4.11 cluster comprising HDFS and HBase components. > However, during our performance testing, we observed that the HBase > compaction process was taking longer than expected. > With identical configurations, in HBase-1 cluster is completing compaction > tasks in less time compared to the new Hbase 2 cluster. > Hbase1 cluster details: > HBASE: 1.1.2 > Hbase2 cluster details > HBASE: 2.4.11 > Please find the screenshot for the Compaction iteration and timeline: > !image-2024-06-05-16-11-50-691.png! > IN Hbase 1 COMPACTION TIME > In the HBASE1 cluster with the same set of configurations and tables, we > observed consistent behavior in terms of compaction time(3hrs). > Start Time: 1:30AM > End Time: 4:30AM > !image-2024-06-04-16-17-38-873.png! > > IN Hbase 2 COMPACTION TIME > Start Time: 1:30AM > End Time: 23:00PM+ > !image-2024-06-04-16-18-06-474.png! > > Actions Taken: > Tuning of HBase configurations related to compactions was performed initially > but didn't yield significant improvement. > OS Mitigations were switched off and filesystem was migrated from ext4 to > xfs, yet compactions didn't improve as expected. > Observations: > We suspected that this absence of data encoding might be contributing to > longer compaction times in HBase-2. > Scheduled two separate compactions for tables with and without DATA_ENCODING > enabled in Hbase 2. > In the screenshot below, the first compaction started at 1:30 AM for 15 > tables (total size exceeding 220TB+) with (DATA ENCODED=FAST_DIFF) enabled > and this compaction completed by 2:30 AM. However, at 2:30 AM, we scheduled > another compaction for only 6 tables, totalling over 60TB in size, but these > tables were not enabled with DATA ENCODING , and this compaction is taking > longer. > !image-2024-06-04-16-18-26-818.png! > > After enabling DATA ENCODING for all the tables in HBASE-2, we initiated > compaction at 1:30 AM, which completed by 4:30 AM(3hrs) > Start Time: 1:30AM > End Time: 4:30 AM > !image-2024-06-04-16-18-55-672.png! > Noticed that tables with (DATA_ENCODING=FAST_DIFF) enabled underwent faster > compactions compared to those without. > Upon comparing the debug logs of the two Hbase 2 clusters, we discovered that > the cluster with (DATA_ENCODING=FAST_DIFF) enabled exhibited higher > throughput(average throughput is 89.54 MB/Second), whereas the cluster with > DATA_ENCODING disabled showed lower throughput(average throughput is 8.19 > MB/second,). > > #DATA_ENCODED ENBALED Tables throughput > 2024-04-02 03:11:58,956 INFO [regionserver/xxxxxxxx:16020-shortCompactions-0] > throttle.PressureAwareThroughputController: > 803e9ef64aec8e526837c0477cc48884#scr#compaction#34672 average throughput is > 89.54 MB/second, slept 0 time(s) and total slept time is 0 ms. 21 active > operations remaining, total limit is unlimited 2024-04-02 03:12:23,680 INFO > [regionserver/usr-Hbase 1<XXXXX>201:16020-shortCompactions-7] > throttle.PressureAwareThroughputController: > 82565df649e3e6eb53a1e168435204db#scr#compaction#34677 average throughput is > 65.02 MB/second, slept 0 time(s) and total slept time is 0 ms. 21 active > operations remaining, total limit is unlimited > 2024-04-02 02:45:46,538 DEBUG [regionserver/xxxxxx:16020-longCompactions-7] > compactions.Compactor: Compaction progress: > b746a46d64df25ca82571b10bb1e2c03#key#compaction#34484 128896600/371074603 > (34.74%), rate=17151.92 KB/sec, throughputController is > DefaultCompactionThroughputController [maxThroughput=unlimited, > activeCompactions=22] 2024-04-02 02:45:47,291 DEBUG [regionserver/usr-Hbase > 1<XXXX>201:16020-shortCompactions-0] compactions.Compactor: Compaction > progress: a1338704db18a8fc613ad2c9a4561d65#key#compaction#34496 > 90148941/110577879 (81.53%), rate=20950.64 KB/sec, throughputController is > DefaultCompactionThroughputController [maxThroughput=unlimited, > activeCompactions=22] > > #DATA_ENCODED DISABLED tables throughput > 2024-04-04 03:12:48,521 INFO [regionserver/xxxxxxx:16020-longCompactions-3] > throttle.PressureAwareThroughputController: > 273b9d62b602ecd3e6df3d244f796c0b#key#compaction#185279 average throughput is > 8.91 MB/second, slept 0 time(s) and total slept time is 0 ms. 21 active > operations remaining, total limit is unlimited 2024-04-04 03:14:02,793 INFO > [regionserver/usr-Hbase 1<XXXXXXX>201:16020-longCompactions-2] > throttle.PressureAwareThroughputController: > 11fff837daa7da3fa307ecbb457fe64d#key#compaction#185282 average throughput is > 8.19 MB/second, slept 0 time(s) and total slept time is 0 ms. 21 active > operations remaining, total limit is unlimited > 2024-04-03 02:46:21,165 DEBUG [regionserver/xxxxxxx:16020-shortCompactions-0] > compactions.Compactor: Compaction progress: > 98748d4c25e15c623e43a3e42f03f5de#key#com paction#178586 2986892/73628890 > (4.06%), rate=9936.51 KB/sec, throughputController is > DefaultCompactionThroughputController [maxThroughput=unlimited, > activeCompactions=22] 2024-04-03 02:46:21,193 DEBUG > [regionserver/:16020-longCompactions-0] compactions.Compactor: Compaction > progress: ce578c37f0a8354afd6d37fa5ca9c7d7#key#comp > Need Help With: > In Hbase 1 the compaction is consistently completing in 3 hours without data > encoading. > In hbase 2 Compaction without data encoading and compression it is taking 12 > + hours. but with data encoading and compression we are able to finish it in > approx 4-4.5 hours. -- This message was sent by Atlassian Jira (v8.20.10#820010)