[
https://issues.apache.org/jira/browse/HBASE-26405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Kyle Purtell updated HBASE-26405:
----------------------------------------
Description:
This integration test emulates a use case that stores a lot of small values
into a table that would likely be heavily indexed (ROW_INDEX_V1, small blocks,
etc.), an application that crowdsources weather (temperature) observation data.
This IT can be used to test and optimize compression settings for such cases.
It comes with a companion utility, HFileBlockExtracter, which extracts block
data from HFiles into a set of individual files for each block's data, for use
in training external compression dictionaries, perhaps with ZStandard's 'zstd'
utility.
See javadoc on class IntegrationTestLoadSmallValues for usage.
This was used to test the changes on HBASE-26353.
was:
This integration test emulates a use case that stores a lot of small values
into a table that would likely be heavily indexed (ROW_INDEX_V1, small blocks,
etc.), an application that crowdsources weather (temperature) observation data.
This IT can be used to test and optimize compression settings for such cases.
It comes with a companion utility, HFileBlockExtracter, which extracts block
data from HFiles into a set of individual files for each block's data, for use
in training external compression dictionaries, perhaps with ZStandard's 'zstd'
utility.
See javadoc on class IntegrationTestLoadSmallValues for usage.
> IntegrationTestLoadSmallValues
> ------------------------------
>
> Key: HBASE-26405
> URL: https://issues.apache.org/jira/browse/HBASE-26405
> Project: HBase
> Issue Type: Test
> Components: integration tests, test
> Reporter: Andrew Kyle Purtell
> Assignee: Andrew Kyle Purtell
> Priority: Minor
> Fix For: 2.5.0, 3.0.0-alpha-2
>
>
> This integration test emulates a use case that stores a lot of small values
> into a table that would likely be heavily indexed (ROW_INDEX_V1, small
> blocks, etc.), an application that crowdsources weather (temperature)
> observation data. This IT can be used to test and optimize compression
> settings for such cases. It comes with a companion utility,
> HFileBlockExtracter, which extracts block data from HFiles into a set of
> individual files for each block's data, for use in training external
> compression dictionaries, perhaps with ZStandard's 'zstd' utility.
> See javadoc on class IntegrationTestLoadSmallValues for usage.
> This was used to test the changes on HBASE-26353.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)