zhangshenghang commented on PR #7148:
URL: https://github.com/apache/seatunnel/pull/7148#issuecomment-2227768820
> Cache is often used for frequently querying hot data. In data
> synchronization read scenarios, data is usually queried only once. Is it
> effective to configure cache-related settings in this case? Looking forward to
> your reply.
You are right. When synchronizing data, each row only needs to be read
once, so I set the default value of `cache_blocks` to false, even though
an HBase `Scan` caches blocks by default. I am not sure whether any users
need `cache_blocks = true`, so I kept it as an optional parameter for
special cases.
```
public static final Option<Boolean> HBASE_CACHE_BLOCKS_CONFIG =
        Options.key("cache_blocks")
                .booleanType()
                .defaultValue(false)
                .withDescription(
                        "When it is false, data blocks are not cached. "
                                + "When it is true, data blocks are cached. "
                                + "This value should be set to false when scanning a large amount of data "
                                + "to reduce memory consumption. The default value is false");
```
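For illustration, here is a rough sketch of how the option is meant to behave when a scan is built. This is not the connector code from the PR: `CacheBlocksSketch`, `ScanStub`, `resolveCacheBlocks`, and `buildScan` are hypothetical names, and the plain `Map` stands in for the connector configuration so the example runs without HBase on the classpath. With the real HBase client, the corresponding call would be `Scan#setCacheBlocks(boolean)`.

```java
import java.util.Map;

public class CacheBlocksSketch {
    // Option key and default value taken from the option definition above.
    static final String CACHE_BLOCKS_KEY = "cache_blocks";
    static final boolean CACHE_BLOCKS_DEFAULT = false;

    /** Hypothetical stand-in for an HBase Scan; it only models the block-cache flag. */
    static final class ScanStub {
        // An HBase Scan starts with block caching enabled.
        private boolean cacheBlocks = true;

        void setCacheBlocks(boolean cacheBlocks) {
            this.cacheBlocks = cacheBlocks;
        }

        boolean getCacheBlocks() {
            return cacheBlocks;
        }
    }

    /** Returns the configured value, or the default (false) when the key is absent. */
    static boolean resolveCacheBlocks(Map<String, String> config) {
        String raw = config.get(CACHE_BLOCKS_KEY);
        return raw == null ? CACHE_BLOCKS_DEFAULT : Boolean.parseBoolean(raw);
    }

    /** Builds a scan whose block-cache flag follows the resolved option. */
    static ScanStub buildScan(Map<String, String> config) {
        ScanStub scan = new ScanStub();
        scan.setCacheBlocks(resolveCacheBlocks(config));
        return scan;
    }

    public static void main(String[] args) {
        // No option set: the default of false overrides the scan's own default.
        System.out.println(buildScan(Map.of()).getCacheBlocks());
        // Explicit opt-in for the special cases where caching is wanted.
        System.out.println(buildScan(Map.of("cache_blocks", "true")).getCacheBlocks());
    }
}
```

Under these assumptions, a job that does not set `cache_blocks` scans without filling the block cache, and a user who really wants caching can still opt in.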