wgtmac commented on issue #2201: URL: https://github.com/apache/orc/issues/2201#issuecomment-2832916571
It is common practice to bind the lifecycle of a reusable batch to the reader's most recent state, because creating a batch is usually a heavy operation due to memory allocation. `arrow-java` uses the same technique to reuse the `VectorSchemaRoot` object. Other kinds of batches avoid this lifecycle issue only by coincidence: dictionary-encoded data is the only kind that can have view semantics. To keep the data alive longer, it would be better to add a clone method to `ColumnVectorBatch`.
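To make the lifecycle issue and the suggested clone concrete, here is a minimal C++ sketch. It does not use the ORC API: `DictBatch`, `Reader`, `clone()`, and `firstValuesAfterSecondRead()` are hypothetical stand-ins, and the assumption is simply that a dictionary-encoded batch holds views into storage the reader owns and replaces on the next read.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <string>
#include <string_view>
#include <utility>
#include <vector>

// Hypothetical stand-in for a dictionary-encoded string batch (NOT the ORC API).
struct DictBatch {
  // Views into dictionary storage owned by the reader (view semantics).
  std::vector<std::string_view> values;
  // Set only on clones: a private copy of the bytes the views point at.
  std::shared_ptr<std::vector<std::string>> owned;

  // Deep copy: the clone owns its bytes, so it survives later reads.
  DictBatch clone() const {
    DictBatch copy;
    copy.owned = std::make_shared<std::vector<std::string>>();
    copy.owned->reserve(values.size());
    for (std::string_view v : values) copy.owned->emplace_back(v);
    // Build the views only after all strings exist, so none can move.
    for (const std::string& s : *copy.owned) copy.values.emplace_back(s);
    return copy;
  }
};

// Hypothetical reader that owns one dictionary per stripe.
class Reader {
 public:
  explicit Reader(std::vector<std::vector<std::string>> stripes)
      : stripes_(std::move(stripes)) {}

  // Refills `batch` in place. The previous dictionary is replaced, so any
  // views copied out of the batch before this call become dangling.
  bool next(DictBatch& batch) {
    if (pos_ >= stripes_.size()) return false;
    dictionary_ = std::move(stripes_[pos_++]);
    batch.owned.reset();
    batch.values.clear();
    for (const std::string& s : dictionary_) batch.values.emplace_back(s);
    return true;
  }

 private:
  std::vector<std::vector<std::string>> stripes_;
  std::vector<std::string> dictionary_;
  std::size_t pos_ = 0;
};

// Usage: keep the first batch's data alive across a second read.
std::string firstValuesAfterSecondRead() {
  std::vector<std::vector<std::string>> stripes;
  stripes.push_back({"apple", "banana"});
  stripes.push_back({"cherry"});
  Reader reader(std::move(stripes));

  DictBatch batch;
  bool ok = reader.next(batch);
  assert(ok);
  DictBatch kept = batch.clone();  // copy before the reader moves on
  ok = reader.next(batch);         // the reused batch now views "cherry"
  assert(ok);
  (void)ok;
  return std::string(kept.values[0]) + "/" + std::string(batch.values[0]);
  // returns "apple/cherry"
}
```

The reused batch stays cheap because refilling it allocates no new string data; only callers that need the data beyond the next read pay for the copy.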