Github user jiangxb1987 commented on a diff in the pull request:
https://github.com/apache/spark/pull/20361#discussion_r165234841
--- Diff:
sql/core/src/main/java/org/apache/spark/sql/execution/datasources/orc/OrcColumnarBatchReader.java
---
@@ -49,8 +49,9 @@
* After creating, `initialize` and `initBatch` should be called
sequentially.
*/
public class OrcColumnarBatchReader extends RecordReader<Void,
ColumnarBatch> {
- // TODO: make this configurable.
- private static final int CAPACITY = 4 * 1024;
+
+ // The default size of vectorized batch.
--- End diff --
How about rephrase to `The capacity of vectorized batch` ?
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]