Github user jiangxb1987 commented on a diff in the pull request: https://github.com/apache/spark/pull/20361#discussion_r165234841 --- Diff: sql/core/src/main/java/org/apache/spark/sql/execution/datasources/orc/OrcColumnarBatchReader.java --- @@ -49,8 +49,9 @@ * After creating, `initialize` and `initBatch` should be called sequentially. */ public class OrcColumnarBatchReader extends RecordReader<Void, ColumnarBatch> { - // TODO: make this configurable. - private static final int CAPACITY = 4 * 1024; + + // The default size of vectorized batch. --- End diff -- How about rephrase to `The capacity of vectorized batch` ?
--- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org