[
https://issues.apache.org/jira/browse/DRILL-6340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16506736#comment-16506736
]
ASF GitHub Bot commented on DRILL-6340:
---------------------------------------
ilooner commented on a change in pull request #1302: DRILL-6340: Output Batch
Control in Project using the RecordBatchSizer
URL: https://github.com/apache/drill/pull/1302#discussion_r194206547
##########
File path:
exec/java-exec/src/main/java/org/apache/drill/exec/physical/impl/project/ProjectRecordBatch.java
##########
@@ -230,18 +265,37 @@ protected IterOutcome doWork() {
container.buildSchema(SelectionVectorMode.NONE);
}
+ memoryManager.updateOutgoingStats(outputRecords);
+ if (logger.isDebugEnabled()) {
+ logger.debug("BATCH_STATS, outgoing: {}", new RecordBatchSizer(this));
+ }
// Get the final outcome based on hasRemainder since that will determine
if all the incoming records were
// consumed in current output batch or not
return getFinalOutcome(hasRemainder);
}
private void handleRemainder() {
final int remainingRecordCount = incoming.getRecordCount() -
remainderIndex;
- if (!doAlloc(remainingRecordCount)) {
+ assert this.memoryManager.incomingBatch == incoming;
+ final int recordsToProcess = Math.min(remainingRecordCount,
memoryManager.getOutputRowCount());
+
+ if (!doAlloc(recordsToProcess)) {
outOfMemory = true;
return;
}
- final int projRecords = projector.projectRecords(remainderIndex,
remainingRecordCount, 0);
+ if (logger.isTraceEnabled()) {
+ logger.trace("handleRemainder: remaining RC " + remainingRecordCount + "
toProcess " + recordsToProcess
Review comment:
{}
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
> Output Batch Control in Project using the RecordBatchSizer
> ----------------------------------------------------------
>
> Key: DRILL-6340
> URL: https://issues.apache.org/jira/browse/DRILL-6340
> Project: Apache Drill
> Issue Type: Improvement
> Components: Execution - Relational Operators
> Reporter: Karthikeyan Manivannan
> Assignee: Karthikeyan Manivannan
> Priority: Major
> Fix For: 1.14.0
>
>
> This bug is for tracking the changes required to implement Output Batch
> Sizing in Project using the RecordBatchSizer. The challenge in doing this
> mainly lies in dealing with expressions that produce variable-length columns.
> The following doc talks about some of the design approaches for dealing with
> such variable-length columns.
> [https://docs.google.com/document/d/1h0WsQsen6xqqAyyYSrtiAniQpVZGmQNQqC1I2DJaxAA/edit?usp=sharing]
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)