[ 
https://issues.apache.org/jira/browse/PIG-599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12662359#action_12662359
 ] 

Benjamin Reed commented on PIG-599:
-----------------------------------

+1 looks good.

> BufferedPositionedInputStream isn't buffered
> --------------------------------------------
>
>                 Key: PIG-599
>                 URL: https://issues.apache.org/jira/browse/PIG-599
>             Project: Pig
>          Issue Type: Bug
>          Components: impl
>    Affects Versions: types_branch
>            Reporter: Alan Gates
>            Assignee: Alan Gates
>             Fix For: types_branch
>
>         Attachments: loadperf-2.patch, loadperf.patch
>
>
> org.apache.pig.impl.io.BufferedPositionedInputStream is not actually 
> buffered.  This is because it sits atop a FSDataInputStream (somewhere down 
> the stack), which is buffered.  So to avoid double buffering, which can be 
> bad, BufferedPositionedInputStream was written without buffering.  But the 
> FSDataInputStream is far enough down the stack that it is still quite costly 
> to call read() individually for each byte.  A run through a profiler shows 
> that a fair amount of time is being spent in 
> BufferedPositionedInputStream.read().

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to