[ 
https://issues.apache.org/jira/browse/PIG-5390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16878175#comment-16878175
 ] 

Koji Noguchi commented on PIG-5390:
-----------------------------------

These self-spilling bags were introduced in Pig 0.6 by PIG-1037, and the 
regression of registering them with SpillableMemoryManager was probably 
introduced in Pig 0.11 by PIG-2923.

> Possible race condition from Self-spilling bags registering with 
> SpillableMemoryManager 
> ----------------------------------------------------------------------------------------
>
>                 Key: PIG-5390
>                 URL: https://issues.apache.org/jira/browse/PIG-5390
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Koji Noguchi
>            Assignee: Koji Noguchi
>            Priority: Major
>
> This is a follow-up to PIG-5380, where [~rohini] pointed out:
> {quote}
> I think same change is required in InternalSortedBag as well as code is 
> exactly same and it can spill too - 
> https://github.com/apache/pig/blob/trunk/src/org/apache/pig/data/InternalSortedBag.java#L133
>  . We most likely haven't seen issues with it as the probability could be 
> very less as it will proactively spill if it exceeds cached memory limit.
> {quote}
> Looking at the history and the source, this is a critical bug, given that all 
> these self-spilling bags are designed on the premise that no other thread 
> would touch them.  The comment in the source clearly says:
> {code}
>  * This bag is not registered with SpillableMemoryManager. It calculates
>  * the number of tuples to hold in memory and spill pro-actively into files.
> {code}
>  
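
To make the single-thread premise concrete, here is a minimal sketch. It is 
not Pig's actual code: the class SelfSpillingBagSketch and all of its methods 
are hypothetical stand-ins, and "spilling" is reduced to a counter instead of 
a file write. The bag spills proactively from add(), on the thread that owns 
it, so its in-memory list needs no locking as long as nobody else calls 
spill().

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a self-spilling bag; not Pig's implementation.
class SelfSpillingBagSketch {
    // Plain ArrayList: safe only while a single thread touches the bag.
    private final List<String> inMemory = new ArrayList<>();
    private final int limit;   // max tuples held in memory before spilling
    private long spilled;      // stand-in for tuples written to spill files

    SelfSpillingBagSketch(int limit) {
        this.limit = limit;
    }

    // Called only by the thread that owns the bag.
    void add(String tuple) {
        inMemory.add(tuple);
        if (inMemory.size() >= limit) {
            spill(); // proactive spill, still on the owner thread
        }
    }

    // Unsynchronized by design. If a memory manager thread were handed a
    // reference to this bag and called spill() while the owner thread is
    // inside add(), both threads would mutate inMemory concurrently.
    long spill() {
        long n = inMemory.size();
        spilled += n;      // stand-in for writing the tuples to a file
        inMemory.clear();
        return n;
    }

    long spilledCount() {
        return spilled;
    }

    int inMemoryCount() {
        return inMemory.size();
    }

    public static void main(String[] args) {
        SelfSpillingBagSketch bag = new SelfSpillingBagSketch(3);
        for (int i = 0; i < 7; i++) {
            bag.add("t" + i);
        }
        System.out.println("spilled=" + bag.spilledCount()
                + " inMemory=" + bag.inMemoryCount());
    }
}
```

In this sketch the bag is correct only because spill() is reached through 
add(). Registering such an object with a manager that may call spill() from 
its own thread removes that guarantee, which is the race described above.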



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
