[ https://issues.apache.org/jira/browse/CRUNCH-553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14643794#comment-14643794 ]
Micah Whitacre commented on CRUNCH-553: --------------------------------------- +1 > From.formattedFile may cause records to be dropped. > --------------------------------------------------- > > Key: CRUNCH-553 > URL: https://issues.apache.org/jira/browse/CRUNCH-553 > Project: Crunch > Issue Type: Bug > Components: IO > Affects Versions: 0.11.0, 0.12.0 > Reporter: Josh Wills > Assignee: Josh Wills > Fix For: 0.13.0 > > Attachments: CRUNCH-553.patch > > > From the mailing list, a user reported a bug in which they were using > multiple instances of From.formattedFile TableSources and were seeing records > getting dropped at random from different runs of their jobs. I created a > simple test that replicated the behavior and found the source of the problem > in the planner: a confusion between a BaseInputTable and the > BaseInputCollection objects that does most of the work to actually configure > the input table data that resulted from BaseInputTable's equals() method not > checking to see if an object was of its same class before performing the > comparison on the underlying BaseInputCollection instance. -- This message was sent by Atlassian JIRA (v6.3.4#6332)