[ https://issues.apache.org/jira/browse/APEXMALHAR-2314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607856#comment-15607856 ]

ASF GitHub Bot commented on APEXMALHAR-2314:
--------------------------------------------

GitHub user deepak-narkhede reopened a pull request:

    https://github.com/apache/apex-malhar/pull/468

    APEXMALHAR-2314 Improper functioning in partitioning for sequentialFileRead for FSRecord

    Fix the StreamCodec for FSRecordReader. Initially it partitioned on the
    hashcode of the blockId, which is almost always unique per block, so blocks
    of the same file could be sent to different partitions and the
    sequentialFileRead property could not be satisfied. The StreamCodec now
    partitions on the hashcode of the filePath, so all blocks of a file are
    routed to the same operator partition.

    Tested with recordReader and verified that, with sequentialFileRead set,
    all blocks of a file are partitioned to a single operator.
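
    The effect of switching the partitioning key can be illustrated with a
    small standalone sketch. This is not the code from the pull request and it
    does not use the Apex StreamCodec API: BlockInfo,
    SequentialReadPartitionSketch and all method names are hypothetical, and
    mapping a key to a partition with a modulo is a simplifying assumption
    made only for illustration.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical stand-in for block metadata; NOT the Malhar class.
final class BlockInfo {
    final long blockId;
    final String filePath;

    BlockInfo(long blockId, String filePath) {
        this.blockId = blockId;
        this.filePath = filePath;
    }
}

// Sketch comparing the two partitioning keys described above.
final class SequentialReadPartitionSketch {

    // Old behaviour (as described): key derived from the blockId.
    static int keyByBlockId(BlockInfo block) {
        return Long.hashCode(block.blockId);
    }

    // New behaviour (as described): key derived from the filePath.
    static int keyByFilePath(BlockInfo block) {
        return block.filePath.hashCode();
    }

    // Assumption: partitions are chosen by key modulo partition count.
    static int partitionFor(int key, int partitionCount) {
        return Math.floorMod(key, partitionCount);
    }

    // Collects the distinct partitions that a list of blocks lands on.
    static Set<Integer> partitionsUsed(List<BlockInfo> blocks,
                                       boolean byFilePath,
                                       int partitionCount) {
        Set<Integer> used = new HashSet<>();
        for (BlockInfo block : blocks) {
            int key = byFilePath ? keyByFilePath(block) : keyByBlockId(block);
            used.add(partitionFor(key, partitionCount));
        }
        return used;
    }

    // Builds `count` blocks of one file with consecutive block ids.
    static List<BlockInfo> blocksOf(String filePath, long firstBlockId, int count) {
        List<BlockInfo> blocks = new ArrayList<>();
        for (int i = 0; i < count; i++) {
            blocks.add(new BlockInfo(firstBlockId + i, filePath));
        }
        return blocks;
    }

    public static void main(String[] args) {
        List<BlockInfo> blocks = blocksOf("/data/a.txt", 100L, 4);
        System.out.println("partitions used, keyed by blockId:  "
                + partitionsUsed(blocks, false, 4).size());
        System.out.println("partitions used, keyed by filePath: "
                + partitionsUsed(blocks, true, 4).size());
    }
}
```

    Keyed by filePath, every block of a file produces the same key and so
    lands on one partition, whatever the partition count. Keyed by blockId,
    the blocks of one file can spread across several partitions.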

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/deepak-narkhede/apex-malhar APEXMALHAR-2314

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/apex-malhar/pull/468.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #468
    
----
commit 259cc5b80635207e8b0a4d7c0c9b5bc735021de2
Author: deepak-narkhede <[email protected]>
Date:   2016-10-24T11:39:24Z

    APEXMALHAR-2314 Improper functioning in partitioning of sequentialFileRead
    property of FSRecordReaderModule.
    
    Modified the StreamCodec to work with hashcode of filepath rather than
    blockId.

----


> Improper functioning in partitioning of sequentialFileRead property of 
> FSRecordReader 
> --------------------------------------------------------------------------------------
>
>                 Key: APEXMALHAR-2314
>                 URL: https://issues.apache.org/jira/browse/APEXMALHAR-2314
>             Project: Apache Apex Malhar
>          Issue Type: Bug
>            Reporter: Deepak Narkhede
>            Assignee: Deepak Narkhede
>            Priority: Minor
>
> Fix the StreamCodec for FSRecordReader. Initially it partitioned on the
> hashcode of the blockId, which is almost always unique per block, so blocks
> of the same file could be sent to different partitions and the
> sequentialFileRead property could not be satisfied. The StreamCodec now
> partitions on the hashcode of the filePath, so all blocks of a file are
> routed to the same operator partition.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
