[
https://issues.apache.org/jira/browse/APEXMALHAR-2314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607856#comment-15607856
]
ASF GitHub Bot commented on APEXMALHAR-2314:
--------------------------------------------
GitHub user deepak-narkhede reopened a pull request:
https://github.com/apache/apex-malhar/pull/468
APEXMALHAR-2314 Improper functioning in partitioning for sequentialFileRead
for FSRecord
Fix the StreamCodec for FSRecordReader, initially it was hashcode of
blockId's mostly always unique.
Hence unable to satisfy the sequentialFileRead property. Now the
StreamCodec is modified to work
with hashcode of filePath. So all blocks related to a file would be
partitioned on same operator.
Tested with recordReader and verified for sequentialFileRead that all
blocks related to a file are partitioned to single operator.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/deepak-narkhede/apex-malhar APEXMALHAR-2314
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/apex-malhar/pull/468.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #468
----
commit 259cc5b80635207e8b0a4d7c0c9b5bc735021de2
Author: deepak-narkhede <[email protected]>
Date: 2016-10-24T11:39:24Z
APEXMALHAR-2314 Improper functioning in partitioning of sequentialFileRead
property of FSRecordReaderModule.
Modified the StreamCodec to work with hashcode of filepath rather than
blockId.
----
> Improper functioning in partitioning of sequentialFileRead property of
> FSRecordReader
> --------------------------------------------------------------------------------------
>
> Key: APEXMALHAR-2314
> URL: https://issues.apache.org/jira/browse/APEXMALHAR-2314
> Project: Apache Apex Malhar
> Issue Type: Bug
> Reporter: Deepak Narkhede
> Assignee: Deepak Narkhede
> Priority: Minor
>
> Fix the StreamCodec for FSRecordReader, initially it was hashcode of
> blockId's mostly always unique.
> Hence unable to satisfy the sequentialFileRead property. Now the StreamCodec
> is modified to work
> with hashcode of filePath. So all blocks related to a file would be
> partitioned on same operator.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)