[
https://issues.apache.org/jira/browse/APEXMALHAR-2008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15187028#comment-15187028
]
ASF GitHub Bot commented on APEXMALHAR-2008:
--------------------------------------------
Github user sandeepdeshmukh commented on a diff in the pull request:
https://github.com/apache/incubator-apex-malhar/pull/207#discussion_r55511718
--- Diff:
library/src/main/java/com/datatorrent/lib/io/fs/FileSplitterInput.java ---
@@ -384,7 +399,14 @@ protected void scanIterationComplete()
lastScanMillis = System.currentTimeMillis();
}
- protected void scan(@NotNull Path filePath, Path rootPath)
+ protected void scan(Path filePath, Path rootPath)
--- End diff --
Why is NotNull removed?
> Create hdfs file input module
> ------------------------------
>
> Key: APEXMALHAR-2008
> URL: https://issues.apache.org/jira/browse/APEXMALHAR-2008
> Project: Apache Apex Malhar
> Issue Type: Task
> Reporter: Priyanka Gugale
> Assignee: Priyanka Gugale
> Priority: Minor
> Original Estimate: 72h
> Remaining Estimate: 72h
>
> To read HDFS files in parallel using Apex we normally use FileSplitter and
> FileReader module. It would be a good idea to combine those operators as a
> unit in module. Having a module will give us readily usable set of operators
> to read HDFS files.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)