[
https://issues.apache.org/jira/browse/AVRO-867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13072520#comment-13072520
]
Joe Crobak commented on AVRO-867:
---------------------------------
bq. I assume you're proposing to move something like Util#fileOrStdin and
#fileOrStdin into another module? That sounds reasonable. These could probably
go into the mapred module, since it already depends on HDFS.
Ah, I hadn't realized that Util#fileOrStdin does exactly this. In that case,
this is more about updating all the tools to use #fileOrStdin if that makes
sense (e.g. DataFileReader and DataFileGetSchema don't use it).
> Allow tools to read files via hadoop FileSystem class
> -----------------------------------------------------
>
> Key: AVRO-867
> URL: https://issues.apache.org/jira/browse/AVRO-867
> Project: Avro
> Issue Type: New Feature
> Components: java
> Reporter: Joe Crobak
> Assignee: Joe Crobak
>
> It would be great if I could use the various tools to read/parse files that
> are in HDFS, S3, etc via the
> [FileSystem|http://hadoop.apache.org/common/docs/current/api/org/apache/hadoop/fs/FileSystem.html]
> api. We could retain backwards compatibility by assuming that unqualified
> urls are "file://" but allow reading of files from fully qualified urls such
> as hdfs://. The required apis are already part of the avro-tools uber jar to
> support the TetherTool.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira