[ 
https://issues.apache.org/jira/browse/AVRO-867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13072520#comment-13072520
 ] 

Joe Crobak commented on AVRO-867:
---------------------------------

bq. I assume you're proposing to move something like Util#fileOrStdin and 
#fileOrStdin into another module? That sounds reasonable. These could probably 
go into the mapred module, since it already depends on HDFS.

Ah, I hadn't realized that Util#fileOrStdin does exactly this. In that case, 
this issue is more about updating all the tools to use #fileOrStdin, if that 
makes sense (e.g. DataFileReader and DataFileGetSchema don't use it).
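
To make the backwards-compatibility rule from the issue description concrete 
(unqualified names are treated as "file://", fully qualified URLs are passed 
through), here is a rough sketch using only the JDK. The class and method names 
(InputLocation, qualify) are made up for illustration and are not Avro's actual 
Util code; treating "-" as stdin is also just an assumption of this sketch. A 
real tool would hand the resulting URI to Hadoop's FileSystem rather than stop 
here.

```java
import java.net.URI;
import java.nio.file.Paths;

// Hypothetical helper, not part of Avro: decides where a tool argument points.
class InputLocation {

    /**
     * Returns null when the argument is "-" (assumed here to mean stdin),
     * otherwise a fully qualified URI for the input.
     */
    static URI qualify(String name) {
        if (name.equals("-")) {
            return null; // caller would read from System.in
        }
        if (name.contains("://")) {
            // Already qualified (hdfs://, s3://, file://, ...): keep as given.
            return URI.create(name);
        }
        // Unqualified: preserve the old behaviour and treat it as a local file.
        return Paths.get(name).toAbsolutePath().toUri();
    }

    public static void main(String[] args) {
        for (String arg : args) {
            System.out.println(arg + " -> " + qualify(arg));
        }
    }
}
```

With something like this in one place, each tool would only need to call the 
shared helper instead of opening a java.io.File directly.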

> Allow tools to read files via hadoop FileSystem class
> -----------------------------------------------------
>
>                 Key: AVRO-867
>                 URL: https://issues.apache.org/jira/browse/AVRO-867
>             Project: Avro
>          Issue Type: New Feature
>          Components: java
>            Reporter: Joe Crobak
>            Assignee: Joe Crobak
>
> It would be great if I could use the various tools to read/parse files that 
> are in HDFS, S3, etc. via the 
> [FileSystem|http://hadoop.apache.org/common/docs/current/api/org/apache/hadoop/fs/FileSystem.html]
>  API. We could retain backwards compatibility by assuming that unqualified 
> URLs are "file://" but allow reading of files from fully qualified URLs such 
> as hdfs://. The required APIs are already part of the avro-tools uber jar to 
> support the TetherTool.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
