[ http://issues.apache.org/jira/browse/HADOOP-369?page=all ]
Johan Oskarson updated HADOOP-369:
----------------------------------
Attachment: dircat.patch
I've added two patches, one is the requested cat feature (cat whole directory)
However, a simple test shows that this is very very much slower then saving it
through a filestream.
Why I do not know :)
So I've changed the copymerge patch as suggested and uploaded the new patch.
> Added ability to copy all part-files into one output file
> ---------------------------------------------------------
>
> Key: HADOOP-369
> URL: http://issues.apache.org/jira/browse/HADOOP-369
> Project: Hadoop
> Issue Type: New Feature
> Components: dfs
> Affects Versions: 0.4.0
> Reporter: Johan Oskarson
> Priority: Trivial
> Attachments: copymerge.patch, copymerge.patch, dircat.patch
>
>
> Since we use the hadoop output in non-hadoop applications it's nice to be
> able to merge the part-files into one output file on the local filesystem.
> So I've added a dfsshell feature that streams from all files in a directory
> to one output file.
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira