I haven't tried it, but it sounds like a cool use case. Might be a good alternative to distcp, more interoperable with tools which don't speak hadoop.
On Tue, Oct 6, 2015, 18:41 Russ Weeks <[email protected]> wrote: > I hope this isn't too off-topic. Any opinions re. its > completeness/quality/reliability? > > (The use case is, CSV files -> NFS -> HDFS -> Spark -> RFiles -> Accumulo. > Relevance established!) > > Thanks, > -Russ >
