What is the best way to download data directly into HDFS from some remote source?
I used this command, which works: curl <remote_url> | funzip | hadoop fs -put - /path/filename Is this the recommended way to go? Uri -- Uri Laserson, PhD Data Scientist, Cloudera Twitter/GitHub: @laserson +1 617 910 0447 [email protected]
