Thanks, Peter Rudenko
Hi, I want to play with the Criteo 1 TB dataset. The files are located on Azure
storage. Here's a command to download them:
curl -O "http://azuremlsampleexperiments.blob.core.windows.net/criteo/day_{`seq -s ',' 0 23`}.gz"
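(In case the backtick/brace expansion gets mangled by the mail client, here is the same thing as a plain loop that just prints the 24 URLs, which can then be piped to `xargs -n1 curl -O`. The URLs are the ones above; only the loop itself is new.)

```shell
# Print one download URL per day file (day_0.gz .. day_23.gz).
base="http://azuremlsampleexperiments.blob.core.windows.net/criteo/day"
for d in $(seq 0 23); do
  echo "${base}_${d}.gz"
done
# To download:  ... | xargs -n1 curl -O
```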
Is there any way to read the files over HTTP with Spark, without first
downloading them to HDFS? Something like this:
sc.textFile("http://azuremlsampleexperiments.blob.core.windows.net/criteo/day_{0-23}.gz")

so it would have 24 partitions.
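If `textFile` can't do this directly, would a manual approach along these lines be the way to go? (Untested sketch for the spark-shell, using plain `java.net.URL` to fetch each gzipped file inside a task, one partition per day file:)

```scala
import java.net.URL
import java.util.zip.GZIPInputStream
import scala.io.Source

// One URL per day file; parallelize with 24 partitions so each
// task fetches and decompresses exactly one file over HTTP.
val urls = (0 to 23).map(d =>
  s"http://azuremlsampleexperiments.blob.core.windows.net/criteo/day_$d.gz")

val lines = sc.parallelize(urls, urls.size).flatMap { u =>
  val in = new GZIPInputStream(new URL(u).openStream())
  Source.fromInputStream(in).getLines()
}
```

Not sure how this behaves with respect to retries if a task dies mid-download, though.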