As mentioned earlier, testing with random data just for the sake of testing
isn't useful and won't yield any meaningful information. That said, here are
some free sources of data:
www.quandl.com
www.data.gov

Thanks

Prem Moola (201.679.9071)

From: Jörn Franke
Sent: Thursday, September 28, 2017 1:26 PM
To: Gaurav1809
Cc: [email protected]
Subject: Re: Where can I get few GBs of sample data?

I don't think just any dataset is useful. The data should be close to the real
data that you want to process, and the processing should likewise match what
you plan to run.
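
If no suitable public dataset matches your real data, one option is to generate synthetic rows shaped like it. The following is only a minimal sketch under stated assumptions: the function name `write_sample_rows`, the column names, and the value ranges are all hypothetical placeholders, not anything from this thread. Replace them with the schema and distributions of the data you actually plan to process.

```python
import csv
import io
import random


def write_sample_rows(out, target_bytes, seed=0):
    """Write CSV rows to `out` until about `target_bytes` have been written.

    The schema below is a hypothetical stand-in; mirror your real columns.
    Returns the number of data rows written (the header is not counted).
    """
    rng = random.Random(seed)  # fixed seed keeps the output reproducible
    writer = csv.writer(out, lineterminator="\n")

    header = ["order_id", "customer_id", "amount", "country"]  # assumed columns
    writer.writerow(header)
    written = len(",".join(header)) + 1  # +1 for the newline

    countries = ["US", "DE", "IN", "BR"]  # assumed value set
    rows = 0
    while written < target_bytes:
        row = [
            str(rows),
            str(rng.randrange(1_000_000)),
            f"{rng.uniform(1, 500):.2f}",
            rng.choice(countries),
        ]
        writer.writerow(row)
        # All fields are plain ASCII without commas or quotes, so the
        # character count equals the number of bytes written.
        written += len(",".join(row)) + 1
        rows += 1
    return rows


# Small in-memory demonstration (about 10 KB).
demo = io.StringIO()
demo_rows = write_sample_rows(demo, target_bytes=10_000)
```

To produce a few GB, pass a file opened with `open(path, "w", newline="")` and a `target_bytes` such as `2 * 1024**3`. Uniformly random values will not reproduce the skew or key distribution of real data, so this only helps once the generator has been tuned to resemble the real workload.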


> On 28. Sep 2017, at 18:04, Gaurav1809 <[email protected]> wrote:
> 
> Hi All,
> 
> I have set up a multi-node Spark cluster and am now looking for a good
> volume of data to test it and see how it performs while processing that data.
> Can anyone provide pointers as to where I can get a few GBs of free sample
> data?
> 
> Thanks and regards,
> Gaurav
> 
> 
> 
> --
> Sent from: http://apache-spark-user-list.1001560.n3.nabble.com/
> 
> ---------------------------------------------------------------------
> To unsubscribe e-mail: [email protected]
> 
