On 23 March 2011 18:12, Jonathan Holloway <[email protected]>wrote:
> I've got a general question surrounding the output of various Pig scripts > and generally where people are > storing that data and in what kind of format? > ... > At present the results from my Pig scripts end up in HDFS in Pig bag/tuple > format and I just wondered whether > that was the best practice for large amounts of data in terms of > organisation. > I too would like to know this. My plan was to convert my hdfs data into read-only Project Voldemort key/value database. I've been told it can be done but haven't investigated fully yet. I am not sure when I should to use Hive or what the alternatives are. Alex
