I've written a package to assist with using R in Hadoop Streaming.

The main point of the package is to help make command-line runnable RScript files. I'd like to provide a demo RScript file, a demo data file (e.g. a plaintext file, not something already processed by R) , as well as demo bash shell scripts that demonstrate how to run the job from the command line and in a Hadoop cluster.

My best idea so far for distributing these files is to package the contents of these files as a list of strings in a data file in the data directory, and include a function in the package, say generateDemoFolder(targetDir), that writes the files to a user- specified directory, ready for use from the command line.

Any suggestions?


Thanks,


David

______________________________________________
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel

Reply via email to