Re: unable to serialize analytics pipeline

2013-10-22 Thread Mark Hamstra
If you distribute the needed jar(s) to your Workers, you may well be able to instantiate what you need using mapPartitions, mapPartitionsWithIndex, mapWith, flatMapWith, etc. Be careful, though, about teardown of any resource allocation that you may need to do within each partition. On Tue,

Re: unable to serialize analytics pipeline

2013-10-22 Thread Philip Ogren
A simple workaround that seems to work (at least in localhost mode) is to mark my top-level pipeline object (inside my simple interface) as transient and add an initialize method. In the method that calls the pipeline and returns the results, I simply call the initialize method if needed