okie, i may have found an alternate/workaround to using .collect() for what i
am trying to achieve...
initially, for the Spark application that i am working on, i would call
.collect() on two separate RDDs into a couple of ArrayLists (which was the
reason i was asking what the size limit on the
Well, if you don't need to actually evaluate the information on the driver, but
just need to trigger some sort of action, then you may want to consider using
the `forEach` or `forEachPartition` method, which is an action and will execute
your process. It won't return anything to the driver and