[
https://issues.apache.org/jira/browse/BEAM-1625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aviem Zur updated BEAM-1625:
----------------------------
Description:
{{BoundedDataset#action}} does not materialize RDD since it uses
{{foreachPartition}} which is not an action, instead use {{foreach}} which is.
The reason it worked until now was because there was also a call to {{cache()}}
before it.
http://spark.apache.org/docs/1.6.3/programming-guide.html#actions
was:
{{BoundedDataset#action}} does not materialize RDD since it uses
{{foreachPartition}} which is not an action, instead use {{foreach}} which is.
See: http://spark.apache.org/docs/latest/programming-guide.html#actions
> BoundedDataset action() does not materialize RDD
> ------------------------------------------------
>
> Key: BEAM-1625
> URL: https://issues.apache.org/jira/browse/BEAM-1625
> Project: Beam
> Issue Type: Bug
> Components: runner-spark
> Reporter: Aviem Zur
> Assignee: Aviem Zur
> Fix For: 0.6.0
>
>
> {{BoundedDataset#action}} does not materialize RDD since it uses
> {{foreachPartition}} which is not an action, instead use {{foreach}} which is.
> The reason it worked until now was because there was also a call to
> {{cache()}} before it.
> http://spark.apache.org/docs/1.6.3/programming-guide.html#actions
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)