Hey guys,

Introducing Apache Zeppelin to a new org, I have recently noticed that on
quite a simple but important use case the output of Zeppelin is *very
different* from spark-shell

I can print partitions of the RDD in spark-shell

```
scala> val data = sc.parallelize(List((1, 2), (1, 1), (2, 3), (2, 1), (1,
4), (3, 5)), 2)
scala> data.mapPartitions { _.map { println(_) } } collect
[Stage 0:>                                                          (0 + 0)
/ 2]
(2,1)
(1,4)
(3,5)
(1,2)
(1,1)
(2,3)
res0: Array[Unit] = Array((), (), (), (), (), ())
```

But the same code in Zeppelin does not include output of the print
statement at all :/ Tried both, 0.7.1 and master.

```
data.mapPartitions { _.map { println(_) } } collect
res2: Array[Unit] = Array((), (), (), (), (), ())
```
[image: Inline image 1]

Is that expected or did I miss something? Please let me know if you have
any ideas.

--
Alex

Reply via email to