I am still working on making a minimal test that I can share without my
work-specific code in it. The problem occurs when a DataFrame with several
hundred columns is asked to do a randomSplit. The random split works with up
to about 350 columns so far, but in my code it breaks at 600 columns; that
dataset is produced by converting case classes to a DataFrame. The error is
deterministic with Scala 2.11.
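
For illustration, here is a minimal sketch of the shape of the failing case
(hypothetical column names and sizes, not my actual work code). As I
understand it, in Spark 2.0 randomSplit sorts each partition on every column
to make the split deterministic, so the generated SpecificOrdering.compare
method has to cover all 600 columns and can exceed the JVM's 64 KB per-method
bytecode limit:

import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.lit

object WideSplitRepro {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("WideSplitRepro")
      .getOrCreate()

    // Build a DataFrame with several hundred columns; ~350 columns still
    // works for me, 600 fails.
    val base = spark.range(100).toDF("id")
    val wide = (0 until 600).foldLeft(base) { (df, i) =>
      df.withColumn(s"c$i", lit(i.toDouble))
    }

    // randomSplit sorts on all columns internally, which is where the
    // generated SpecificOrdering can grow beyond 64 KB.
    val Array(train, test) = wide.randomSplit(Array(0.8, 0.2), seed = 42L)
    println(s"train=${train.count()}, test=${test.count()}")
  }
}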

Once I have a deterministically failing test that is free of work code, I
will file a JIRA bug.
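
In the meantime, a possible workaround I'm considering (an assumption on my
part, not a confirmed fix): materialize a seeded random column and filter on
it, so the split never needs a generated ordering over all of the columns.
This sketch assumes the `wide` DataFrame from the repro above:

import org.apache.spark.sql.functions.rand

// Hypothetical workaround sketch: split on a seeded random column instead
// of calling randomSplit, avoiding the sort over all 600 columns.
val withRnd = wide.withColumn("__rnd", rand(42L))
val train = withRnd.filter(withRnd("__rnd") < 0.8).drop("__rnd")
val test  = withRnd.filter(withRnd("__rnd") >= 0.8).drop("__rnd")

Whole-stage codegen over 600 columns may still be heavy, so I haven't
verified this avoids every 64 KB issue, only the ordering one.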

On Tue, Aug 16, 2016, 04:17 Ted Yu <yuzhih...@gmail.com> wrote:

> I think we should reopen it.
>
> On Aug 16, 2016, at 1:48 AM, Kazuaki Ishizaki <ishiz...@jp.ibm.com> wrote:
>
> I just realized it, since it broke a build with Scala 2.10.
>
> https://github.com/apache/spark/commit/fa244e5a90690d6a31be50f2aa203ae1a2e9a1cf
>
> I can reproduce the problem in SPARK-15285 with master branch.
> Should we reopen SPARK-15285?
>
> Best Regards,
> Kazuaki Ishizaki,
>
>
>
> From:        Ted Yu <yuzhih...@gmail.com>
> To:        dhruve ashar <dhruveas...@gmail.com>
> Cc:        Aris <arisofala...@gmail.com>, "user@spark.apache.org" <
> user@spark.apache.org>
> Date:        2016/08/15 06:19
> Subject:        Re: Spark 2.0.0 JaninoRuntimeException
> ------------------------------
>
>
>
> Looks like the proposed fix was reverted:
>
>     Revert "[SPARK-15285][SQL] Generated SpecificSafeProjection.apply
> method grows beyond 64 KB"
>
>     This reverts commit fa244e5a90690d6a31be50f2aa203ae1a2e9a1cf.
>
> Maybe this was fixed in some other JIRA?
>
> On Fri, Aug 12, 2016 at 2:30 PM, dhruve ashar <dhruveas...@gmail.com> wrote:
> I see a similar issue that was resolved recently:
> https://issues.apache.org/jira/browse/SPARK-15285
>
> On Fri, Aug 12, 2016 at 3:33 PM, Aris <arisofala...@gmail.com> wrote:
> Hello folks,
>
> I'm on Spark 2.0.0 working with Datasets. Although unit tests with smaller
> data pass on my laptop, on a cluster I get cryptic error messages:
>
> Caused by: org.codehaus.janino.JaninoRuntimeException: Code of method
> "(Lorg/apache/spark/sql/catalyst/InternalRow;Lorg/apache/spark/sql/catalyst/InternalRow;)I"
> of class
> "org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering"
> grows beyond 64 KB
>
> Unfortunately I'm not clear on how to even isolate the source of this
> problem. I didn't have this problem in Spark 1.6.1.
>
> Any clues?
>
>
>
> --
> -Dhruve Ashar
