Can you please share the code? It doesn't seem an ideal solution, but if AQE is confused, disabling it makes sense. I can't figure out why a low partition count for an 8 node, 6 edge network would require a lot of partitions... users may have different numbers... do you suggest we enforce some minimum partition count?
On Tue, Jan 14, 2025 at 7:33 AM Ángel <angel.alvarez.pas...@gmail.com> wrote: > Are you sure that temporarily disabling a global setting like AQE is the > best approach to fix this issue? > I increased the number of shuffle partitions in the Spark session > configure in the GraphFrameTestSparkContext.scala from 4 to 10, and the > "checkpoint interval" test ran perfectly without throwing an OOM error. > Why? No idea, but it worked. > > > > El lun, 13 ene 2025 a las 16:45, Russell Jurney (<russell.jur...@gmail.com>) > escribió: > >> Merged, thanks guys! >> >> Russ >> >> On Sun, Jan 12, 2025 at 2:23 PM Bjørn Jørgensen <bjornjorgen...@gmail.com> >> wrote: >> >>> I tested stuff in this PR >>> https://github.com/graphframes/graphframes/pull/477 and then I made >>> this PR https://github.com/graphframes/graphframes/pull/478 >>> >>> søn. 12. jan. 2025 kl. 23:10 skrev Ángel <angel.alvarez.pas...@gmail.com >>> >: >>> >>>> Hi Russell, >>>> >>>> I've just got the OOM error during Test 13. I'm running it from >>>> IntelliJ on Windows with Java 11. >>>> >>>> [image: image.png] >>>> I'll look into it over the course of the next week. >>>> >>>> Regards, >>>> Ángel >>>> >>>> El sáb, 11 ene 2025 a las 9:23, Russell Jurney (< >>>> russell.jur...@gmail.com>) escribió: >>>> >>>>> Friends of GraphFrames (github.com/graphframes/graphframes), I have a >>>>> question for you... >>>>> >>>>> I can't get the unit test 'two components and two dangling vertices' >>>>> in the org.graphframes.lib.ConnectedComponentsSuite >>>>> <https://github.com/graphframes/graphframes/blob/649094caf58cfda0eea3e8cd66785aa38104d771/src/test/scala/org/graphframes/lib/ConnectedComponentsSuite.scala#L138-L148> >>>>> to pass. It fails with an 'OutOfMemoryError: Java heap space' error. I am >>>>> a >>>>> little stuck on completing a docs release with a motif finding >>>>> tutorial <https://github.com/graphframes/graphframes/pull/473> due to >>>>> this issue. >>>>> >>>>> The problem is outlined in this gist: >>>>> https://gist.github.com/rjurney/6abeffbd59c67df5e5243c8f6619b6bf >>>>> >>>>> Can someone else please try this and see if it passes on the master >>>>> branch? >>>>> >>>>> > build/sbt clean compile package test >>>>> >>>>> I've tried giving it lots of RAM just to see if it would help, as much >>>>> as 32g driver and 16g for executors and... it has no effect. The test >>>>> graph >>>>> is 8 nodes and 6 edges >>>>> <https://gist.github.com/rjurney/6abeffbd59c67df5e5243c8f6619b6bf#file-connectedcomponentsuite-scala-L22-L26>, >>>>> so it shouldn't have a memory problem... yet when it runs, all 24 cores of >>>>> my CPU get used, it spikes as indicated in the image in the gist. >>>>> >>>>> I am running the following setup: >>>>> >>>>> * Ubuntu 20.04 (22.04 in the Docker image) >>>>> * OpenJDK 11 (I also tried 8, same problem) >>>>> * Scala 2.12.20 (I also tried 2.13, same problem) >>>>> * Python 3.11 (I also tried 3.9, same problem) >>>>> >>>>> Or I am running the Dockerfile in the gist >>>>> <https://gist.github.com/rjurney/6abeffbd59c67df5e5243c8f6619b6bf#file-dockerfile> >>>>> . >>>>> >>>>> Any help much appreciated! Thanks >>>>> >>>>> ----------------------------------------------------------------- >>>>> Oh, some new community stuff for GraphFrames. Hackathon announced next >>>>> week :) >>>>> >>>>> >>>>> - GraphFrames Mailing List >>>>> <https://groups.google.com/g/graphframes/>: ask questions about >>>>> GraphFrames on our Google Group >>>>> - #graphframes Discord Channel on GraphGeeks >>>>> <https://discord.com/channels/1162999022819225631/1326257052368113674> >>>>> >>>>> Thanks! >>>>> Russell Jurney @rjurney <http://twitter.com/rjurney> >>>>> russell.jur...@gmail.com LI <http://linkedin.com/in/russelljurney> FB >>>>> <http://facebook.com/jurney> datasyndrome.com >>>>> >>>> >>> >>> -- >>> Bjørn Jørgensen >>> Vestre Aspehaug 4, 6010 Ålesund >>> <https://www.google.com/maps/search/Vestre+Aspehaug+4,+6010+%C3%85lesund++Norge?entry=gmail&source=g> >>> Norge >>> <https://www.google.com/maps/search/Vestre+Aspehaug+4,+6010+%C3%85lesund++Norge?entry=gmail&source=g> >>> >>> +47 480 94 297 >>> >>