AndrewTavis_WMDE added a comment.

  @dcausse, a general point on my end is that when I'm trying to run the code 
that you sent along via an HTML on `people.wikimedia.org` I'm getting the 
following as an output of Spark runs repeated over and over again:
  
    23/07/31 13:01:58 WARN YarnSchedulerBackend$YarnSchedulerEndpoint: 
Requesting driver to remove executor 1 for reason Container killed by YARN for 
exceeding physical memory limits. 4.4 GB of 4.4 GB physical memory used. 
Consider boosting spark.executor.memoryOverhead.
  
  This seems to be happening given your `create_custom_session` setup, and 
doesn't happen when I do normal `create_session` as seen below:
  
    spark_session = wmf.spark.create_session(type='yarn-large', 
app_name="wdqs-subgraph-analysis")
  
  Would you be able to let me know if there's something in my permissions or 
setup that's causing this? I'm assuming that your setup will make queries 
faster, but we can disregard if my working setup gets me mostly there. I'm 
running Jupyter on `stat1005`, and saw that AKhatun was using `stat1008`, in 
case that's helpful information :)

TASK DETAIL
  https://phabricator.wikimedia.org/T342111

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: AndrewTavis_WMDE
Cc: Lydia_Pintscher, dcausse, Gehel, dr0ptp4kt, AndrewTavis_WMDE, Aklapper, 
Manuel, Danny_Benjafield_WMDE, Astuthiodit_1, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Mbch331
_______________________________________________
Wikidata-bugs mailing list -- [email protected]
To unsubscribe send an email to [email protected]

Reply via email to