EBernhardson added a comment.

  Summary of what was done so far to deploy:
  
  - Tuned subgraph_mapping_weekly. Set Spark parallelism to 4096, increased 
memory to 24G (= 6G per task), and reduced the total executor count to keep 
total memory usage around 1TB. Changed `coalesce()` into `repartition()` in 
SubgraphMapper. The job now completes without any failed tasks. This might be 
a bit wasteful of memory, but it is probably not worth tuning further unless 
there are complaints; we can hope a later upgrade to Spark 3 with skew-join 
optimization will improve things. We could manually implement the same 
skew-join optimization on a per-use-case basis, but that is extra work that 
might not be necessary.
  - Enabled subgraph_metrics_weekly. Ran without issue.
  - This patch added a number of new sensors. We've been intending to switch 
sensors from `mode=poke` to `mode=reschedule`. Adding these new sensors 
reminded me of why we needed to make that change: all Airflow executors ended 
up occupied by sensors waiting for data to arrive. Deployed a patch to switch 
everything over.
  - Enabled subgraph_query_mapping_daily. This started waiting for 
snapshot=20220613 (last Monday) with an execution_date of 20220620 (also a 
Monday). I suspect we should adjust this to target snapshot=20220620, but I am 
waiting for confirmation. Turned it back off so it doesn't time out and 
complain.
  - Enabled subgraph_query_metrics_daily. This is waiting for 
`event.wdqs_external_sparql_query/datacenter=eqiad/year=2022/month=6/day=20` 
(and the same for codfw), but it needs to wait on the individual hourly 
partitions. I hadn't thought this through fully when reviewing the patch. We 
will need to adjust the sensor to use HivePartitionRangeSensor, which can 
generate all the intermediate hourly partition names. Turned it back off, as 
it's also waiting for the outputs of subgraph_query_mapping_daily (iiuc), 
which is currently turned off.
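
To illustrate what "all the intermediate hourly named partitions" means for 
the last item, here is a rough sketch in plain Python. It is not the 
HivePartitionRangeSensor implementation: `hourly_partitions` is a hypothetical 
helper, and the trailing `hour=N` component is an assumption about how the 
hourly partitions are named, extrapolated from the daily path quoted above.

```python
from datetime import date, datetime, timedelta


def hourly_partitions(table, datacenter, day):
    """Return the 24 hourly partition names for one day in one datacenter.

    Hypothetical helper for illustration only. The `hour=N` suffix is an
    assumed naming convention, not taken from the sensor's source.
    """
    start = datetime(day.year, day.month, day.day)
    names = []
    for offset in range(24):
        ts = start + timedelta(hours=offset)
        names.append(
            f"{table}/datacenter={datacenter}"
            f"/year={ts.year}/month={ts.month}/day={ts.day}/hour={ts.hour}"
        )
    return names


# The sensor would need every one of these for both datacenters,
# instead of the single day-level path it waits on today.
partitions = [
    name
    for dc in ("eqiad", "codfw")
    for name in hourly_partitions(
        "event.wdqs_external_sparql_query", dc, date(2022, 6, 20)
    )
]
```

Under those assumptions the daily run would wait on 48 partitions (24 hours 
for each of eqiad and codfw) rather than 2.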

TASK DETAIL
  https://phabricator.wikimedia.org/T303831
