JAllemandou added a comment.

  In T342416#9101868 <https://phabricator.wikimedia.org/T342416#9101868>, 
@EBernhardson wrote:
  
  > These are both generated by spark.  The rdf is being imported by a scala 
application while the cirrus dump is imported by pyspark, but they should both 
be using the same underlying implementation. Both applications use 
`df.write.insertInto(table_name)` to instruct spark to do the actual output. 
I'm a bit surprised they end up generating different sets of permissions.
  >
  > I suppose it's not super important why the cirrus dump is world readable, 
it's fine to be readable, it just hints to me that there is something I don't 
understand about hdfs/spark/permissions happening here.
  
  Mwarf, wrong guess :) Interesting nonetheless - Let me know if you wish we 
pair on this.

TASK DETAIL
  https://phabricator.wikimedia.org/T342416

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: EBernhardson, JAllemandou
Cc: dcausse, BTullis, AndrewTavis_WMDE, Aklapper, JAllemandou, 
Danny_Benjafield_WMDE, Mohamed-Awnallah, Astuthiodit_1, AWesterinen, lbowmaker, 
karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, Akuckartz, 
Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, 
QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, 
Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
_______________________________________________
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org

Reply via email to