potiuk commented on PR #33117:
URL: https://github.com/apache/airflow/pull/33117#issuecomment-1666030947

   Yes. We have to. This is for security and isolation. And quite the oposite - 
maybe I do not understand what "expropriation" means (difficult word) and what 
it means in the context of locking. 
   
   I think you need to explain your reasoning. 
   
   Which locks you talk about ? what contention ? with what? 
   
   From what I know how it works multiprocessing (with forking as used here) 
are the only reasonable way to achieve high parallelism and using all the cores 
for just parsing. each process has it's own file locks and have no contentions 
with the other parsers. Then they save to database as single operation when the 
complete parsing. 
   
   I am not sure what you are proposing instead - taking into account GIL 
limitations of Python, it would be completely unfeasible to use Threading in 
this case instead but maybe you have something else in mind.
   
   Please elaborate what your proposal is.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to