potiuk commented on issue #31105:
URL: https://github.com/apache/airflow/issues/31105#issuecomment-1695037219

   > @potiuk and/or @getaaron, let me know if step 1 is a reasonable short term 
approach or whether I should just skip it and do step 2.
   
   Yes. This is exactly how I would have imagined it too. It's quite likely as 
well that we will stop at task 1 if we find 2 too much of complexity. I have a 
feeling that trying to implement k-way merge sort in this case is not giving 
"the bang for the bucks".  If you have such a huge log, finding anything 
meaningful in Airlfow UI for it will be next to impossible, search and 
everything there will also struggle if you try to scroll through it and you 
would likely have to implement a full-fledged search and likely other APIs to 
make use of such a huge log. 
   
   In this case just posting a message "Your log is definitely too large to 
handle" and pointing the user to "Consider fine-tuning your tasks to make your 
logs smaller or configure remote logging and choose some dedicated logging 
solution such to store your logs"  is quite likely 9X% of the effect with 1% of 
the effort. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to