AutomationDev85 opened a new pull request, #43954:
URL: https://github.com/apache/airflow/pull/43954

   # Description
   
   Edge worker runs into UnicodeDecodeError exception during log file upload. 
The Worker reads log file as string and counts number of string characters to 
seek into the file during uploading next log file part. This can result in 
mismatch between number of bytes and number of string characters if log file 
contains none UTF-8 characters. So during upload of next log file part worker 
can jump between 2 bytes of none utf-8 characters and crashes. Edge worker now 
uses read bytes to calc place to seek into and decode string as UTF-8 to handle 
none UTF-8 charaters.
   
   # Details about changes
   
   * Worker reads log file as byte string to calc number of read bytes to seek 
in.
   * Worker decode byte string to UTF-8
   * Changed unit tests to test none UTF-8 log file.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to