URL: <https://savannah.gnu.org/bugs/?64082>
Summary: wget unescapes URLs used as CSS url() parameters, leading to spaces and thus invalid CSS
Group: GNU Wget
Submitter: None
Submitted: Thu 20 Apr 2023 10:17:29 PM UTC
Category: Program Logic
Severity: 3 - Normal
Priority: 5 - Normal
Status: None
Privacy: Public
Assigned to: None
Originator Name: Ethan Gibbs
Originator Email: et...@snowsign.net
Open/Closed: Open
Release: 1.20
Discussion Lock: Any
Operating System: Microsoft Windows
Reproducibility: Every Time
Fixed Release: None
Planned Release: None
Regression: No
Work Required: None
Patch Included: No

_______________________________________________________

Follow-up Comments:

-------------------------------------------------------
Date: Thu 20 Apr 2023 10:17:29 PM UTC By: Anonymous

Attempting to invoke wget with the --convert-links / -k parameter works as intended, but fails to account for the fact that CSS url() parameters cannot contain spaces.

In my case, wget invoked on Apr 20, 2023 with

    wget -rk -l 1 https://www.calpoly.edu/

converted the filename

    20200115-Sailplane-JoeJ0583-min%20%281%29.jpg

to

    20200115-Sailplane-JoeJ0583-min (1).jpg

The page uses the CSS

    background-image: url(/sites/calpoly.edu/files/styles/width_2048px/public/2020-01/20200115-Sailplane-JoeJ0583-min%20%281%29.jpg?itok=QrMTAlEd)

on a div, which breaks when containing a space.

This bug has likely always been in wget. Here is a Stack Overflow question detailing the bug from 10 years ago:
https://stackoverflow.com/questions/13300017/wget-download-relative-link-conversion-misses-whitespace-encoding-for-css-url

There is no relevant wget output for this bug.

_______________________________________________________

Reply to this item at:

  <https://savannah.gnu.org/bugs/?64082>

_______________________________________________
Message sent via Savannah
https://savannah.gnu.org/
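
For illustration only, the behavior the report expects from -k can be sketched as follows: when a local file name is written back into a CSS url(), characters such as spaces and parentheses are percent-encoded again. This is a Python sketch, not wget's actual C code; the function name css_url_for_local_file is hypothetical, and it assumes the converted link is a plain relative path with no query string.

```python
from urllib.parse import quote


def css_url_for_local_file(local_path: str) -> str:
    """Return a CSS url() value for a local path (hypothetical helper).

    Spaces, parentheses, quotes and other reserved characters are
    percent-encoded so the unquoted url() token stays valid. Path
    separators are kept as-is.
    """
    encoded = quote(local_path, safe="/")
    return f"url({encoded})"


# The file name from the report, as saved on disk after download.
print(css_url_for_local_file("20200115-Sailplane-JoeJ0583-min (1).jpg"))
# prints: url(20200115-Sailplane-JoeJ0583-min%20%281%29.jpg)
```

With this encoding, the value written into background-image matches the form the original page used for the file name, and the browser decodes it back to the name on disk when resolving the link.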