On Tue, May 12, 2020 at 6:45 AM Luca Bernardi <[email protected]> wrote: > > URL: > <https://savannah.gnu.org/bugs/?58354> > > Summary: Wget doesn't parse URIs starting with http:/ > Project: GNU Wget > Submitted by: f0ff > Submitted on: Tue 12 May 2020 10:45:17 AM UTC > Category: None > Severity: 3 - Normal > Priority: 5 - Normal > Status: None > Privacy: Public > Assigned to: None > Originator Name: > Originator Email: > Open/Closed: Open > Release: 1.14 > Discussion Lock: Any > Operating System: GNU/Linux > Reproducibility: Every Time > Fixed Release: None > Planned Release: None > Regression: None > Work Required: None > Patch Included: No > > _______________________________________________________ > > Details: > > Hi, > Wget refuses to parse URIs that start with http:/ (note single slash), e.g. > http:/wp-includes/css/dist/block-library/style.min.css?ver=5.4.1. These are > widely accepted by browsers. > > Command that I've used: `wget --user-agent=Mozilla --content-disposition > --page-requisites --adjust-extension --restrict-file-names=windows -d -e > robots=off -m -k -E -r -l 10 -p -N -F -P crawl -nH $IP`
You may as well make the slashes optional in the protocol string. Berners Lee does not like them anyway, https://www.mentalfloss.com/uk/history/27802/10-inventors-who-came-to-regret-their-creations. Jeff
