I'm bored, so I do a regular daily report at the DSL Reports "CanChat"
sub-forum, on the Covid-19 case counts for Ontario, using provincial
data.  I download 2 files daily as source data.  One of them is a PDF
file, which is run through "pdftotext" and then parsed by a bash script
(don't ask).  Today, the command...

  wget https://files.ontario.ca/moh-covid-19-report-en-2021-01-14.pdf

...returns a zero-byte file.  *BUT*, sticking the URL into the URL bar
of Pale Moon and Google Chrome (and I assume Firefox/etc) brings up the
PDF file just fine.  Is "wget" being blocked?  I have to do extra steps
to get from the browser-invoked PDF to get the PDF file saved to the
standard work area where my script expects it to be, so it can work its
magic and parse out the daily breakdown by PHU (Public Health Unit).
BTW, today's posts requiring the PDF file are...
https://www.dslreports.com/forum/r33002718-
https://www.dslreports.com/forum/r33002752-

  I've tried setting --user-agent= with my browser's string as shown by
https://www.whatismybrowser.com/detect/what-is-my-user-agent  but no
luck.  Is there some way to get around this?  I have not updated this
past week, so I don't think the problem is at my end.

-- 
Walter Dnes <waltd...@waltdnes.org>
I don't run "desktop environments"; I run useful applications

Reply via email to