Greetings, Rich.

Hope it helps.

A few notes:
- I modified the notebook and ran it again to capture more data
- I added headers to the notebook so that there is a minimal Table of
Contents to make jumping around easier
- the notebook mirrored the site in just under 5 minutes in the CoLab
environment
- there are a total of 1609 files
- total size is just under 15 GB
- two files suffered a 503 error
- I added a few code cells to find those errors and reattempt a download,
if desired
- a full tree, long listing, and short listing of files are included in the
notebook at the end

Good luck and I'm curious to hear what you do next with the data.

Regards,
- Robert


On Sat, Nov 18, 2023 at 6:38 AM Rich Shepard <[email protected]>
wrote:

> On Sat, 18 Nov 2023, Robert Citek wrote:
>
> > I was able to mirror the site in Google's Colab.  Here's a gist with a
> > notebook describing what I did and its output:
> >
> > https://gist.github.com/rwcitek/8d3035f6d2931d80f0569d3964fa6e28
> >
> > In the notebook, you can click on the "Open in Colab" button to run the
> > commands in your own Colab environment.
>
> Robert,
>
> Thank you.
>
> Regards,
>
> Rich
>

Reply via email to