Re: MD5 list of bitsavers files

2021-10-02 Thread Antonio Carlini via cctalk

On 01/10/2021 08:36, Paul Flo Williams wrote:

> https://vt100.net/manx/assets/2021-10-01-bitsavers-dec.md5
>
> Paul


Thanks for that. It turns out that some of the bitsavers files have 
changed since I picked them up (perhaps to add OCR or similar). (I also 
got further data from Richard via email, which showed the same issue.)


So I've had to do some other surgery to whittle down the list of files.
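
One way to spot files that have changed since they were downloaded is to compare checksums for entries that share a file name. This is only a sketch: matching on file name is an assumption (the message does not say how the changed files were identified), and `changed_since_download` and both mappings are hypothetical names.

```python
def changed_since_download(local, published):
    """Return file names present in both mappings whose MD5 values differ.

    Both arguments map a file name to its hex MD5 string. Matching by
    file name is an assumption made for this sketch.
    """
    return sorted(
        name
        for name, md5 in local.items()
        if name in published and published[name].lower() != md5.lower()
    )
```

Files flagged this way would still need a manual look, since a changed checksum says nothing about which copy is the better one.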


The total number of files that seem to be unique and unpublished is just 
under 2000. About 950 of those are original PDFs of 
manuals/specs/whatever (not scans) that I picked up while working at DEC.


The rest I've either scanned myself or found lying around on the internet 
somewhere.



That's rather a large amount to dump on Al without providing some 
metadata. I think for most of them I can dig up a title. So I could 
provide an index file to link the filepath and title, something like:


0002/MANUALS/EK/XMIADHB.PDF: "XMI Adapters Handbook"

Would that help, or is there more that I might be able to provide?
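
For illustration, an index in that `filepath: "title"` shape could be written and read back roughly as follows. This is a sketch under assumptions: the function names are hypothetical, no escaping of quotes is attempted, and it assumes a file path never contains the sequence `: "`.

```python
def format_entry(path, title):
    """Render one index line as: path: "title"."""
    return f'{path}: "{title}"'


def parse_entry(line):
    """Split one index line back into (path, title).

    Splits at the first ': "' and drops the final closing quote, so a
    title may itself contain double quotes.
    """
    path, sep, rest = line.rstrip("\n").partition(': "')
    if not sep or not rest.endswith('"'):
        raise ValueError(f"not an index line: {line!r}")
    return path, rest[:-1]
```

Keeping one entry per line means the index can still be checked with ordinary text tools before anything parses it.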



Antonio




--
Antonio Carlini
anto...@acarlini.com



Re: MD5 list of bitsavers files

2021-10-01 Thread Paul Flo Williams via cctalk
On Thu, 30 Sep 2021 18:47:52 +0100
Antonio Carlini via cctalk wrote:

> I see that manx lists the MD5 checksum for many files, at least it
> does for those from bitsavers. Is there a publicly available list of
> URL and MD5 checksum? This would make it relatively easy for me to
> cross check my files against the list and whittle down to a subset
> that I should make available.

https://vt100.net/manx/assets/2021-10-01-bitsavers-dec.md5

Paul


Re: MD5 list of bitsavers files

2021-09-30 Thread Al Kossow via cctalk

On 9/30/21 10:47 AM, Antonio Carlini via cctalk wrote:

> Alternatively, is the current manx database available anywhere?

Richard Thompson should be maintaining it.

Now that bitsavers is on a machine that I have a shell account on, I need
to start generating fixity information for the files there.
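
As a rough idea of what generating fixity information could look like, the sketch below walks a directory tree and emits one `digest  relative/path` line per file. The output layout, the default algorithm and the names `file_digest` and `manifest_lines` are all assumptions; nothing in the thread says how bitsavers will actually do it.

```python
import hashlib
from pathlib import Path


def file_digest(path, algorithm="sha256", chunk_size=1 << 20):
    """Hash a file in chunks so large scans need not fit in memory."""
    h = hashlib.new(algorithm)
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def manifest_lines(root, algorithm="sha256"):
    """Yield 'digest  relative/path' for every file under root, sorted by path."""
    root = Path(root)
    for p in sorted(root.rglob("*")):
        if p.is_file():
            yield f"{file_digest(p, algorithm)}  {p.relative_to(root).as_posix()}"
```

Passing `algorithm="md5"` would give digests comparable with the MD5 values that manx already lists.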


MD5 list of bitsavers files

2021-09-30 Thread Antonio Carlini via cctalk
I'm trying to list out the document scans I have and work out which are 
already on bitsavers and which are not (and, indeed, a fair few of these 
are originally from bitsavers anyway). This is probably several thousand 
files total, so searching manx by hand is not an option!


I see that manx lists the MD5 checksum for many files, at least it does 
for those from bitsavers. Is there a publicly available list of URL and 
MD5 checksum? This would make it relatively easy for me to cross check 
my files against the list and whittle down to a subset that I should 
make available.
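
The cross-check described above might be sketched like this. It assumes the published list uses the common md5sum layout (32 hex digits, whitespace, then a path or URL); the real list's format is not stated here, and the names `load_known_md5s`, `md5_of_file` and `unpublished` are hypothetical.

```python
import hashlib
import re
from pathlib import Path

# Assumed md5sum-style line: 32 hex digits, whitespace, optional '*', then a path.
MD5_LINE = re.compile(r"^([0-9a-fA-F]{32})\s+\*?(.+)$")


def load_known_md5s(text):
    """Map lower-case checksum -> path/URL for each recognisable line."""
    known = {}
    for line in text.splitlines():
        m = MD5_LINE.match(line.strip())
        if m:
            known[m.group(1).lower()] = m.group(2)
    return known


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 of a file, reading it in chunks."""
    h = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def unpublished(root, known):
    """Return local files under root whose MD5 is not in the published list."""
    return sorted(
        p for p in Path(root).rglob("*")
        if p.is_file() and md5_of_file(p) not in known
    )
```

A match on checksum only proves the file is byte-identical to a published one; a document that was rescanned or had OCR added later would still show up as unpublished.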



Alternatively, is the current manx database available anywhere? I know 
the code is on GitHub, but I didn't see the data there. (I do have an 
SQL dump from 2010 when manx changed hands, but that's not recent enough 
to save much effort.)


I could try to do some parsing of bitsavers-filename => DEC-part-number 
and eliminate files that way, but that seems inexact at best. Or I could 
just download the DEC subset of files (spread across the mirrors) but 
that seems a bit antisocial.



Antonio


--
Antonio Carlini
anto...@acarlini.com