Hello Nicholas,
Maybe it's a bit late, but I've found that for this kind of thing OpenRefine (formerly Google Refine) works fine if the catalog you want to query allows passing parameters in the URL. You can fetch the HTML of the catalog pages and easily detect whether each one shows a record or not. I've done this for several projects and found it a good answer to this problem.
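
For anyone who prefers a script, the same idea (build a URL per ISBN, fetch the page, look for the "no results" text) can be sketched in Python. This is only a rough sketch under assumptions: the URL template below is a made-up placeholder, not the real Copac or WorldCat address, and the marker text is the one quoted in the original question, so both would need checking against the actual catalog. The function names are hypothetical too.

```python
import csv
import io
import time
import urllib.parse
import urllib.request

# Hypothetical URL template (assumption): replace with the catalog's real search URL.
SEARCH_URL = "https://catalogue.example.org/search?isbn={isbn}"
# Marker text quoted in the original question (assumption): the real wording may differ.
NO_MATCH_TEXT = "Sorry, there are no search results"


def fetch_page(isbn, url_template=SEARCH_URL):
    """Download the results page for one ISBN and return its HTML as text."""
    url = url_template.format(isbn=urllib.parse.quote(isbn))
    with urllib.request.urlopen(url, timeout=30) as response:
        return response.read().decode("utf-8", errors="replace")


def find_unmatched(isbns, fetch=fetch_page, marker=NO_MATCH_TEXT, delay=1.0):
    """Return the ISBNs whose results page contains the no-match marker."""
    unmatched = []
    for raw in isbns:
        isbn = raw.strip()
        if not isbn:
            continue  # skip empty spreadsheet cells
        if marker in fetch(isbn):
            unmatched.append(isbn)
        time.sleep(delay)  # pause between requests to go easy on the catalog
    return unmatched


def to_csv_text(isbns):
    """Render the unmatched ISBNs as one-column CSV text for a spreadsheet."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["isbn"])
    for isbn in isbns:
        writer.writerow([isbn])
    return buffer.getvalue()
```

The `fetch` argument is there so the matching logic can be tried on saved pages before hitting a live catalog. It is also worth checking the catalog's terms of use before sending it a long list of automated queries.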

You can then replay the scenario with OpenRefine on a new set of data / ISBNs. I don't know whether it can be scripted, but it can be replayed as shown after the paragraph starting with "So now we have our cleaned data file" in this blog post: http://blog.ouseful.info/2013/07/26/using-openrefine-to-clean-multiple-documents-in-the-same-way/

Hope this helps.

Sylvain



On 13/08/2014 12:20, Nicholas Brown wrote:
Apologies for cross posting

Dear collective wisdom,

I'm interested in using automation software such as Macro Express or iMacros to feed a 
list of ISBNs from a spreadsheet into Copac or Worldcat and output a list of those that 
return no matches in the results screen. The idea would be to create a tool that can 
quickly, although rather roughly, identify rare items in a collection (though obviously 
this would be limited to items with ISBNs or other unique identifiers). I can write a 
macro which will sequentially search either catalogue for a list of ISBNs but am 
struggling with how to have the macro identify items with no matches (I have a vague idea 
about searching the results screen for the text "Sorry, there are no search 
results") and to compile them back into a spreadsheet.

I'd be keen to hear if anyone has attempted something similar, general advice, 
any potential pitfalls in the method outlined above or suggestions for a better 
way to achieve the same results. If something useful comes of it I'd be happy 
to share the results.

Many thanks for your help,
Nick

Nicholas Brown
Library and Information Manager
[email protected]
+44 (0)20 7749 1125
www.iniva.org
