Re: [Bibdesk-users] pdfmeat

Ken Mankoff Sat, 25 Jan 2014 16:08:47 -0800

OK, perhaps the last post for now on this project
https://github.com/mankoff/BibDeskAppleScripts


It seems like it would be quite easy to get full PDF download from any site
plus any metadata provided by either Google Scholar or the publisher. The
key is to use 3rd party tools.

pdfmeat.py does an excellent job, in my experience, of fetching BibTeX
records. I'd like to modify it to accept a specified search string instead
of always parsing it from the PDF which is error-prone.

bibfetch.pl does an OK job of fetching BibTeX records (Google Scholar
only) and is used when no PDF file is available for (the current version
of) pdfmeat.py. bibfetch.pl also has an option to provide any URLs to PDFs
that Google Scholar reports. Therefore, another few commands (for example,
shell command to curl) would download the PDF, which could then be
auto-filed with the BibDesk record being modified.

This is now a fairly comprehensive solution that downloads PDFs if
available, from any website. If the PDF already exists but there is no
metadata in BibDesk, that is OK, it still gets most of the data thanks to
pdfmeat.py. Either way, any missing records are filled in.

The drawback of course is two external tools, which each have their own
dependencies. The Python script required me to "pip install" a few things,
and uses pdftotext, which suggests an entire LaTeX install. This might not
be a problem with the BibDesk crowd. bibfetch.pl needed a few CPAN packages
installed too.

Cheers,

    -k.

------------------------------------------------------------------------------
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments & Everything In Between.
Get a Quote or Start a Free Trial Today.
http://pubads.g.doubleclick.net/gampad/clk?id=119420431&iu=/4140/ostg.clktrk

_______________________________________________
Bibdesk-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/bibdesk-users

Re: [Bibdesk-users] pdfmeat

Reply via email to