I think the first feature would be nice, I don't really like the script idea though. As for testing, that could go through the nightly builds. Your code seems to be leaking a PDFDocument. Just had a quick look at it.
Christiaan On 13 Jan 2009, at 10:38 PM, Gregory Jefferis wrote: > Hi, > > I have written some code to parse dropped pdfs for DOIs and generate > a new > BibItem if they can be found on PubMed. This uses PDFKit to > generate a text > representation of the first two pages of the pdf followed by a regex > search > for DOIs. The DOI regexes have been tested by me for a while in a > standalone program and are quite forgiving. > > A second feature allows an optional external script to be called > before full > text doi parsing that can rapidly check a PDF's attributes for a doi > or > check if the pdf name conforms to certain patterns typical of > Elsevier or > Nature Publishing Group journals. This speeds up addition of some > PDFs. I > guess this would make something hackable by knowledgeable end > users. I have > such a script that I have been using for a while. > > Both features are under the control of hidden preferences, and > therefore off > by default (unless I now modify Info.plist). I do think these > feature would > be very much appreciated by all biologists using BibDesk. > > A few questions: > > 0) Does this seem a reasonable addition? Have I missed any similar > functionality? > 1) What's the best way to share this for testing? > 2) Would anyone be prepared to review my code before (or after) I > commit to > trunk - I confess I'm a little sketchy on Obj C memory management, > so there > is always the possibility of a missed release. It's ~100 lines of > code. > 3) Would anyone see a way to extend this kind of functionality to > other > bibliographic sources besides PubMed? (Not that I was planning to > implement > this as well, but rather so that anything I do could be left modular > enough > for others to extend) > 4) Is there a way that this functionality could be called when a PDF > is > viewed in BibDesk's built in browser (eg by browsing a PDF at a > journal's > website)? > > Many thanks, > > Greg. > > Diff at: > > http://pastie.org/359890 > > > -- > Gregory Jefferis, PhD jeffe...@mrc-lmb.cam.ac.uk > Division of Neurobiology LMB Lab: +44 (0)1223 > 252943 > MRC Laboratory of Molecular Biology, LMB Office: +44 (0)1223 > 252944 > Hills Road, LMB Fax: +44 (0)1223 > 402310 > Cambridge, CB2 0QH, UK. Voicemail: +44 (0)1223 > 339899 > > http://www2.mrc-lmb.cam.ac.uk/NB/jefferis_g > http://www.neuroscience.cam.ac.uk/directory/profile.php?gsxej2 > http://flybrain.stanford.edu > > > > > ------------------------------------------------------------------------------ > This SF.net email is sponsored by: > SourcForge Community > SourceForge wants to tell your story. > http://p.sf.net/sfu/sf-spreadtheword > _______________________________________________ > Bibdesk-develop mailing list > Bibdesk-develop@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/bibdesk-develop ------------------------------------------------------------------------------ This SF.net email is sponsored by: SourcForge Community SourceForge wants to tell your story. http://p.sf.net/sfu/sf-spreadtheword _______________________________________________ Bibdesk-develop mailing list Bibdesk-develop@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bibdesk-develop