On Nov 2, 2007, at 9:39 PM, Adam M. Goldstein wrote:

> I mentioned this on this list quite a while ago, when it was in its
> really early stages.

ah, me need to pay attention :)

> Used with PDF-to-text, you can process a PDF and then have cb2bib
> import the references.

Yeah, strangely you can't pass it whole text files of references,
though. I think that would only need a small tweak to the code, however.

> It didn't work so well when I tried it before, but it sounds like it's
> improving. It looks like people submit regexps that they've created
> for different journal sources.

Yeah, I'll submit mine when I'm done, but most of these ones are pretty
specific.

> One problem is that many PDFs from professionally published journals
> seem to be locked and so don't permit extracting their text.

Yes, I've written to a few publishers about that. It's really silly;
honestly, who makes money out of unauthorized copies of journal
articles? It just annoys regular users trying to use attributed
quotations.

> -Adam
>
> On Nov 2, 2007, at 9:31 PM, James Howison wrote:
>
>> It takes plain text references and uses regexes to turn them into
>> BibTeX.
>>
>> http://www.molspaces.com/d_cb2bib-overview.php
>>
>> It has some built-in regexes (e.g. JSTOR and PubMed), but it is more
>> useful once you get into writing your own regexes. The interface is
>> very, shall we say, Linux quirky, but there is a Mac binary and it
>> does work. One thing I like is that you just use a single capture for
>> the whole authors or editors string and it is quite smart about
>> normalizing them for BibTeX.
>>
>> The regex format it is using I found wasn't well documented (OK, I'm
>> sure if I looked in the source I could find out!), but I found it to
>> basically be PCRE with the slightly annoying quirk that it doesn't
>> work with the 'non-greedy' specifier, i.e. (.*) works (doing a greedy
>> match) while (.*?) doesn't work (AFAIK it should do a non-greedy
>> match). It does some pre-processing for newlines across platforms
>> (that is documented on the website).
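
For anyone hitting the greedy-only quirk described above, here is a rough sketch of the usual workaround: bound each capture with a negated character class instead of a lazy quantifier. It uses Python's `re` module purely as a stand-in, not cb2bib's own regex engine, and the reference string, field names, and the `parse` and `to_bibtex` helpers are all made up for illustration. Author normalization is left out, since cb2bib is said to handle that itself.

```python
import re

# Hypothetical Harvard-style reference, invented for this example.
REFERENCE = ("Smith, J. and Jones, M. (2005) Reporting the news. "
             "Journalism Studies, 6(2), pp. 101-115.")

# What one might naturally write with lazy quantifiers.
LAZY = re.compile(
    r"^(.+?) \((\d{4})\) (.+?)\. (.+?), (\d+)\((\d+)\), pp\. (\d+)-(\d+)\.$")

# Greedy-only rewrite: every capture stops at the first character that
# cannot belong to it, so no lazy quantifier is needed.
GREEDY = re.compile(
    r"^([^(]+) \((\d{4})\) ([^.]+)\. ([^,]+), (\d+)\((\d+)\), pp\. (\d+)-(\d+)\.$")

FIELDS = ("author", "year", "title", "journal",
          "volume", "number", "first", "last")


def parse(reference, pattern=GREEDY):
    """Return a dict of captured fields, or None if the pattern does not match."""
    match = pattern.match(reference)
    if match is None:
        return None
    return dict(zip(FIELDS, match.groups()))


def to_bibtex(fields, key):
    """Render parsed fields as a BibTeX @article entry (author string left as-is)."""
    body = {
        "author": fields["author"],
        "title": fields["title"],
        "journal": fields["journal"],
        "year": fields["year"],
        "volume": fields["volume"],
        "number": fields["number"],
        "pages": f"{fields['first']}--{fields['last']}",
    }
    lines = [f"  {name} = {{{value}}}," for name, value in body.items()]
    return "\n".join([f"@article{{{key},"] + lines + ["}"])


if __name__ == "__main__":
    print(to_bibtex(parse(REFERENCE), "smith2005"))
```

The trade-off is that the greedy-only pattern is stricter: a title containing a full stop, or a journal name containing a comma, would need a different delimiter, which is probably why these regexes end up so journal-specific.
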
>>
>> I'm using it to turn text bibliographies in something like Harvard
>> citation format (from Journalism Studies) into BibTeX references.
>>
>> --J

_______________________________________________
Bibdesk-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/bibdesk-users
