On Fri, Jan 20, 2012 at 10:52 AM, Sylvester Keil <[email protected]> wrote: > > > On Jan 20, 2012, at 4:47 PM, Rintze Zelle wrote: > > > On Fri, Jan 20, 2012 at 10:05 AM, Bruce D'Arcus <[email protected]> wrote: > > On Fri, Jan 20, 2012 at 9:35 AM, Sylvester Keil <[email protected]> > > wrote: > > > > > I wrote anystyle-parser as a freecite replacement; my idea, going > > > forward, was to turn it into a web service, like freecite, too. The ML > > > model and the feature dictionary was optimized for my use cases, but > > > could be easily improved. > > > > So just to clarify, the relevance here is in this approach, we'd need > > a really smart parser, that would allow us to deconstruct a formatting > > bibliographic entry into their component parts, and then to match that > > against CSL macros fragments, to piece together a new style. > > > > With the caveat that formatted bibliographic entries are often lossy, so it > > might be desirable to parse the entry, use the component parts to identify > > the item (e.g. via CrossRef's lookup tools), retrieve more complete > > bibliographic data for the item (e.g., once you know the DOI, you could > > resolve it and scrape that page; this could be done with server-side Zotero > > translators: > > http://forums.zotero.org/discussion/19458/translators-server-side/ ), and > > run that more complete bibliographic data through the CSL processor for all > > styles. > > Absolutely. I don't think it's feasible to parse bibliographies 'perfectly'. > To have a suitably good parser and combine it with discovery tools is a > really good idea. > > However, it doesn't help if the data you need to parse isn't really available > online.
A similar approach, that doesn't rely on free text parsing, is what Dan Stillman originally suggested: have predefined formatted data, and allow users to modify it. Of course, there are a range of ways to approach this as well (pure free text, pop-up tokens that allow one to select from options, etc.). Bruce ------------------------------------------------------------------------------ Keep Your Developer Skills Current with LearnDevNow! The most comprehensive online learning library for Microsoft developers is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3, Metro Style Apps, more. Free future releases when you subscribe now! http://p.sf.net/sfu/learndevnow-d2d _______________________________________________ xbiblio-devel mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/xbiblio-devel
