On Fri, Jan 20, 2012 at 10:52 AM, Sylvester Keil <[email protected]> wrote:
>
>
> On Jan 20, 2012, at 4:47 PM, Rintze Zelle wrote:
>
> > On Fri, Jan 20, 2012 at 10:05 AM, Bruce D'Arcus <[email protected]> wrote:
> > On Fri, Jan 20, 2012 at 9:35 AM, Sylvester Keil <[email protected]> 
> > wrote:
> >
> > > I wrote anystyle-parser as a freecite replacement; my idea, going 
> > > forward, was to turn it into a web service, like freecite, too. The ML 
> > > model and the feature dictionary was optimized for my use cases, but 
> > > could be easily improved.
> >
> > So just to clarify, the relevance here is in this approach, we'd need
> > a really smart parser, that would allow us to deconstruct a formatting
> > bibliographic entry into their component parts, and then to match that
> > against CSL macros fragments, to piece together a new style.
> >
> > With the caveat that formatted bibliographic entries are often lossy, so it 
> > might be desirable to parse the entry, use the component parts to identify 
> > the item (e.g. via CrossRef's lookup tools), retrieve more complete 
> > bibliographic data for the item (e.g., once you know the DOI, you could 
> > resolve it and scrape that page; this could be done with server-side Zotero 
> > translators: 
> > http://forums.zotero.org/discussion/19458/translators-server-side/ ), and 
> > run that more complete bibliographic data through the CSL processor for all 
> > styles.
>
> Absolutely. I don't think it's feasible to parse bibliographies 'perfectly'. 
> To have a suitably good parser and combine it with discovery tools is a 
> really good idea.
>
> However, it doesn't help if the data you need to parse isn't really available 
> online.

A similar approach, that doesn't rely on free text parsing, is what
Dan Stillman originally suggested: have predefined formatted data, and
allow users to modify it. Of course, there are a range of ways to
approach this as well (pure free text, pop-up tokens that allow one to
select from options, etc.).

Bruce

------------------------------------------------------------------------------
Keep Your Developer Skills Current with LearnDevNow!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-d2d
_______________________________________________
xbiblio-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/xbiblio-devel

Reply via email to