Hi Matteo,

For MODS, MIX, and other XML-based metadata schemas, I'd suggest XSLT is probably a more appropriate language than Java.

Conal

On 18/01/12 01:12, Matteo Bertazzo wrote:
> Hi all,
> we are currently analyzing the "usual" MD indexing process using XSLT
> transformations to create SOLR documents.
> Considering the new integration achieved between GSearch 2.4 and Tika we're
> wondering about the opportunity to streamline the indexing process and move
> the MD indexing process on Tika.
> Tika already support DC documents through a DcXMLParser class and we're
> evaluating the opportunity to implement (Java) a set of custom parsers in
> order to support other MD schema (MODS, MIX, etc).
> What do you think about this approach?
> Is there anyone who has already thought about or started a similar
> development?
>
> All the best,
> Matteo

--
Conal Tuohy
eResearch Business Analyst
Victorian eResearch Strategic Initiative
+61-466324297

_______________________________________________
Fedora-commons-users mailing list
Fedora-commons-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/fedora-commons-users