I very much like the idea of doing the lion's share of the thinking in the RDF world, but there are some cases for configuration, particularly when performance is a concern.
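
To make the configuration point concrete, here is a minimal sketch in Python of a converter where switches decide which parts of the XML get materialized as triples. Everything in it is an assumption made for illustration: the `Config` class, the `xml_to_triples` function, the `http://example.org/` predicates and the blank-node naming are hypothetical and are not the API or the mapping of XmlToRdf or Gloze.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass


@dataclass
class Config:
    # Hypothetical switches, invented for this sketch.
    emit_attributes: bool = True   # materialize XML attributes as triples
    emit_text: bool = True         # materialize element text as triples


def xml_to_triples(xml_text, config=None, base="http://example.org/"):
    """Yield (subject, predicate, object) tuples for an XML document.

    Subjects are blank-node labels; the predicates are made-up URIs.
    """
    config = config or Config()
    root = ET.fromstring(xml_text)
    counter = 0

    def walk(elem, parent):
        nonlocal counter
        counter += 1
        node = f"_:n{counter}"
        if parent is not None:
            yield (parent, base + "hasChild", node)
        yield (node, base + "elementName", elem.tag)
        # Skipped entirely when the switch is off, so no work is spent on it.
        if config.emit_attributes:
            for name, value in elem.attrib.items():
                yield (node, base + "attr/" + name, value)
        if config.emit_text and elem.text and elem.text.strip():
            yield (node, base + "text", elem.text.strip())
        for child in elem:
            yield from walk(child, node)

    yield from walk(root, None)


doc = '<book id="1"><title lang="en">RDF</title></book>'
full = list(xml_to_triples(doc))
lean = list(xml_to_triples(doc, Config(emit_attributes=False)))
print(len(full), len(lean))  # prints "6 4"
```

With attributes switched off the same document yields four triples instead of six. The alternative is to emit everything and filter afterwards with RDF tools, which keeps the mapping canonical but pays for triples nobody asked for.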

In anything that is data-rich, people are going to process as much data as they feel comfortable with, so no matter how fast it is, transformation speed is going to be a limiting factor for some people. The whole XML infoset has stuff in it that most people neither know nor care about, and you don't want to materialize it all for everything, although somebody will probably want to materialize any particular part of it some of the time.

Another interesting question is how to merge in knowledge from XML Schema. For instance, XML Schema is not just a validation schema, it is also an inference schema, in that you can infer default values from an XML Schema. Also, of course, an XML Schema (any schema) has documentation value, and generating something like an RDFS schema for the translated RDF would also be a good thing.

On Fri, Jul 29, 2016 at 10:22 AM, Jean-Marc Vanel <[email protected]> wrote:

> Thanks for sharing
>
> There is also Gloze.
> The use case of Gloze is different from what I understand of your XmlToRdf.
> Basically Gloze has no configuration.
> There is a canonical XML to RDF mapping.
> So you get easily RDF out of your XML, and then you use RDF tools like
> SPARQL or N3 (Euler/EYE) to transform the resulting RDF.
>
> The article:
> http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.88.8929&rep=rep1&type=pdf
> The source code:
> https://github.com/stevebattle/Gloze
> I have made enhancements to Gloze in another project that are maybe not in
> Github :( .
>
>
> 2016-07-29 10:28 GMT+02:00 Håvard Mikkelsen Ottestad <[email protected]>:
>
> > Hi,
> >
> > I just wanted to give some publicity to a library I have worked on for
> > some time. An XML to RDF Java library (open source / apache 2) that’s
> > compatible with Jena.
> >
> > It’s blazingly fast and highly configurable. Available on GitHub
> > https://github.com/AcandoNorway/XmlToRdf and on Maven
> > http://mvnrepository.com/artifact/no.acando/xmltordf
> >
> > Regards,
> > Håvard M. Ottestad
> >
>
>
> --
> Jean-Marc Vanel
> Déductions SARL - Consulting, services, training,
> Rule-based programming, Semantic Web
> http://deductions-software.com/
> +33 (0)6 89 16 29 52
> Twitter: @jmvanel , @jmvanel_fr ; chat: irc://irc.freenode.net#eulergui
>

--
Paul Houle

*Applying Schemas for Natural Language Processing, Distributed Systems, Classification and Text Mining and Data Lakes*

(607) 539 6254    paul.houle on Skype    [email protected]

:BaseKB -- Query Freebase Data With SPARQL
http://basekb.com/gold/

Legal Entity Identifier Lookup
https://legalentityidentifier.info/lei/lookup/

Join our Data Lakes group on LinkedIn
https://www.linkedin.com/grp/home?gid=8267275
