By "synchronize" am I to assume that at some later date you will have new XML documents and will rerun the XProc pipeline, produce new RDF and store those into Jena ? You want to update Jena with the new RDF values AND delete ones that no longer are present ?
I think this is more a question for Jena Developers ... I would guess you would want to tag the RDF values somehow so that you knew which ones to replace. I am not sure how MarkLogic will help (or hinder) this process as everything is done prior to be put into the ML Database ... Unless you have a way of also storing the RDF ID's associated with the XML document. That could possibly be put in a property document of the XML Document ... but I dont know how you identify RDF triples in Jena. ----------------------------------------------------------------------------- David Lee Lead Engineer MarkLogic Corporation [email protected] Phone: +1 812-482-5224 Cell: +1 812-630-7622 www.marklogic.com -----Original Message----- From: [email protected] [mailto:[email protected]] On Behalf Of McGibbney, Lewis John Sent: Thursday, November 22, 2012 11:11 AM To: [email protected] Subject: [MarkLogic Dev General] Synchronizing content from heterogeneous data stores Hi All, Currently I have a stack of XML documents in MarkLogic. They get there via an XProc pipeline. I am currently working to run some custom parsers on the XML (within the pipeline) *just* before it gets inserted into MarkLogic. The parsers extract RDF relationships (triples) from the XML content and I would like to send this extracted structure to a triple store (e.g. Jena TDB). The idea is then to build my application on top of MarkLogic and use the triples to compliment structured or text based queries within the search application. Currently I would really appreciate some need clarification on one major area if possible... How would/could I synchronize the an individual XML document and its associated triple graph within triple store? This is my major area of confusion. I am really curious to hear from anyone out there who has attempted anything similar. Thanks very much for any feedback on this one, I realize it is a pretty lengthy question but any suggestions would be great. All the best Lewis Glasgow Caledonian University is a registered Scottish charity, number SC021474 Winner: Times Higher Education’s Widening Participation Initiative of the Year 2009 and Herald Society’s Education Initiative of the Year 2009. http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,6219,en.html Winner: Times Higher Education’s Outstanding Support for Early Career Researchers of the Year 2010, GCU as a lead with Universities Scotland partners. http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,15691,en.html _______________________________________________ General mailing list [email protected] http://developer.marklogic.com/mailman/listinfo/general _______________________________________________ General mailing list [email protected] http://developer.marklogic.com/mailman/listinfo/general
