Hi David,

Thank you for your reply.

Regarding your assumption on what I meant by "synchronize"... yes you are 
completely correct.

{bq}I think this is more a question for Jena Developers{bq}
I had my fears that you would say that ;) however your final comments are both 
very helpful and interesting. I did not know about property documents, this 
might be a way to at least create *a* link of sorts which is required to 
associate the XML doc with its sister RDF.

Thank you

Lewis
________________________________________
From: [email protected] 
[[email protected]] On Behalf Of David Lee 
[[email protected]]
Sent: 22 November 2012 17:01
To: MarkLogic Developer Discussion
Subject: Re: [MarkLogic Dev General] Synchronizing content from heterogeneous   
data stores

By "synchronize" am I to assume that at some later date you will have new XML 
documents and will rerun the XProc pipeline, produce new RDF and store those 
into Jena ?
You want to update Jena with the new RDF values AND delete ones that no longer 
are present ?

I think this is more a question for Jena Developers ... I would guess you would 
want to tag the RDF values somehow so that you knew which ones to replace.   I 
am not sure how MarkLogic will help (or hinder) this process as everything is 
done prior to be put into the ML Database ...
Unless you have a way of also storing the RDF ID's associated with the XML 
document.  That could possibly be put in a property document of the XML 
Document ... but I dont know how you identify RDF triples in Jena.



-----------------------------------------------------------------------------
David Lee
Lead Engineer
MarkLogic Corporation
[email protected]
Phone: +1 812-482-5224
Cell:  +1 812-630-7622
www.marklogic.com


-----Original Message-----
From: [email protected] 
[mailto:[email protected]] On Behalf Of McGibbney, Lewis 
John
Sent: Thursday, November 22, 2012 11:11 AM
To: [email protected]
Subject: [MarkLogic Dev General] Synchronizing content from heterogeneous data 
stores

Hi All,

Currently I have a stack of XML documents in MarkLogic. They get there via an 
XProc pipeline.
I am currently working to run some custom parsers on the XML (within the 
pipeline) *just* before it gets inserted into MarkLogic.
The parsers extract RDF relationships (triples) from the XML content and I 
would like to send this extracted structure to a triple store (e.g. Jena TDB).
The idea is then to build my application on top of MarkLogic and use the 
triples to compliment structured or text based queries within the search 
application.

Currently I would really appreciate some need clarification on one major area 
if possible...

How would/could I synchronize the an individual XML document and its associated 
triple graph within triple store? This is my major area of confusion. I am 
really curious to hear from anyone out there who has attempted anything similar.

Thanks very much for any feedback on this one, I realize it is a pretty lengthy 
question but any suggestions would be great.

All the best

Lewis

Glasgow Caledonian University is a registered Scottish charity, number SC021474

Winner: Times Higher Education’s Widening Participation Initiative of the Year 
2009 and Herald Society’s Education Initiative of the Year 2009.
http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,6219,en.html

Winner: Times Higher Education’s Outstanding Support for Early Career 
Researchers of the Year 2010, GCU as a lead with Universities Scotland partners.
http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,15691,en.html
_______________________________________________
General mailing list
[email protected]
http://developer.marklogic.com/mailman/listinfo/general
_______________________________________________
General mailing list
[email protected]
http://developer.marklogic.com/mailman/listinfo/general

Glasgow Caledonian University is a registered Scottish charity, number SC021474

Winner: Times Higher Education’s Widening Participation Initiative of the Year 
2009 and Herald Society’s Education Initiative of the Year 2009.
http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,6219,en.html

Winner: Times Higher Education’s Outstanding Support for Early Career 
Researchers of the Year 2010, GCU as a lead with Universities Scotland partners.
http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,15691,en.html
_______________________________________________
General mailing list
[email protected]
http://developer.marklogic.com/mailman/listinfo/general

Reply via email to