Re: Importing and Exporting XML

Alessandro Bologna Wed, 13 Jun 2007 04:08:55 -0700

Wooly,

I agree with what Dan said. Another approach to customize your importand export that you may find useful can be to use (and extendaccordingly) a couple of classes found in the contrib section of theJackrabbit SVN.http://svn.apache.org/repos/asf/jackrabbit/trunk/contrib/jcr-ext/src/main/java/org/apache/jackrabbit/xml

In particular, you may want to look at the DocumentViewExportVisitorand DocumentViewImportVisitor. You need to understand how SAX parsingworks a bit, but it should be easy enough to customize them to selectwhat attributes you want to see in your DocumentView export, and howto map your XML into Jackrabbit nodes and properties.Incidentally, those classes supports multivalued propertiesrepresented as space separated attributes.

About the SystemView, it is really not meant to be human readable,more like machine readable (hence the name SystemView), and can be abit misleading, because it breaks the node/element mapping that JCRsupports (in the JSR-170 specs is defined as VirtualDocument). Inother words, the xpath expression "/foo/bar" in system view becomessomething like /sv:[EMAIL PROTECTED]:name='foo']/sv:[EMAIL PROTECTED]:name='bar']/sv:value Or something like that...

In my current project, I have implement a REST type interface to theJCR, so that the XSLT document() element can be used to access JCRsubtrees, and it is very important to preserve the mapping betweenxpath in XML fragments (constructed using theDocumentViewExportVisitor) and the xpath in the JCR, so theSystemView does not really apply.


Hope it helps.
Alessandro



On Jun 13, 2007, at 6:16 AM, Dan Connelly wrote:

woolly:
The term "DocumentView" is slightly misleading. Its more like aShredded And Annotated Document View.
The xml document will get shredded into its constituent elementnodes when you import it as "DocumentView". This import will notstore a single, coherent document in the Repository. WebDavsupport in Jackrabbit, on the other hand, can be used to store thedocument as coherent text. Customized, hybrid approaches arepossible to support structured content (partial shredding overWebDav). It depends on your use case how much (or how little)shredding you want.
The metadata gets added to raw shreds during DocumentView import toindicate the Jackrabbit element node type structure. By default,node type will be nt:unstructured on raw nodes (not having metadataalready). You can write a simple XSLT to strip out the metadatawhen you export. For import you can work this in reverse and adda custom structure using XSLT (but that may not be simple).
It sounds like your use case (customized node editing) requiressome custom node types. This can work nicely if the set ofelement tags is limited and fixed. Also, you probably also willneed to add some custom xml processing (dom, sax or xslt).
What xml editor are you using? I think XML Spy has integrationfeatures that would support partial shredding and customizeddocument views. (But, I have never worked this.)
   -- Dan Connelly




woolly wrote:
Hi all,
Is it possible to import xml into a node, and then export that xmlback out
to have the same xml-equivalent file? At the moment I'm trying:

fis = new FileInputStream(inputFile);
session.importXML(node.getPath(), fis,
ImportUUIDBehavior.IMPORT_UUID_CREATE_NEW);
fis.close();

// followed by....
out = new FileOutputStream(outputFile);
session.exportDocumentView(node.getPath(), out, true, false);
The difference between inputFile and outputFile seems to be thatthere are
some additional jcr specific attributes. Is this necessary?
What I'm really trying to do is manage an xml document (eventuallymany xml
documents), allow people to make changes to only certain parts of it,
versioning those parts and using other JackRabbit features. Isthis the kindof thing that JackRabbit was intended for? Or should I just loadthe xmldocument in as a property of a node and deal with the other thingsmyself?
Thanks for any help,

Phil.

Re: Importing and Exporting XML

Reply via email to