Re: line and column of an element/DOMNode?

4pzbrog02 Sat, 12 Aug 2006 04:06:36 -0700

Thanks for all the flowers :) I was just at that time working on aproject that uses the parser heavily, and it's abstracted enough so Icould simply copy and paste it into an email - Xerces is free, and alsolives from people donating code, so I thought I give something back.

I was suggesting the hash table solution because of the many allocationsthe current implementation makes (it uses std::string for a start, andallocates a Tag for each DOMElement). You could create a hash table thatonly allocates one large block upfront and then fills it up until it'sfull, and after that goes on and resizes it if more Tags come along. Oh,and some std::string implementations might actually use copy-on-write;in this case you don't really have a memory footprint problem with thesystemID.

Storing a pointer to the parent's systemID is in principal not a badsolution, however, it makes the implementation more complicated, sinceyou could import nodes from other DOMDocuments, so you'd end up withTags in document A pointing to Tags in document B, which could createproblems when you delete B (for instance by deleting it's parent parser).

But feel free to change it, that's why I posted it here ;) I originallyonly wanted something that works, without paying too much attention toit's memory footprint or performance (Other parts of my project demandmuch more attention in these and other aspects *g*)


Glad I could help,

Cheers,

Uwe

Michael Weitzel michael.weitzel-at-uni-siegen.de |xerces-c-users mailinglist| schrieb:

Hi Uwe,

I am really impressed by the speed your response and the universality of
your solution. Your TaggingDOMParser works just fine. Many thanks :-)

I agree that it's a common concern to locate the source of an error for
any context sensitive application (IDREFs are useful but this method is
too weak because of its global scoping). Maybe this is a problem of the
DOM standard ...

I think your solution is just fine. Wouldn't a hash table for the tags
create additional overhead? I will remove the SystemID from the Tags to save
memory. It is redundant to store it in every Tag. Maybe it should be replaced
by a pointer to the parent's SystemID -- similar to the static scoping found
in programming languages with nested blocks where a variable is searched in
the surrounding ("parent") blocks when is can't be located in the current
block...).

Thanks again :-)

Am Freitag, den 11. August 2006, um 18:07h schrieb [EMAIL PROTECTED]:
First of all, this is one of these recurring questions that are askedall over again and again. Maybe the answer is put in a place soprominent (FAQ?) that it doesn't occur any more (I rember that I wasasking the same question as well a couple of months ago)
You will have to add tagging objects to each DOMElement while you letthe DOMParser parse the XML file. So basically, what you do is is toderive a class from the DOMParser and override it's startElementfunction (yep, it is actually a SAXParser as well). Another option mightbe to maintain a hashtable with pointers to DOMNodes as keys, but Ididn't do that (hmm, might not be too bad of an idea, maybe next time ;)
The Tags need to be refernce counted if you want to clone nodes (hmm, Iwonder if I do that in my project, actually), otherwise you'll end upwith nasty lifetime issues for your Tag objects. For this reason, Tagsare implemented with their own specific DataHandler, which takes care ofthese lifetime issues.
However, I hope that the code I'm submitting here will answer it onceand for all (although I don't guarantee it to be perfect nor performant,it just works for what I do, if it doesn't work for you it's yourproblem - and <disclaimer>I DONT TAKE RESPONSIBILITY FOR ANY PROBLEMSCAUSED BY IT</disclaimer> ;)
Find snippets of the code I am using to do this.
I hope this code is somewhat useful to you Michael, and also everyoneelse who stumbles over this problem. It that seems all too simple sothat one would assume Xerces can do it out of the box, but unfortunatelyit can't.
Cheers,

Uwe

------------ from TaggingDOMParser.hpp:
[...]
-------------------
Oh, and StrX() is actually just a helper class that takes and XMLCh* init's contructor and stores the UTF8-transcoded version, which can beobtained by it's getString() method; you might find a similarimplementation in Xerces' examples
Hope this helps, feedback welcome.

Cheers,

Uwe
is there a way to determine the line and column of the XML file
associated with a specific DOMNode / element? The data in my XML format
requires a few complex semantic validations that cannot be expressed by
the DTD. The simpler errors can be detected based on the context while
traversing the DOM tree and it would be nice to give more specific error
messages.

Am I right that "DOMLocator" cannot be used since no "DOMError" occurs?

Re: line and column of an element/DOMNode?

Reply via email to