On Thu, Jun 14, 2007 at 09:20:30PM +0200, Torsten Mohr wrote:
> Hi,
> 
> thanks a lot for the answers i got to my question.  It pretty much
> explained the behaviour.  My understandig of XML was different before.
> 
> I'd like to use XML as a file format for a program i want to write
> and in that file format i'd like to ignore everything that is not
> XML.  Below you describe that by using a DTD i can skip the unwanted
> whitespaces when reading the XML (if i understand correctly).
> 
> I just googled for some DTD descriptions and the descriptions i found
> explained a lot.  But i would not have a clue on how to ignore whitespace
> that comes from e.g. xmllint --format in.xml > out.xml.
> 
> Could anybody please give me a hint on what options to set or what DTD
> to use so that a formatted and an XML with all tags in one line without
> whitespaces give the same result?  Is it possible to place such a DTD as
> a string into memory?

  In general no. Please do not try to assume you will be able to get libxml2
to ignore data. This may work or not, and the DTD is usually not a garantee
because document are usually not valid. Instead of trying to build a dangerous
pile of assumtion to try to avoid processing a few nodes, please code the
full algorithm, and skip those nodes there. You will avoid wasting a lot
of time on design, coding, testing and when your users actually start
to use the code. It's not like testing if a node is text and just white
spaces is hard so what ???

Daniel

-- 
Red Hat Virtualization group http://redhat.com/virtualization/
Daniel Veillard      | virtualization library  http://libvirt.org/
[EMAIL PROTECTED]  | libxml GNOME XML XSLT toolkit  http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine  http://rpmfind.net/
_______________________________________________
xml mailing list, project page  http://xmlsoft.org/
[email protected]
http://mail.gnome.org/mailman/listinfo/xml

Reply via email to