Re: Pretty print problem in serializer in 1.0.1

Scott Boag/CAM/Lotus 7 Jan 2000 22:35:01 -0000

> Well sorry to disagree but I think the other way is better. Ignorable
> whitespace should be included by defaults.


+1.  By no means should the parser muck with whitespace as the default.

-scott




                                                                                
                                   
                    Arnaud Le                                                   
                                   
                    Hors                 To:     [EMAIL PROTECTED]              
                           
                    <[EMAIL PROTECTED]        cc:     (bcc: Scott 
Boag/CAM/Lotus)                                       
                    m.com>               Subject:     Re: Pretty print problem 
in serializer in 1.0.1              
                                                                                
                                   
                    01/07/00                                                    
                                   
                    04:56 PM                                                    
                                   
                    Please                                                      
                                   
                    respond to                                                  
                                   
                    xerces-dev                                                  
                                   
                                                                                
                                   
                                                                                
                                   




Assaf Arkin wrote:
>
> +1 from me :-)
>
> arkin
>
> Andy Clark wrote:
> >
> > Assaf Arkin wrote:
> > >
> > > Ignorable whitespace, not just whitespace.
> >
> > Good point.
> >
> > > I think the feature should be "include ignorable whitespace as text
> > > nodes" and off by default.

Well sorry to disagree but I think the other way is better. Ignorable
whitespace should be included by defaults. The main reason for that is:

By the XML 1.0 spec validating XML processors are required to provide
this information [1], and this is confirmed by the XML Infoset spec [2].

[1] http://www.w3.org/TR/REC-xml#sec-white-space
[2] http://www.w3.org/TR/1999/WD-xml-infoset-19991220#infoitem.character

In addition, it seems desirable that the tree doesn't change too much
whether the validation is on or off.

While DOM Level 2 doesn't expose the fact that there are Text nodes
containing such whitespace (commonly called "ignorable whitespace"), it
is on the issues list for DOM Level 3.

> > So how about the following ID?
> >
> >   http://apache.org/xml/features/dom/include-ignorable-whitespace
> >
> > It's getting a little verbose but that's probably erring on
> > the side of understandability.

I don't care much what the name is.

Besides, Assaf, I really think that a serializer ought to look into the
Text nodes and deal with the fact that there may be some leading and
trailing whitespace. I consider not doing so a bug, since it makes all
of your efforts to smartly indent the nodes moot.
--
Arnaud  Le Hors - IBM Cupertino, XML Technology Group

Re: Pretty print problem in serializer in 1.0.1

Reply via email to