RE: splits in character data using SAX

Vincent Finn Wed, 25 Feb 2004 02:38:12 -0800

OK, 
That's the way BJ suggested and the way I thought I'd have to do it.
Oh well, not very elegant but it'll have to do


        Thanks, Vin

> -----Original Message-----
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
> Sent: 24 February 2004 16:34
> To: [EMAIL PROTECTED]
> Subject: RE: splits in character data using SAX
> 
> 
> 
> 
> 
> 
> You save all of the text in your own buffer, until you get an 
> event that's
> not characters().
> 
> I've never used progressive parsing, and there's no real standard, so
> you'll have to experiment with it to see how it works.
> 
> Dave
> 
> 
> 
>                                                               
>                                                               
>                           
>                       "Vincent Finn"                          
>                                                               
>                           
>                       <[EMAIL PROTECTED]         To:      
> <[EMAIL PROTECTED]>                                 
>                                
>                       omsoft.com>               cc:      
> (bcc: David N Bertoni/Cambridge/IBM)                          
>                                
>                                                 Subject: RE: 
> splits in character data using SAX                            
>                            
>                       02/23/2004 10:03                        
>                                                               
>                           
>                       AM                                      
>                                                               
>                           
>                       Please respond to                       
>                                                               
>                           
>                       xerces-c-dev                            
>                                                               
>                           
>                                                               
>                                                               
>                           
> 
> 
> 
> so what is the best strategy for getting a complete text?
> 
> Do I have to do multiple calls to parseNext whenever I hit a text node
> until I hit an endElement?
> 
> 
> > -----Original Message-----
> > From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
> > Sent: 23 February 2004 17:50
> > To: [EMAIL PROTECTED]
> > Subject: Re: splits in character data using SAX
> >
> >
> >
> >
> >
> >
> > Hi,
> >
> > The parser will also make multiple calls to the characters
> > function when it
> > encounters an entity reference, so your plan is likely
> > unworkable.  Also,
> > it's quite possible the buffer size could change in the 
> future, or the
> > parser's behavior could change for some other reason.
> >
> > Dave
> >
> >
> >
> >
> >
> >
> >                       "Vincent Finn"
> >
> >
> >                       <[EMAIL PROTECTED]         To:
> > <[EMAIL PROTECTED]>
> >
> >                       omsoft.com>               cc:
> > (bcc: David N Bertoni/Cambridge/IBM)
> >
> >                                                 Subject:
> > splits in character data using SAX
> >
> >                       02/23/2004 09:29
> >
> >
> >                       AM
> >
> >
> >                       Please respond to
> >
> >
> >                       xerces-c-dev
> >
> >
> >
> >
> >
> >
> >
> >
> > Hi,
> >
> > I have a question about the way DefaultHandler::characters()
> > can send to
> > chunks for a single text element
> >
> > I found an answer of sorts in the mail archive but I am
> > hoping for a bit
> > more certainty
> >
> > The question is when will the characters function split data?
> > Are there specific rules?
> >
> > What I am hoping for is to be able to make sure this will 
> never happen
> > so I can ignore it rathger than having to write extra code.
> >
> > The archive link is this
> > http://marc.theaimsgroup.com/?l=xerces-c-dev&m=100076161808793&w=2
> >
> > essentially that there may be a split if the buffer is full
> > in which case can anyone tell me the buffer size?
> >
> > then and the second reply mentions running into child nodes
> > which has me
> > slightly confused
> > it mentions that a split may happen if a child node is 
> found, but text
> > nodes don't have child nodes
> > have I missed the point of the function?
> > I was working with it as a method of getting the value of text nodes
> >
> >              Vin

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

RE: splits in character data using SAX

Reply via email to