I did get rid of the '&', but still have the problem with  CRLF. The part of 
the XML wich I want to process shouls have the same number of:

<desc> .... </desc>
<label_text> ... </label_text>

I've decided to, at least for now, abandon SAX and use J to extract the data 
which I want.  

-----Original Message-----
From: [email protected] 
[mailto:[email protected]] On Behalf Of Devon McCormick
Sent: Monday, August 11, 2014 13:44
To: J-programming forum
Subject: Re: [Jprogramming] Help for SAX addon

Tom - if you pre-process your file to ensure all "&"s are "&amp;", does SAX 
still get messed up?


On Mon, Aug 11, 2014 at 12:53 PM, bill lam <[email protected]> wrote:

> I guess & should be escaped as &amp; so that your xml data is 
> malformed.
>
> linefeed should be regarded as whitespace, not sure whether it is a 
> bug in sax addon or a feature of sax dll.
>
> Пн, 11 авг 2014, Tom Arneson написал(а):
> > I discovered that an '&' in a tagged field also messes up the SAX paser.
> >
> > -----Original Message-----
> > From: [email protected]
> > [mailto:[email protected]] On Behalf Of Tom
> Arneson
> > Sent: Sunday, August 10, 2014 21:02
> > To: [email protected]
> > Subject: [Jprogramming] Help for SAX addon
> >
> > I have a need to parse XML data. So I'm trying the SAX addon. It is 
> > not
> in
> > the 802 addon, so I got it from the JAL/801 addon.
> >
> > Part of the XML data
> > ======================================
> > <wpt lat="41.08439000" lon="-93.50432000"> 
> > <time>2014-08-10T11:28:59.289Z</time>
> > <name>75PIJU</name>
> > <desc>420 f w:2:05-06:Bahamas
> > </desc>
> > <link 
> > href="http://geovexilla.gpsgames.org/cgi-bin/vx.pl?wp=VX75-PIJU";>
> > <text>VX75-PIJU</text>
> > <type>text/html</type>
> > </link>
> > <sym>City (Medium)</sym>
> > <type>3</type>
> > <extensions>
> > <time
> > xmlns="http://www.topografix.com/GPX/gpx_modified/0/1
> ">2014-08-10T11:52:16.6
> > 80Z</time>
> > <label xmlns="http://www.topografix.com/GPX/gpx_overlay/0/3";>
> > <label_text>75PIJU</label_text>
> > <text xmlns="http://www.topografix.com/GPX/gpx_style/0/2";>
> > <font>
> > <family generic="sans-serif">
> > <face>Arial</face>
> > </family>
> > </font>
> > </text>
> > </label>
> > </extensions>
> > </wpt>
> > ========================================
> >
> > Note that there is a CRLF at the end of the string between the  
> > <desc> </desc> tags
> >
> > When this is parsed I get:
> > ______________________
> > │420 f w:2:05-06:Bahamas│ │
> >                                                        ^boxed LF the 
> > CRLF causes <10{a to be appended to the result list.
> >
> > NB. My script to parse (modified from sax_test4.ijs) NB.
> > =========================================================
> > NB. return a boxed list of the field with tag TAG NB.
> > =========================================================
> >
> >
> > require 'xml/sax'
> >
> > saxclass 'psaxtag'
> >
> > TAG=: 'desc'
> >
> > startDocument=: 3 : 'PATH=: R=: $0'
> > endDocument=:   3 : 'R'
> > startElement=:  4 : 'PATH=: PATH,<y'
> > endElement=:    3 : 'PATH=: }:PATH'
> > characters=:    3 : 'if. TAG -: >@{: PATH do. R=: R,<y end.'
> >
> >
> > NB. =========================================================
> > cocurrent 'base'
> >
> > desc=: process_psaxtag_ z
> >
> > TAG_psaxtag_ =: 'label_text'
> >
> > name=: process_psaxtag_ z
> >
> > --------------------------------------------------------------------
> > -- For information about J forums see 
> > http://www.jsoftware.com/forums.htm
> >
> > --------------------------------------------------------------------
> > -- For information about J forums see 
> > http://www.jsoftware.com/forums.htm
>
> --
> regards,
> ====================================================
> GPG key 1024D/4434BAB3 2008-08-24
> gpg --keyserver subkeys.pgp.net --recv-keys 4434BAB3 gpg --keyserver 
> subkeys.pgp.net --armor --export 4434BAB3
> ----------------------------------------------------------------------
> For information about J forums see http://www.jsoftware.com/forums.htm
>



--
Devon McCormick, CFA
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm

----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm

Reply via email to