perhaps you could also pre-process the xml to replace all CRLF with space. IIRC Raul Miller said sax addon did not work on 64-bit windows, ymmv. On Aug 12, 2014 4:43 AM, "Tom Arneson" <[email protected]> wrote:
> I did get rid of the '&', but still have the problem with CRLF. The part > of the XML wich I want to process shouls have the same number of: > > <desc> .... </desc> > <label_text> ... </label_text> > > I've decided to, at least for now, abandon SAX and use J to extract the > data which I want. > > -----Original Message----- > From: [email protected] [mailto: > [email protected]] On Behalf Of Devon McCormick > Sent: Monday, August 11, 2014 13:44 > To: J-programming forum > Subject: Re: [Jprogramming] Help for SAX addon > > Tom - if you pre-process your file to ensure all "&"s are "&", does > SAX still get messed up? > > > On Mon, Aug 11, 2014 at 12:53 PM, bill lam <[email protected]> wrote: > > > I guess & should be escaped as & so that your xml data is > > malformed. > > > > linefeed should be regarded as whitespace, not sure whether it is a > > bug in sax addon or a feature of sax dll. > > > > Пн, 11 авг 2014, Tom Arneson написал(а): > > > I discovered that an '&' in a tagged field also messes up the SAX > paser. > > > > > > -----Original Message----- > > > From: [email protected] > > > [mailto:[email protected]] On Behalf Of Tom > > Arneson > > > Sent: Sunday, August 10, 2014 21:02 > > > To: [email protected] > > > Subject: [Jprogramming] Help for SAX addon > > > > > > I have a need to parse XML data. So I'm trying the SAX addon. It is > > > not > > in > > > the 802 addon, so I got it from the JAL/801 addon. > > > > > > Part of the XML data > > > ====================================== > > > <wpt lat="41.08439000" lon="-93.50432000"> > > > <time>2014-08-10T11:28:59.289Z</time> > > > <name>75PIJU</name> > > > <desc>420 f w:2:05-06:Bahamas > > > </desc> > > > <link > > > href="http://geovexilla.gpsgames.org/cgi-bin/vx.pl?wp=VX75-PIJU"> > > > <text>VX75-PIJU</text> > > > <type>text/html</type> > > > </link> > > > <sym>City (Medium)</sym> > > > <type>3</type> > > > <extensions> > > > <time > > > xmlns="http://www.topografix.com/GPX/gpx_modified/0/1 > > ">2014-08-10T11:52:16.6 > > > 80Z</time> > > > <label xmlns="http://www.topografix.com/GPX/gpx_overlay/0/3"> > > > <label_text>75PIJU</label_text> > > > <text xmlns="http://www.topografix.com/GPX/gpx_style/0/2"> > > > <font> > > > <family generic="sans-serif"> > > > <face>Arial</face> > > > </family> > > > </font> > > > </text> > > > </label> > > > </extensions> > > > </wpt> > > > ======================================== > > > > > > Note that there is a CRLF at the end of the string between the > > > <desc> </desc> tags > > > > > > When this is parsed I get: > > > ______________________ > > > │420 f w:2:05-06:Bahamas│ │ > > > ^boxed LF the > > > CRLF causes <10{a to be appended to the result list. > > > > > > NB. My script to parse (modified from sax_test4.ijs) NB. > > > ========================================================= > > > NB. return a boxed list of the field with tag TAG NB. > > > ========================================================= > > > > > > > > > require 'xml/sax' > > > > > > saxclass 'psaxtag' > > > > > > TAG=: 'desc' > > > > > > startDocument=: 3 : 'PATH=: R=: $0' > > > endDocument=: 3 : 'R' > > > startElement=: 4 : 'PATH=: PATH,<y' > > > endElement=: 3 : 'PATH=: }:PATH' > > > characters=: 3 : 'if. TAG -: >@{: PATH do. R=: R,<y end.' > > > > > > > > > NB. ========================================================= > > > cocurrent 'base' > > > > > > desc=: process_psaxtag_ z > > > > > > TAG_psaxtag_ =: 'label_text' > > > > > > name=: process_psaxtag_ z > > > > > > -------------------------------------------------------------------- > > > -- For information about J forums see > > > http://www.jsoftware.com/forums.htm > > > > > > -------------------------------------------------------------------- > > > -- For information about J forums see > > > http://www.jsoftware.com/forums.htm > > > > -- > > regards, > > ==================================================== > > GPG key 1024D/4434BAB3 2008-08-24 > > gpg --keyserver subkeys.pgp.net --recv-keys 4434BAB3 gpg --keyserver > > subkeys.pgp.net --armor --export 4434BAB3 > > ---------------------------------------------------------------------- > > For information about J forums see http://www.jsoftware.com/forums.htm > > > > > > -- > Devon McCormick, CFA > ---------------------------------------------------------------------- > For information about J forums see http://www.jsoftware.com/forums.htm > > ---------------------------------------------------------------------- > For information about J forums see http://www.jsoftware.com/forums.htm ---------------------------------------------------------------------- For information about J forums see http://www.jsoftware.com/forums.htm
