I recommend using the one on the package server
(pkgs.racket-lang.org). It integrates more smoothly with the rest of
Racket libraries. While it is out of date, there are not significant
changes in the 3.0 version: "Numeric character entities now parse to
Racket strings instead of Racket characters, to bring SXML/xexp back
closer to SXML." I heavily use this package and have not hit sites
that it can't parse well. If necessary for you, we can update the
package, but it'll be slightly different than the 3.0 Planet version
because we'd rather not have version incompatibilities for this
change.

Jay

On Wed, Feb 17, 2016 at 10:04 AM, Brian Adkins <lojicdot...@gmail.com> wrote:
> I'm looking for an html parser that can handle real world web pages that are 
> typically invalid (similar to Ruby's Nokogiri). I came across recommendations 
> for the html-parsing package, so I went to:
>
> https://pkgs.racket-lang.org/
>
> I couldn't find it via the parsing or parser tags, but using my browser's 
> find, found html-parsing. The description states: "A fork of the Planet 1 
> html-parsing library". On the documentation page, I see a history with the 
> last entry being 0.3 2011-08-27 and the following require:
>
> (require sxml/html)
>
> Clicking on the following link at the top of the doc page:
>
> http://www.neilvandyke.org/racket-html-parsing/
>
> takes me to Neil's page with a more recent history: PLaneT 3:0 — 2015-04-24 
> and the following require:
>
> (require (planet neil/html-parsing:3:0))
>
> Which package do I want?
>
> Brian
>
> --
> You received this message because you are subscribed to the Google Groups 
> "Racket Users" group.
> To unsubscribe from this group and stop receiving emails from it, send an 
> email to racket-users+unsubscr...@googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.



-- 
Jay McCarthy
Associate Professor
PLT @ CS @ UMass Lowell
http://jeapostrophe.github.io

           "Wherefore, be not weary in well-doing,
      for ye are laying the foundation of a great work.
And out of small things proceedeth that which is great."
                          - D&C 64:33

-- 
You received this message because you are subscribed to the Google Groups 
"Racket Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to racket-users+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to