Or you can use swing html parser to get to html components


Gary Gregory wrote:

Another approach would be to use a util to turn it into XHTML (XML) and
then using Xpath to get to anything.

Gary

-----Original Message-----
From: Brant Hahn [mailto:[EMAIL PROTECTED] Sent: Friday, November 19, 2004 2:30 PM
To: 'Jakarta Commons Users List'
Subject: [HttpClient] Screen Scraping Components?


Hi,



I've been using HttpClient for a few months now.  I was wondering if
anyone
out there using had a recommendation on any 3rd party component for
screen
scraping?  I've seen a few out there, including Jericho, but generally
have
to write more code than I want to when using it.  Just curious if there
was
something out there that takes-in regex Pattern objects (or just regex
pre-compiled strings) to easily get the data that I want off of any
page.



Thanks,

Brant


--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]



.





---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Reply via email to