Or you can use the Swing HTML parser (javax.swing.text.html) to get at the HTML components.
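A rough sketch of the Swing parser approach, in case it helps. It uses the JDK's ParserDelegator with an HTMLEditorKit.ParserCallback to collect link targets. The class name SwingLinkScraper, the method extractLinks, and the sample page are made up for illustration; in practice the page string would presumably be the response body fetched with HttpClient.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;
import javax.swing.text.MutableAttributeSet;
import javax.swing.text.html.HTML;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

// Hypothetical helper, not part of HttpClient or Swing.
public class SwingLinkScraper {

    // Collects the href attribute of every <a> start tag in the given HTML.
    public static List<String> extractLinks(String html) {
        final List<String> links = new ArrayList<String>();
        HTMLEditorKit.ParserCallback callback = new HTMLEditorKit.ParserCallback() {
            @Override
            public void handleStartTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
                if (tag == HTML.Tag.A) {
                    Object href = attrs.getAttribute(HTML.Attribute.HREF);
                    if (href != null) {
                        links.add(href.toString());
                    }
                }
            }
        };
        try {
            // The last argument tells the parser to ignore any charset declared in the page.
            new ParserDelegator().parse(new StringReader(html), callback, true);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return links;
    }

    public static void main(String[] args) {
        // Sample markup invented for this example.
        String page = "<html><body><a href=\"/one\">One</a> <a href=\"/two\">Two</a></body></html>";
        for (String link : extractLinks(page)) {
            System.out.println(link);
        }
    }
}
```

The callback also offers handleText and handleSimpleTag, so the same pattern should extend to pulling text out of specific elements.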
Gary Gregory wrote:
Another approach would be to use a utility to turn the page into XHTML (XML) and then use XPath to get to anything.
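A minimal sketch of the XPath half of that idea, using only the JDK's javax.xml APIs. It assumes the page has already been converted to well-formed XHTML by some separate tool, and that the markup carries no DOCTYPE or namespace declaration (either of which would need extra handling). The class name XhtmlXPathScraper, the method select, and the sample markup are invented for illustration.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper; the HTML-to-XHTML conversion step is assumed to have happened already.
public class XhtmlXPathScraper {

    // Evaluates an XPath expression against well-formed XHTML and
    // returns the text content of every matching node.
    public static List<String> select(String xhtml, String expression) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xhtml)));
            NodeList nodes = (NodeList) XPathFactory.newInstance()
                    .newXPath()
                    .evaluate(expression, doc, XPathConstants.NODESET);
            List<String> values = new ArrayList<String>();
            for (int i = 0; i < nodes.getLength(); i++) {
                values.add(nodes.item(i).getTextContent());
            }
            return values;
        } catch (Exception e) {
            // Parsing and XPath errors are wrapped so callers need no checked-exception handling.
            throw new IllegalArgumentException("Could not evaluate " + expression, e);
        }
    }

    public static void main(String[] args) {
        // Sample markup invented for this example.
        String page = "<html><body><table>"
                + "<tr><td class=\"price\">19.99</td></tr>"
                + "<tr><td class=\"price\">5.00</td></tr>"
                + "</table></body></html>";
        for (String price : select(page, "//td[@class='price']")) {
            System.out.println(price);
        }
    }
}
```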
Gary
-----Original Message-----
From: Brant Hahn [mailto:[EMAIL PROTECTED]]
Sent: Friday, November 19, 2004 2:30 PM
To: 'Jakarta Commons Users List'
Subject: [HttpClient] Screen Scraping Components?
Hi,
I've been using HttpClient for a few months now. I was wondering if anyone out there using it has a recommendation for a 3rd-party screen-scraping component? I've seen a few, including Jericho, but I generally have to write more code than I want to when using it. I'm just curious whether there is something out there that takes in regex Pattern objects (or just plain regex strings) to easily get the data I want off any page.
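To make the request concrete, here is a minimal sketch of the kind of helper being described, built only on java.util.regex. It is not an existing component: RegexScraper and extract are hypothetical names, the sample page is invented, and the helper assumes the supplied Pattern has at least one capture group.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper illustrating the idea; not a real third-party library.
public class RegexScraper {

    // Returns capture group 1 of every match of the pattern in the page text.
    // Assumes the pattern defines at least one capture group.
    public static List<String> extract(Pattern pattern, String page) {
        List<String> results = new ArrayList<String>();
        Matcher matcher = pattern.matcher(page);
        while (matcher.find()) {
            results.add(matcher.group(1));
        }
        return results;
    }

    public static void main(String[] args) {
        // Sample page invented for this example.
        String page = "<html><head><title>Quotes</title></head>"
                + "<body><b>ACME</b> 12.50 <b>INIT</b> 7.25</body></html>";
        Pattern bold = Pattern.compile("<b>(.*?)</b>", Pattern.CASE_INSENSITIVE | Pattern.DOTALL);
        for (String symbol : extract(bold, page)) {
            System.out.println(symbol);
        }
    }
}
```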
Thanks,
Brant
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
