Another approach would be to use a util to turn it into XHTML (XML) and
then using Xpath to get to anything.

Gary

-----Original Message-----
From: Brant Hahn [mailto:[EMAIL PROTECTED] 
Sent: Friday, November 19, 2004 2:30 PM
To: 'Jakarta Commons Users List'
Subject: [HttpClient] Screen Scraping Components?

Hi, 

 

I've been using HttpClient for a few months now.  I was wondering if
anyone
out there using had a recommendation on any 3rd party component for
screen
scraping?  I've seen a few out there, including Jericho, but generally
have
to write more code than I want to when using it.  Just curious if there
was
something out there that takes-in regex Pattern objects (or just regex
pre-compiled strings) to easily get the data that I want off of any
page.

 

Thanks,

Brant


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to