But if you do really want/need to analyse html, then have a look at the htmlparser project on sourceforge. There are other html parsers e.g. Tidy.
S. On 10/08/06, Ortwin Glück <[EMAIL PROTECTED]> wrote:
Errol, No, there is no such functionality in the API. HttpClient is a transport library and does not look at the content. Whether its binary or HTML, HttpClient doesn't care. I also believe that "HTML screen scraping" is not the way to code interfaces between machines. Webservices were invented for this purpose. Cheers Ortwin Errol Dalgic wrote: > Hi there, > > I've written several bots to extract the form elements and values from HTML > pages utilising HttpClient and the java regex pattern/matcher api. I was > wondering whether there were any generic methods within the HTTPClient API > which would allow me to perform this task ie. retrieve all the name and > value pairs from the response? I am sure this would be useful and happy to > share what I have written so far. > > Thanks, > Errol > > > Errol Dalgic > Programmer Analyst > ________________________________________ > eSolutions - Consumer Operations Group > SingTel Optus Pty Ltd > [EMAIL PROTECTED] > > > > The information contained in this e-mail message and any accompanying files > is or may be confidential. If you are not the intended recipient, any use, > dissemination, reliance, forwarding, printing or copying of this e-mail or > any attached files is unauthorised. This e-mail is subject to copyright. No > part of it should be reproduced, adapted or communicated without the written > consent of the copyright owner. If you have received this e-mail in error, > please advise the sender immediately by return e-mail, or telephone and > delete all copies. Optus does not guarantee the accuracy or completeness of > any information contained in this e-mail or attached files. Internet > communications are not secure, therefore Optus does not accept legal > responsibility for the contents of this message or attached files. > > > > > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > -- [web] http://www.odi.ch/ [blog] http://www.odi.ch/weblog/ [pgp] key 0x81CF3416 finger print F2B1 B21F F056 D53E 5D79 A5AF 02BE 70F5 81CF 3416 --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]