Well, this is not only a Perl issue. You should get hold of a copy of the HTML specification (http://www.w3.org/TR/html401/ for version 4.01) and write a parser in whatever language you want (Perl, for example).
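As a rough illustration of the fetch-then-parse idea, here is a minimal sketch. It assumes a Perl recent enough to ship HTTP::Tiny as a core module (5.14 or later); on older installations, get() from LWP::Simple is a common alternative. The helper names fetch_lines, frame_sources and strip_tags are made up for this example, and the regular expressions are only a crude stand-in for a real parser written against the HTML 4.01 specification.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use HTTP::Tiny;    # core since Perl 5.14; https URLs also need IO::Socket::SSL

# Hypothetical helper: fetch a URL and return its body as a list of lines.
sub fetch_lines {
    my ($url) = @_;
    my $response = HTTP::Tiny->new(timeout => 10)->get($url);
    die "GET $url failed: $response->{status} $response->{reason}\n"
        unless $response->{success};
    return split /\n/, $response->{content};
}

# Hypothetical helper: return the src attribute of every <frame> or <iframe>
# tag. A regex only approximates the HTML grammar; it is not a real parser.
sub frame_sources {
    my ($html) = @_;
    my @sources;
    while ($html =~ /<i?frame\b[^>]*?\bsrc\s*=\s*(?:"([^"]*)"|'([^']*)'|([^\s>]+))/gi) {
        push @sources, $1 // $2 // $3;    # whichever quoting style matched
    }
    return @sources;
}

# Hypothetical helper: crude tag stripper. Drops comments and script/style
# bodies, replaces remaining tags with a space, then collapses whitespace.
# Character entities such as &amp; are left untouched.
sub strip_tags {
    my ($html) = @_;
    $html =~ s/<!--.*?-->//gs;
    $html =~ s/<(script|style)\b.*?<\/\1\s*>//gis;
    $html =~ s/<[^>]*>/ /g;
    $html =~ s/\s+/ /g;
    $html =~ s/^\s+|\s+$//g;
    return $html;
}

# Only touch the network when run directly with a URL argument.
if (!caller() && @ARGV) {
    my @lines = fetch_lines($ARGV[0]);    # the page, one array element per line
    my $html  = join "\n", @lines;
    print "frame: $_\n" for frame_sources($html);
    print strip_tags($html), "\n";
}
```

Saved as, say, readpage.pl (the file name is arbitrary), it would be run as "perl readpage.pl http://www.w3.org/TR/html401/". Any frame sources it reports are separate documents that would have to be fetched in turn, and relative ones would first need resolving against the page's own URL.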
It's not simply a matter of getting a file, because you could just get a frame (or a frameset pointing at other documents) rather than the whole page!

Hope it helps,
Duarte

--
Duarte Manuel Cordeiro
Manager - IT Infrastructure - Security & Communications
mailto:[EMAIL PROTECTED] | msn: [EMAIL PROTECTED] | http://www.neoris.com/
--
Neoris Portugal
Edificio Inovação IV - Sala 819 - Taguspark * 2780-920 Oeiras * Portugal
Tel: +351 21 423-8350 | Fax: +351 21 421-7626 | Mob: +35191 613-5706

-----Original Message-----
From: Collins, Joe (EDSI\BDR) [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, 30 January 2002 14:27
To: '[EMAIL PROTECTED]'
Subject: How do I read a web page from within perl?

For example, suppose I want to capture www.cnn.com into an array and process the text.
How does one do this?

Many thanks,
Joe

--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]