Sorry, User unknown.
Warning, delivery failure! This is a status message indicating that a message could
not be delivered to 1 or more recipients.
Original message subject: [Robots] Re: Security News Robot
Date received: 10-Jul-2001 07:15:34 +0200
Recipients and delivery history
[EMAIL PROTECTED]
---- Transcript of session follows ---
10-Jul-2001 07:15:34 +0200 Received via SMTP from WWW.MCCMEDIA.COM
10-Jul-2001 07:15:57 +0200 [EMAIL PROTECTED] is unknown
Reporting-MTA: dns;192.168.36.23
Final-Recipient: rfc822;[EMAIL PROTECTED]
Action: failed
Status: 5.0.0 (permanent failure)
Received: from WWW.MCCMEDIA.COM by [192.168.36.23]
with SMTP (QuickMail Pro Server for Mac 2.0.1); 10-Jul-2001 07:15:33 +0200
Received: from www.mccmedia.com (IDENT:listar@localhost [127.0.0.1])
by www.mccmedia.com (8.11.0/8.8.7) with ESMTP id f6A4mSA31383;
Mon, 9 Jul 2001 21:48:28 -0700
Received: with LISTAR (v1.0.0; list robots); Mon, 09 Jul 2001 21:48:27 -0700 (PDT)
Received: from mmb1.vsnl.net.in (bom9.vsnl.net.in [202.54.1.72])
by www.mccmedia.com (8.11.0/8.8.7) with ESMTP id f6A4H5A31097
for <[EMAIL PROTECTED]>; Mon, 9 Jul 2001 21:17:05 -0700
Received: from anish.flashmail.com (unknown [203.199.168.172])
by mmb1.vsnl.net.in (Postfix) with ESMTP id 985EE4D8D
for <[EMAIL PROTECTED]>; Tue, 10 Jul 2001 09:49:28 +0530 (IST)
Message-Id: <[EMAIL PROTECTED]>
X-Sender: [EMAIL PROTECTED] (Unverified)
X-Mailer: QUALCOMM Windows Eudora Version 5.1
Date: Tue, 10 Jul 2001 09:49:31 +0530
To: [EMAIL PROTECTED]
From: Anish Nair <[EMAIL PROTECTED]>
Subject: [Robots] Re: Security News Robot
In-Reply-To: <A1F9D8DB3D9ED311ABA3009027D3BCCF288D01@UTOPY1>
Mime-Version: 1.0
Content-type: text/plain
Content-Transfer-Encoding: 8bit
X-Approved-By: [EMAIL PROTECTED]
X-listar-version: Listar v1.0.0
Sender: [EMAIL PROTECTED]
Errors-to: [EMAIL PROTECTED]
X-original-sender: [EMAIL PROTECTED]
Precedence: bulk
Reply-to: [EMAIL PROTECTED]
X-list: robots
Hi,
I'm a student who's working on something similar to what you were. here's my
report. please comment on its feasibility...
Anish
-----------
At 03:45 AM 7/10/01, you wrote:
try moreover.com, they have service that supporting news titles and links in
XML format.
try in http://w.moreover.com/dev/index.html[1]
If you have any more questions about crawling into news sites, you can send
me the questions directly, I worked on product that collecting tech news
articles from the Web in the last year.
-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED][2]]
Sent: Monday, July 09, 2001 2:34 PM
To: [EMAIL PROTECTED]
Subject: [Robots] Security News Robot
Hi,
I am researching what would be the best way to get a robot (crawler) that
would search internet news sites with news that would be relevant for my
website. This would be anything about Security/Hacking/Privacy. I have a
list of sites I'd like to crawl as well as keywords that could be used.
I'd also like to be able to parse out the data from each article on my end
such as title, author, text, date of publication, URL, etc. One issue that
I am aware of is that not every site likes people using bots on them. Are
there ways around this? Is anyone aware if any tech news sites offer
XML\file format export of their articles? For large sites where I get lots
of articles, this would work well. My goal is to instead of manually going
out to sites to find articles about security I want the articles and
relevant info to come to me.
Any help, or a point in the right direction would be much appreciated.
Christopher Braswell
****************************************************************************
***
Note: The information contained in this message may be privileged
and confidential and protected from disclosure. If the reader of this
message is not the intended recipient, or an employee or agent responsible
for delivering this message to the intended recipient, you are hereby
notified that any dissemination, distribution or copying of this
communication is strictly prohibited. If you have received this
communication in error, please notify us immediately by replying to the
message and deleting it from your computer. Thank you. Ernst &Young LLP
****************************************************************************
***
--
This message was sent by the Internet robots and spiders discussion list
([EMAIL PROTECTED]). For list server commands, send "help" in the body of
a message to "[EMAIL PROTECTED]".
--
This message was sent by the Internet robots and spiders discussion list
([EMAIL PROTECTED]). For list server commands, send "help" in the body of
a message to "[EMAIL PROTECTED]".
-----------------------------
Anish R Nair
www.anishnair.com[3]
[EMAIL PROTECTED]
-----------------------------
---------------------------------------------
Your future depends on your dreams
So go to sleep !
---------------------------------------------
--- Links ---
1 http://w.moreover.com/dev/index.html
2 mailto:[EMAIL PROTECTED]
3 http://www.anishnair.com/
-- Binary/unsupported file stripped by Listar --
-- Type: application/msword
-- File: word_final.doc
--
This message was sent by the Internet robots and spiders discussion list
([EMAIL PROTECTED]). For list server commands, send "help" in the body of a message
to "[EMAIL PROTECTED]".