their tech and company (also attended the session).
Alex
At 12:10 PM 02/02/2001 -0800, Nick Arnett wrote:
Anyone know more about this company or project...?
http://news.bbc.co.uk/hi/english/sci/tech/newsid_1146000/1146589.stm
Nick Arnett
Sr. VP and Co-Founder
Opion Inc.
Direct phone/fax: 408-733
there's the
seed of a good idea in it. However, it seems to me that if the authors of a
page would actually bother to create meta-tags to increase search
efficiency, it would be much easier (semi-automated, even) to create a tag
containing the *most* relevant words, not the least.
Nick Arnett
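The semi-automated tagging idea in the message above could look roughly like the sketch below: count word frequencies in a page's text and emit a keywords meta tag holding the most frequent terms. The stop-word list, the function name, and the cutoff of five terms are assumptions for illustration, not anything proposed on the list.

```python
import re
from collections import Counter
from html import escape

# Hypothetical, deliberately tiny stop-word list; a real one would be longer.
STOP_WORDS = {"the", "a", "an", "and", "or", "of", "to", "in", "is",
              "it", "for", "on", "that", "this", "with"}


def keywords_meta_tag(text, limit=5):
    """Build a keywords meta tag from the most frequent words in text."""
    words = re.findall(r"[a-z]+", text.lower())
    # Ignore stop words and very short tokens before counting.
    counts = Counter(w for w in words if w not in STOP_WORDS and len(w) > 2)
    top = [word for word, _ in counts.most_common(limit)]
    return '<meta name="keywords" content="{}">'.format(escape(", ".join(top)))


if __name__ == "__main__":
    page_text = "Robots crawl the web; robots obey robots.txt and crawl politely"
    print(keywords_meta_tag(page_text, limit=2))
```

Raw frequency is the simplest possible relevance measure; weighting terms against a larger corpus would be a natural refinement.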
-Original Message-
From: Sean 'Captain Napalm' Conner [mailto:[EMAIL PROTECTED]]
Sent: Friday, November 23, 2001 11:26 PM
To: [EMAIL PROTECTED]
Subject: Re: [Robots] Re: Correct URL, shlash at the end ?
It was thus said that the Great George Phillips once stated:
Don't be misled
, 31 Oct 2000 15:48:21 -0800
Reply-To: [EMAIL PROTECTED]
Sender: [EMAIL PROTECTED]
From: Nick Arnett [EMAIL PROTECTED]
Subject: Robots, km lists back up
Comments: To: [EMAIL PROTECTED], [EMAIL PROTECTED]
To: [EMAIL PROTECTED]
Date: Tue, 31 Oct 2000 23:54:17 -0800
Content-transfer-encoding: 7bit
x
of bounces that show addresses that are
not subscribed to the list... so if you see a few bounces when you post
(most come here, as they should), that may be the reason.
Nick Arnett
Sr. VP and Co-Founder
Opion Inc.
Direct phone: 408-733-7613 Fax: 408-904-7198
http://www.opion.com
From [EMAIL PROTECTED] Fri Nov 10 14:47:29 2000
Received: by mccmedia.com from localhost
(router,SLMail V2.7); Fri, 10 Nov 2000 14:47:29 -0800
Received: by mccmedia.com from mail2
(209.133.89.19::mail daemon; unverified,SLMail V2.7); Fri, 10 Nov 2000 14:47:26 -0800
Received: from
to the servers with every request.
Hope that's reasonably clear.
Nick
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On
Behalf Of Otis Gospodnetic
Sent: Thursday, March 14, 2002 8:57 AM
To: [EMAIL PROTECTED]
Subject: [Robots] Re: SV: matching and UserAgent
was really going
on. Following an LWP request through the debugger is a long and convoluted
journey...
Nick
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On
Behalf Of srinivas mohan
Sent: Thursday, March 14, 2002 9:48 AM
To: [EMAIL PROTECTED]
Subject: [Robots
.
Ah, now I see your point. That does seem to be a problem, since apparently
version numbers were contemplated in User-Agent headers... Sounds like
something for the LWP author(s).
Or, a convenient excuse for a badly behaved robot... !
Nick
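The quoted part of this thread is truncated, so the exact LWP behavior under discussion is not shown. As a rough sketch of one way a robot could keep version numbers from interfering with robots.txt matching, the code below strips the version from the first User-Agent product token and then does a case-insensitive substring match, which is what the original robots exclusion convention recommends. The function names and the ExampleBot agent string are hypothetical.

```python
def product_token(user_agent):
    """Return the lowercased name of the first product token, minus its version."""
    parts = user_agent.strip().split()
    if not parts:
        return ""
    # "ExampleBot/1.2" -> "examplebot"
    return parts[0].split("/", 1)[0].lower()


def record_applies(record_agent, user_agent):
    """Check whether a robots.txt User-agent value applies to this robot."""
    record = record_agent.strip().lower()
    if record == "*":
        return True
    # Case-insensitive substring match on the name without version info.
    return record in product_token(user_agent)


if __name__ == "__main__":
    print(record_applies("examplebot", "ExampleBot/1.2 libwww-perl/5.64"))
```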
--
This message was sent by the Internet robots
I've been hitting problems with a Python-based robot I'm working on and just
found out that there's a timeout module that will make it easy to implement
the kind of functionality that Tim Bray was suggesting here earlier. It
apparently works for any TCP connection. Here's the link:
Commands need to be sent to [EMAIL PROTECTED].
Send unsubscribe robots in the body of a message to leave this list.
Nick
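The link to the timeout module is missing from this archived copy, and the module itself is not identified. As a stand-in, here is a minimal sketch of the same idea using only Python's standard socket library: put a timeout on the TCP connection so that a stalled server cannot hang the robot. The function name and the default values are assumptions.

```python
import socket


def read_with_timeout(host, port, timeout=5.0, max_bytes=4096):
    """Read up to max_bytes from host:port, or return None on timeout.

    The timeout covers both the connect and the read. Other errors,
    such as a refused connection, are left to propagate to the caller.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout) as conn:
            return conn.recv(max_bytes)
    except socket.timeout:
        return None
```

A robot would typically treat a None result as "skip this host for now" and retry later rather than blocking the whole crawl.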
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On
Behalf Of HuiFang Wang
Sent: Tuesday, March 26, 2002 2:30 AM
To: [EMAIL PROTECTED
that would clearly apply. If they want to
have a limit on robots, I sure would appreciate it if they would say what it
is...
It's been about 30 minutes now and I'm still blocked, it seems.
Just checked from another machine -- they still have no robots.txt at all.
Nick
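For reference, checking what a site's robots.txt permits can be sketched with Python's standard urllib.robotparser. The example parses rule lines directly so it runs without network access; the ExampleBot name and the example.com URLs are hypothetical. An empty rule set, like a site with no robots.txt, permits everything, which is why its absence says nothing about rate limits.

```python
from urllib.robotparser import RobotFileParser


def allowed(robots_lines, user_agent, url):
    """Return True if the given robots.txt lines let user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    rules = ["User-agent: *", "Disallow: /private/"]
    print(allowed(rules, "ExampleBot", "http://example.com/private/page"))
    print(allowed([], "ExampleBot", "http://example.com/anything"))
```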
--
[EMAIL PROTECTED]
(408
.
Sure would be nice if they'd just say what they consider acceptable...
Nick
--
[EMAIL PROTECTED]
(408) 904-7198
___
Robots mailing list
[EMAIL PROTECTED]
http://www.mccmedia.com/mailman/listinfo/robots
essentially creating a toolbox with Python and MySQL, which I'm using to
create custom information products for consulting clients. For the moment,
those (obviously) are companies with a strong interest in Java.
Nick
--
Nick Arnett
Phone/fax: (408) 904-7198
[EMAIL PROTECTED
At the risk of talking to myself... Would a gateway from mailing lists to
NNTP address most of the issues I described? NNTP already knows about
threading, updating, etc.
However, I've been stymied by the problem of discovering new NNTP servers.
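The threading that NNTP provides rests on the Message-ID, References, and In-Reply-To headers, which mailing-list messages usually carry as well. A minimal sketch of how a gateway might rebuild threads from raw messages follows; the function name and the sample message IDs are made up for illustration.

```python
from collections import defaultdict
from email import message_from_string


def build_thread_index(raw_messages):
    """Map each parent Message-ID to the IDs of its direct replies.

    Messages with no parent are listed under the empty string.
    """
    children = defaultdict(list)
    for raw in raw_messages:
        msg = message_from_string(raw)
        msg_id = msg.get("Message-ID", "").strip()
        refs = msg.get("References", "").split()
        # Prefer In-Reply-To; fall back to the last References entry.
        parent = msg.get("In-Reply-To", "").strip() or (refs[-1] if refs else "")
        children[parent].append(msg_id)
    return dict(children)


if __name__ == "__main__":
    first = "Message-ID: <a@example>\nSubject: hi\n\nbody"
    reply = ("Message-ID: <b@example>\nIn-Reply-To: <a@example>\n"
             "Subject: Re: hi\n\nbody")
    print(build_thread_index([first, reply]))
```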
--
Nick Arnett
Phone/fax: (408) 904-7198
[EMAIL
[EMAIL PROTECTED] wrote:
I've created a robot, www.dead-links.com, and I wonder if this list is alive.
It is alive, but very, very quiet.
Nick
need recursion. It's about 400 lines. A lot of it deals with things
like missing messages, zeroing in on desired date ranges, avoiding
downloading huge messages, recovery from failure, etc.
All of these talk to MySQL...
Nick
--
Nick Arnett
Phone/fax: (408) 904-7198
[EMAIL PROTECTED
what they'd consider acceptable.
And yet, their own servers don't seem to have a robots.txt that defines
any limitations. Sure would be nice if *they* would tell *us* what's
acceptable when crawling Yahoo!
Nick
--
Nick Arnett
Director, Business Intelligence Services
LiveWorld Inc.
Phone/fax
modified-by: UptimeBot team
Best regards.
Maks (aka Luft)
--
Nick Arnett
Director, Business Intelligence Services
LiveWorld Inc.
Phone/fax: (408) 551-0427
[EMAIL PROTECTED]
22 matches