Re: [PHP] How to stop site flipping?

2002-05-14 Thread Miguel Cruz

On Tue, 14 May 2002, Kirk Babb wrote:
> I have a listing of department alumni which calls up their email address
> when one of the form buttons (letters of the alphabet) is clicked.  The
> emails are not hard-coded; the form sends the $letter to a php script which
> handles all the usual stuff (the query, getting the array for the query,
> printing/formatting it).  Is this type of thing accessible to spiders?  They
> can't actually perform actions on the pages can they? 

Many crawlers follow GET arguments, so if you use happen to be using links
rather than  tags, then they'll probably find 'em.

miguel


-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php




Re: [PHP] How to stop site flipping?

2002-05-14 Thread Kirk Babb

I have a listing of department alumni which calls up their email address
when one of the form buttons (letters of the alphabet) is clicked.  The
emails are not hard-coded; the form sends the $letter to a php script which
handles all the usual stuff (the query, getting the array for the query,
printing/formatting it).  Is this type of thing accessible to spiders?  They
can't actually perform actions on the pages can they?  Sorry if this is a
stupid question, I've been reading along with this discussion.  From reading
what's been said so far, I'm guessing the only email they can get to is the
webmaster address on the front page of the site

Thanks!

Kirk


"Justin French" <[EMAIL PROTECTED]> wrote in message
[EMAIL PROTECTED]">news:[EMAIL PROTECTED]...
> This might help:
>
> I have a php file in my doc root called email.php.  Instead of linking an
> email address with mailto:[EMAIL PROTECTED]";>, I link it like this:
>
> Justin French
>
> (Actually I do it with a function, but for clarity sake, I'll leave the
> above in place).
>
> email.php isn't a HTML page, it's a small script which sends a mailto:
> header to the browser:
>
> 
> if(!isset($_GET['to']))
> {
> $_GET['to'] = "info";
> }
>
> if(!isset($domain))
> {
> $_GET['domain'] = "mydomain.com.au";
> }
>
> $to = $_GET['to'];
> $domain = $_GET['domain'];
>
> $email_address = $to."@".$domain;
>
> // send email header to page
> header ("Location: mailto:$email_address";);
>
> ?>
>
> As you can see, there are defaults in there, so if they link to
> email.php?to=justin, the default domain will be used, and vice-versa for a
> missing "to", or both missing.
>
> I tested it in a few browsers, and the original page was not refreshed...
an
> email window popped up in outlook express, just as it would with a regular
> mailto: link, and everything works fine.
>
>
> This prevents spiders hunting for mailto:something in your pages.  Other
> spiders might look for anything that looks like an email address, so I'd
> recommend not having regular email address' anywhere:
>
> Justin French
>
> instead of
>
> [EMAIL PROTECTED]
>
>
> I've seen some simular stuff where the users put in something like a space
> around the @ to make it harder for the spiders to find the text too, but I
> dunno if this is that reliable:
>
> justin @
> indent.com.au
>
>
> Of course the other option is to strip every email address from your site,
> and put in a "contact us" form.
>
>
> Justin French
>
>
>
>
>
> on 14/05/02 12:07 PM, David McInnis ([EMAIL PROTECTED]) wrote:
>
> > I have been running a newswire service since 97 and recently noticed an
> > increase in the number of people flipping our site to harvest email
> > addresses contained in the news releases posted on our site. (prweb.com)
> >
> > I am running apache and php on a linux box.  Can anyone suggest
> > something that I can implement that would block users who are harvesting
> > data from our site?  I do not mind legit users from using this data, but
> > the flippers are chewing up my bandwidth and db resources.
> >
> > I also want to be careful to not block valid search engine spiders from
> > indexing our site.
> >
> > David McInnis
> >
>



-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php




RE: [PHP] How to stop site flipping?

2002-05-13 Thread Cal Evans

sweet.  Very nice solution.

*
* Cal Evans
* Journeyman Programmer
* Techno-Mage
* http://www.calevans.com
*


-Original Message-
From: Justin French [mailto:[EMAIL PROTECTED]]
Sent: Monday, May 13, 2002 10:18 PM
To: David McInnis; [EMAIL PROTECTED]
Cc: [EMAIL PROTECTED]; [EMAIL PROTECTED]
Subject: Re: [PHP] How to stop site flipping?


This might help:

I have a php file in my doc root called email.php.  Instead of linking an
email address with mailto:[EMAIL PROTECTED]";>, I link it like this:

Justin French

(Actually I do it with a function, but for clarity sake, I'll leave the
above in place).

email.php isn't a HTML page, it's a small script which sends a mailto:
header to the browser:

mailto:$email_address";);

?>

As you can see, there are defaults in there, so if they link to
email.php?to=justin, the default domain will be used, and vice-versa for a
missing "to", or both missing.

I tested it in a few browsers, and the original page was not refreshed... an
email window popped up in outlook express, just as it would with a regular
mailto: link, and everything works fine.


This prevents spiders hunting for mailto:something in your pages.  Other
spiders might look for anything that looks like an email address, so I'd
recommend not having regular email address' anywhere:

Justin French

instead of

[EMAIL PROTECTED]


I've seen some simular stuff where the users put in something like a space
around the @ to make it harder for the spiders to find the text too, but I
dunno if this is that reliable:

justin @
indent.com.au


Of course the other option is to strip every email address from your site,
and put in a "contact us" form.


Justin French





on 14/05/02 12:07 PM, David McInnis ([EMAIL PROTECTED]) wrote:

> I have been running a newswire service since 97 and recently noticed an
> increase in the number of people flipping our site to harvest email
> addresses contained in the news releases posted on our site. (prweb.com)
>
> I am running apache and php on a linux box.  Can anyone suggest
> something that I can implement that would block users who are harvesting
> data from our site?  I do not mind legit users from using this data, but
> the flippers are chewing up my bandwidth and db resources.
>
> I also want to be careful to not block valid search engine spiders from
> indexing our site.
>
> David McInnis
>


--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php



-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php




Re: [PHP] How to stop site flipping?

2002-05-13 Thread Justin French

This might help:

I have a php file in my doc root called email.php.  Instead of linking an
email address with mailto:[EMAIL PROTECTED]";>, I link it like this:

Justin French

(Actually I do it with a function, but for clarity sake, I'll leave the
above in place).

email.php isn't a HTML page, it's a small script which sends a mailto:
header to the browser:

mailto:$email_address";);

?>

As you can see, there are defaults in there, so if they link to
email.php?to=justin, the default domain will be used, and vice-versa for a
missing "to", or both missing.

I tested it in a few browsers, and the original page was not refreshed... an
email window popped up in outlook express, just as it would with a regular
mailto: link, and everything works fine.


This prevents spiders hunting for mailto:something in your pages.  Other
spiders might look for anything that looks like an email address, so I'd
recommend not having regular email address' anywhere:

Justin French

instead of

[EMAIL PROTECTED]


I've seen some simular stuff where the users put in something like a space
around the @ to make it harder for the spiders to find the text too, but I
dunno if this is that reliable:

justin @
indent.com.au


Of course the other option is to strip every email address from your site,
and put in a "contact us" form.


Justin French





on 14/05/02 12:07 PM, David McInnis ([EMAIL PROTECTED]) wrote:

> I have been running a newswire service since 97 and recently noticed an
> increase in the number of people flipping our site to harvest email
> addresses contained in the news releases posted on our site. (prweb.com)
> 
> I am running apache and php on a linux box.  Can anyone suggest
> something that I can implement that would block users who are harvesting
> data from our site?  I do not mind legit users from using this data, but
> the flippers are chewing up my bandwidth and db resources.
> 
> I also want to be careful to not block valid search engine spiders from
> indexing our site.
> 
> David McInnis
> 


-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php




Re: [PHP] How to stop site flipping?

2002-05-13 Thread John S. Huggins

If they reveal themselves in the AGENT info of each web request, you could
act on that accordingly in your PHP code by looking at the REQUEST_AGENT
variable.  I use this tactic to prevent web sucking programs from access
my site all at once.

On Mon, 13 May 2002, David McInnis wrote:

>-I have been running a newswire service since 97 and recently noticed an
>-increase in the number of people flipping our site to harvest email
>-addresses contained in the news releases posted on our site. (prweb.com)
>-
>-I am running apache and php on a linux box.  Can anyone suggest
>-something that I can implement that would block users who are harvesting
>-data from our site?  I do not mind legit users from using this data, but
>-the flippers are chewing up my bandwidth and db resources.  
>-
>-I also want to be careful to not block valid search engine spiders from
>-indexing our site.
>-
>-David McInnis
>-
>-
>--- 
>-PHP General Mailing List (http://www.php.net/)
>-To unsubscribe, visit: http://www.php.net/unsub.php
>-

**

John Huggins
VANet

[EMAIL PROTECTED]
http://www.va.net/

**


-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php




Re: [PHP] How to stop site flipping?

2002-05-13 Thread Jason Morehouse

A good start is: http://www.neilgunton.com/spambot_trap/

Though it has some specific details for mod_perl, much of the information 
can be tied to php.

On Mon, 13 May 2002, David McInnis wrote:

> I have been running a newswire service since 97 and recently noticed an
> increase in the number of people flipping our site to harvest email
> addresses contained in the news releases posted on our site. (prweb.com)
> 
> I am running apache and php on a linux box.  Can anyone suggest
> something that I can implement that would block users who are harvesting
> data from our site?  I do not mind legit users from using this data, but
> the flippers are chewing up my bandwidth and db resources.  
> 
> I also want to be careful to not block valid search engine spiders from
> indexing our site.
> 
> David McInnis
> 
> 
> 

-- 

.--- .- ... --- -. /  -- --- .-. .  --- ..- ... .

 Jason Morehouse
 [EMAIL PROTECTED]
 
 Netconcepts LTD
 Auckland, New Zealand

 - Linux: free your mind and your OS will follow


-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php




[PHP] How to stop site flipping?

2002-05-13 Thread David McInnis

I have been running a newswire service since 97 and recently noticed an
increase in the number of people flipping our site to harvest email
addresses contained in the news releases posted on our site. (prweb.com)

I am running apache and php on a linux box.  Can anyone suggest
something that I can implement that would block users who are harvesting
data from our site?  I do not mind legit users from using this data, but
the flippers are chewing up my bandwidth and db resources.  

I also want to be careful to not block valid search engine spiders from
indexing our site.

David McInnis


-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php