Re: [PHP] How to stop site flipping?
On Tue, 14 May 2002, Kirk Babb wrote: > I have a listing of department alumni which calls up their email address > when one of the form buttons (letters of the alphabet) is clicked. The > emails are not hard-coded; the form sends the $letter to a php script which > handles all the usual stuff (the query, getting the array for the query, > printing/formatting it). Is this type of thing accessible to spiders? They > can't actually perform actions on the pages can they? Many crawlers follow GET arguments, so if you use happen to be using links rather than tags, then they'll probably find 'em. miguel -- PHP General Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php
Re: [PHP] How to stop site flipping?
I have a listing of department alumni which calls up their email address when one of the form buttons (letters of the alphabet) is clicked. The emails are not hard-coded; the form sends the $letter to a php script which handles all the usual stuff (the query, getting the array for the query, printing/formatting it). Is this type of thing accessible to spiders? They can't actually perform actions on the pages can they? Sorry if this is a stupid question, I've been reading along with this discussion. From reading what's been said so far, I'm guessing the only email they can get to is the webmaster address on the front page of the site Thanks! Kirk "Justin French" <[EMAIL PROTECTED]> wrote in message [EMAIL PROTECTED]">news:[EMAIL PROTECTED]... > This might help: > > I have a php file in my doc root called email.php. Instead of linking an > email address with mailto:[EMAIL PROTECTED]";>, I link it like this: > > Justin French > > (Actually I do it with a function, but for clarity sake, I'll leave the > above in place). > > email.php isn't a HTML page, it's a small script which sends a mailto: > header to the browser: > > > if(!isset($_GET['to'])) > { > $_GET['to'] = "info"; > } > > if(!isset($domain)) > { > $_GET['domain'] = "mydomain.com.au"; > } > > $to = $_GET['to']; > $domain = $_GET['domain']; > > $email_address = $to."@".$domain; > > // send email header to page > header ("Location: mailto:$email_address";); > > ?> > > As you can see, there are defaults in there, so if they link to > email.php?to=justin, the default domain will be used, and vice-versa for a > missing "to", or both missing. > > I tested it in a few browsers, and the original page was not refreshed... an > email window popped up in outlook express, just as it would with a regular > mailto: link, and everything works fine. > > > This prevents spiders hunting for mailto:something in your pages. Other > spiders might look for anything that looks like an email address, so I'd > recommend not having regular email address' anywhere: > > Justin French > > instead of > > [EMAIL PROTECTED] > > > I've seen some simular stuff where the users put in something like a space > around the @ to make it harder for the spiders to find the text too, but I > dunno if this is that reliable: > > justin @ > indent.com.au > > > Of course the other option is to strip every email address from your site, > and put in a "contact us" form. > > > Justin French > > > > > > on 14/05/02 12:07 PM, David McInnis ([EMAIL PROTECTED]) wrote: > > > I have been running a newswire service since 97 and recently noticed an > > increase in the number of people flipping our site to harvest email > > addresses contained in the news releases posted on our site. (prweb.com) > > > > I am running apache and php on a linux box. Can anyone suggest > > something that I can implement that would block users who are harvesting > > data from our site? I do not mind legit users from using this data, but > > the flippers are chewing up my bandwidth and db resources. > > > > I also want to be careful to not block valid search engine spiders from > > indexing our site. > > > > David McInnis > > > -- PHP General Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php
RE: [PHP] How to stop site flipping?
sweet. Very nice solution. * * Cal Evans * Journeyman Programmer * Techno-Mage * http://www.calevans.com * -Original Message- From: Justin French [mailto:[EMAIL PROTECTED]] Sent: Monday, May 13, 2002 10:18 PM To: David McInnis; [EMAIL PROTECTED] Cc: [EMAIL PROTECTED]; [EMAIL PROTECTED] Subject: Re: [PHP] How to stop site flipping? This might help: I have a php file in my doc root called email.php. Instead of linking an email address with mailto:[EMAIL PROTECTED]";>, I link it like this: Justin French (Actually I do it with a function, but for clarity sake, I'll leave the above in place). email.php isn't a HTML page, it's a small script which sends a mailto: header to the browser: mailto:$email_address";); ?> As you can see, there are defaults in there, so if they link to email.php?to=justin, the default domain will be used, and vice-versa for a missing "to", or both missing. I tested it in a few browsers, and the original page was not refreshed... an email window popped up in outlook express, just as it would with a regular mailto: link, and everything works fine. This prevents spiders hunting for mailto:something in your pages. Other spiders might look for anything that looks like an email address, so I'd recommend not having regular email address' anywhere: Justin French instead of [EMAIL PROTECTED] I've seen some simular stuff where the users put in something like a space around the @ to make it harder for the spiders to find the text too, but I dunno if this is that reliable: justin @ indent.com.au Of course the other option is to strip every email address from your site, and put in a "contact us" form. Justin French on 14/05/02 12:07 PM, David McInnis ([EMAIL PROTECTED]) wrote: > I have been running a newswire service since 97 and recently noticed an > increase in the number of people flipping our site to harvest email > addresses contained in the news releases posted on our site. (prweb.com) > > I am running apache and php on a linux box. Can anyone suggest > something that I can implement that would block users who are harvesting > data from our site? I do not mind legit users from using this data, but > the flippers are chewing up my bandwidth and db resources. > > I also want to be careful to not block valid search engine spiders from > indexing our site. > > David McInnis > -- PHP General Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php -- PHP General Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php
Re: [PHP] How to stop site flipping?
This might help: I have a php file in my doc root called email.php. Instead of linking an email address with mailto:[EMAIL PROTECTED]";>, I link it like this: Justin French (Actually I do it with a function, but for clarity sake, I'll leave the above in place). email.php isn't a HTML page, it's a small script which sends a mailto: header to the browser: mailto:$email_address";); ?> As you can see, there are defaults in there, so if they link to email.php?to=justin, the default domain will be used, and vice-versa for a missing "to", or both missing. I tested it in a few browsers, and the original page was not refreshed... an email window popped up in outlook express, just as it would with a regular mailto: link, and everything works fine. This prevents spiders hunting for mailto:something in your pages. Other spiders might look for anything that looks like an email address, so I'd recommend not having regular email address' anywhere: Justin French instead of [EMAIL PROTECTED] I've seen some simular stuff where the users put in something like a space around the @ to make it harder for the spiders to find the text too, but I dunno if this is that reliable: justin @ indent.com.au Of course the other option is to strip every email address from your site, and put in a "contact us" form. Justin French on 14/05/02 12:07 PM, David McInnis ([EMAIL PROTECTED]) wrote: > I have been running a newswire service since 97 and recently noticed an > increase in the number of people flipping our site to harvest email > addresses contained in the news releases posted on our site. (prweb.com) > > I am running apache and php on a linux box. Can anyone suggest > something that I can implement that would block users who are harvesting > data from our site? I do not mind legit users from using this data, but > the flippers are chewing up my bandwidth and db resources. > > I also want to be careful to not block valid search engine spiders from > indexing our site. > > David McInnis > -- PHP General Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php
Re: [PHP] How to stop site flipping?
If they reveal themselves in the AGENT info of each web request, you could act on that accordingly in your PHP code by looking at the REQUEST_AGENT variable. I use this tactic to prevent web sucking programs from access my site all at once. On Mon, 13 May 2002, David McInnis wrote: >-I have been running a newswire service since 97 and recently noticed an >-increase in the number of people flipping our site to harvest email >-addresses contained in the news releases posted on our site. (prweb.com) >- >-I am running apache and php on a linux box. Can anyone suggest >-something that I can implement that would block users who are harvesting >-data from our site? I do not mind legit users from using this data, but >-the flippers are chewing up my bandwidth and db resources. >- >-I also want to be careful to not block valid search engine spiders from >-indexing our site. >- >-David McInnis >- >- >--- >-PHP General Mailing List (http://www.php.net/) >-To unsubscribe, visit: http://www.php.net/unsub.php >- ** John Huggins VANet [EMAIL PROTECTED] http://www.va.net/ ** -- PHP General Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php
Re: [PHP] How to stop site flipping?
A good start is: http://www.neilgunton.com/spambot_trap/ Though it has some specific details for mod_perl, much of the information can be tied to php. On Mon, 13 May 2002, David McInnis wrote: > I have been running a newswire service since 97 and recently noticed an > increase in the number of people flipping our site to harvest email > addresses contained in the news releases posted on our site. (prweb.com) > > I am running apache and php on a linux box. Can anyone suggest > something that I can implement that would block users who are harvesting > data from our site? I do not mind legit users from using this data, but > the flippers are chewing up my bandwidth and db resources. > > I also want to be careful to not block valid search engine spiders from > indexing our site. > > David McInnis > > > -- .--- .- ... --- -. / -- --- .-. . --- ..- ... . Jason Morehouse [EMAIL PROTECTED] Netconcepts LTD Auckland, New Zealand - Linux: free your mind and your OS will follow -- PHP General Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php
[PHP] How to stop site flipping?
I have been running a newswire service since 97 and recently noticed an increase in the number of people flipping our site to harvest email addresses contained in the news releases posted on our site. (prweb.com) I am running apache and php on a linux box. Can anyone suggest something that I can implement that would block users who are harvesting data from our site? I do not mind legit users from using this data, but the flippers are chewing up my bandwidth and db resources. I also want to be careful to not block valid search engine spiders from indexing our site. David McInnis -- PHP General Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php