Re: [PHP] Googlebot (not another rtfm-mail)

2004-01-29 Thread Chris Hayes
At 17:58 28-1-04, you wrote:
On Wed, Jan 28, 2004 at 11:51:06 -0500, Michael Mulligan wrote:
 Googlebot visits my site occasionally and follows a lot of my PHP links
 with long query strings...

same here, google has been indexing my php site for over 3 years now :)
I find the google site is NOT very helpful for PHP guidelines, considering 
my experiences.

I used the error_document 404 redirect trick to use short links and it took 
some time to realise that Google boycotted that, probably because it gets a 
404.

Also links with a long parameter string of (5 parameters, 30 characters) 
were ignored.  As far as I could see Google only indexed 1 index.php page. 
So I had to overhaul the whole link logic of my cms. I now use mod_rewrite too.

But I have seen sites with only an one parameter (?ID=11) being indexed.

Be aware that javascript links and form buttons are unlikely to be indexed.

Chris

--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


RE: [PHP] Googlebot (not another rtfm-mail)

2004-01-29 Thread Williams, Olwen - SAL
Doing  that work just fine but you have to send the right heading.

My sites that do that have 
header(HTTP/1.0 200 OK); 
as the first instruction and are very well indexed by Google

Olwen Williams
[EMAIL PROTECTED]
-- 





-Original Message-
From: Chris Hayes [mailto:[EMAIL PROTECTED]
Sent: Friday, January 30, 2004 1:04 AM
To: [EMAIL PROTECTED]
Subject: Re: [PHP] Googlebot (not another rtfm-mail)


At 17:58 28-1-04, you wrote:
On Wed, Jan 28, 2004 at 11:51:06 -0500, Michael Mulligan wrote:
  Googlebot visits my site occasionally and follows a lot of my PHP links
  with long query strings...

same here, google has been indexing my php site for over 3 years now :)

I find the google site is NOT very helpful for PHP guidelines, considering 
my experiences.

I used the error_document 404 redirect trick to use short links and it took 
some time to realise that Google boycotted that, probably because it gets a 
404.

Also links with a long parameter string of (5 parameters, 30 characters) 
were ignored.  As far as I could see Google only indexed 1 index.php page. 
So I had to overhaul the whole link logic of my cms. I now use mod_rewrite
too.

But I have seen sites with only an one parameter (?ID=11) being indexed.

Be aware that javascript links and form buttons are unlikely to be indexed.

Chris

-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


CAUTION - This message may contain privileged and confidential 
information intended only for the use of the addressee named above.
If you are not the intended recipient of this message you are hereby 
notified that any use, dissemination, distribution or reproduction 
of this message is prohibited. If you have received this message in 
error please notify Safe Air Ltd immediately. Any views expressed 
in this message are those of the individual sender and may not 
necessarily reflect the views of Safe Air.
_
For more information on the Safe Air Group, visit us online
at http://www.safeair.co.nz/ 
_

-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php



Re: [PHP] Googlebot

2004-01-28 Thread Jon Bennett
from google http://www.google.com/webmasters/facts.html

Fiction:
Sites are not included in Google's index if they use ASP (or some other  
non-html file-type.)

Fact:
At Google, we are able to index most types of pages and files with very  
few exceptions. File types we are able to index include: pdf, asp, jsp,  
hdml, shtml, xml, cfm, doc, xls, ppt, rtf, wks, lwp, wri.

I notice php is not mentioned in this list, surely it can index php  
generated pages ??

Thanks,

Jon

jon bennett  |  [EMAIL PROTECTED]
new media creative
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
J   b   e   n   .   n   e   t

91 Gloucester Rd,  Trowbridge,  Wilts,  BA14 0AD
t: +44 (0) 1225 341039 w: http://www.jben.net/
iChat (AIM): jbendotnet
On 27 Jan 2004, at 14:53, Ben Ramsey wrote:

Perhaps this link will help:
http://www.google.com/webmasters/
Frank Tudor wrote:

That's not very helpful.
I am also curious about this.
Anyone have a more thought provoking answer?
Frank
--- Raditha Dissanayake [EMAIL PROTECTED] wrote:
If you read the pages on google you will know the answer.

Hartley, Matt wrote:


Is Googlebot (or any other bot) able to follow links that are
php?

e.g.
a href=Contact.phpContact Us/a
a href=Product.phpimg src=Images/product.jpg
width=110 height=111

border=0/a

Is there a way to invite bots to your site?

Thanks
Matt



--  
Raditha Dissanayake.

-- 
--
http://www.radinks.com/sftp/ |
http://www.raditha.com/megaupload
Lean and mean Secure FTP applet with | Mega Upload - PHP file
uploader
Graphical User Inteface. Just 150 KB | with progress bar.
--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php
__
Do you Yahoo!?
Yahoo! SiteBuilder - Free web site building tool. Try it!
http://webhosting.yahoo.com/ps/sb/
--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php

--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


Re: [PHP] Googlebot

2004-01-28 Thread Jon Bennett
is that it tends to avoid or downgrade URI's with parameters tacked 
onto
the end.
Does that go for internal links in your site then ???

news.php?start=10 etc ??? Could be tricky writing dyamic pages then.

Thanks,

Jon

jon bennett  |  [EMAIL PROTECTED]
new media creative
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
J   b   e   n   .   n   e   t

91 Gloucester Rd,  Trowbridge,  Wilts,  BA14 0AD
t: +44 (0) 1225 341039 w: http://www.jben.net/
iChat (AIM): jbendotnet
On 28 Jan 2004, at 01:05, Mike Migurski wrote:

Is Googlebot (or any other bot) able to follow links that are php?
As others have already pointed out to you, what better place to find
information about google than google?
That, or your server logs - googlebot is great about identifying 
itself.

RTFM's aside, a little understanding of how HTTP request-response loop
goes a long way. There's no reason why google (or any other bot) 
shouldn't
be capable of making a GET request for a *.php resource. My 
understanding
is that it tends to avoid or downgrade URI's with parameters tacked 
onto
the end.

-
michal migurski- contact info and pgp key:
sf/cahttp://mike.teczno.com/contact.html
--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php

--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


Re: [PHP] Googlebot

2004-01-28 Thread Stuart
Jon Bennett wrote:
is that it tends to avoid or downgrade URI's with parameters tacked onto
the end.
Does that go for internal links in your site then ???

news.php?start=10 etc ??? Could be tricky writing dyamic pages then.
Some spiders will not visit URLs with query strings, some will. Those 
that will generally put a limit on the number of links they will visit 
of that nature on a particular site to prevent it from following 
possibly infinite combinations. As stated in 
http://www.google.com/webmasters/2.html#A1...

We are able to index dynamically generated pages. However, because our 
web crawler can easily overwhelm and crash sites serving dynamic 
content, we limit the amount of dynamic pages we index.

Nice the way they say it's to help you not them :).

As far as your tricky comment goes, dynamic content does not have to 
look like dynamic content. Check the archives for info on how to get rid 
of query strings from your site.

--
Stuart
--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


Re: [PHP] Googlebot

2004-01-28 Thread Brian V Bonini
On Wed, 2004-01-28 at 08:34, Jon Bennett wrote:
  is that it tends to avoid or downgrade URI's with parameters tacked 
  onto
  the end.
 
 Does that go for internal links in your site then ???
 
 news.php?start=10 etc ??? Could be tricky writing dyamic pages then.
 

mod_rewrite is your friend here. You can rewrite stuff like /foo/ /bar/
etc... to *.php?foo=barbar=foo.  Kind of a pain in the arse but if you
MUST have a spider following links then...


-- 
BrianGnuPG - KeyID: 0x04A4F0DC | URL: www.gfx-design.com/keys
  Key Server: pgp.mit.edu
==
gpg --keyserver pgp.mit.edu --recv-keys 04A4F0DC
GnuPG: http://gnupg.org
http://www.biglumber.com/x/web?qs=0x2C35011004A4F0DC
Linux Registered User #339825 at http://counter.li.org


signature.asc
Description: This is a digitally signed message part


Re: [PHP] Googlebot

2004-01-28 Thread Michael Mulligan
Googlebot visits my site occasionally and follows a lot of my PHP links 
with long query strings...

-Mike

__
Hi! I'm a .signature virus! Copy me into your ~/.signature to help me 
spread!
__

On Jan 28, 2004, at 8:34 AM, Jon Bennett wrote:

Does that go for internal links in your site then ???

news.php?start=10 etc ??? Could be tricky writing dyamic pages then.

Thanks,

Jon
--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


Re: [PHP] Googlebot

2004-01-28 Thread Mat Harris
On Wed, Jan 28, 2004 at 11:51:06 -0500, Michael Mulligan wrote:
 Googlebot visits my site occasionally and follows a lot of my PHP links 
 with long query strings...

same here, google has been indexing my php site for over 3 years now :)

-- 
A Pope has a Water Cannon.   It is a Water Cannon.
He fires Holy-Water from it.It is a Holy-Water Cannon.
He Blesses it. It is a Holy Holy-Water Cannon.
He Blesses the Hell out of it.  It is a Wholly Holy Holy-Water Cannon.
He has it pierced.It is a Holey Wholly Holy Holy-Water Cannon.
He makes it official.   It is a Canon Holey Wholly Holy Holy-Water Cannon.

Yes, of course it's the right cabl [le0: NO CARRIER]


pgp0.pgp
Description: PGP signature


[PHP] Googlebot

2004-01-27 Thread Hartley, Matt
Is Googlebot (or any other bot) able to follow links that are php?

e.g.
a href=Contact.phpContact Us/a
a href=Product.phpimg src=Images/product.jpg width=110 height=111
border=0/a

Is there a way to invite bots to your site?

Thanks
Matt


Re: [PHP] Googlebot

2004-01-27 Thread Stuart
Hartley, Matt wrote:
Is Googlebot (or any other bot) able to follow links that are php?

e.g.
a href=Contact.phpContact Us/a
a href=Product.phpimg src=Images/product.jpg width=110 height=111
border=0/a
Yes.

Is there a way to invite bots to your site?
For most, yes.

--
Stuart
--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


Re: [PHP] Googlebot

2004-01-27 Thread Raditha Dissanayake
If you read the pages on google you will know the answer.

Hartley, Matt wrote:

Is Googlebot (or any other bot) able to follow links that are php?

e.g.
a href=Contact.phpContact Us/a
a href=Product.phpimg src=Images/product.jpg width=110 height=111
border=0/a
Is there a way to invite bots to your site?

Thanks
Matt
 



--
Raditha Dissanayake.

http://www.radinks.com/sftp/ | http://www.raditha.com/megaupload
Lean and mean Secure FTP applet with | Mega Upload - PHP file uploader
Graphical User Inteface. Just 150 KB | with progress bar.
--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


Re: [PHP] Googlebot

2004-01-27 Thread Frank Tudor
That's not very helpful.

I am also curious about this.

Anyone have a more thought provoking answer?

Frank

--- Raditha Dissanayake [EMAIL PROTECTED] wrote:
 If you read the pages on google you will know the answer.
 
 Hartley, Matt wrote:
 
 Is Googlebot (or any other bot) able to follow links that are
 php?
 
 e.g.
 a href=Contact.phpContact Us/a
 a href=Product.phpimg src=Images/product.jpg
 width=110 height=111
 border=0/a
 
 Is there a way to invite bots to your site?
 
 Thanks
 Matt
 
   
 
 
 
 -- 
 Raditha Dissanayake.


 http://www.radinks.com/sftp/ |
 http://www.raditha.com/megaupload
 Lean and mean Secure FTP applet with | Mega Upload - PHP file
 uploader
 Graphical User Inteface. Just 150 KB | with progress bar.
 
 -- 
 PHP General Mailing List (http://www.php.net/)
 To unsubscribe, visit: http://www.php.net/unsub.php
 


__
Do you Yahoo!?
Yahoo! SiteBuilder - Free web site building tool. Try it!
http://webhosting.yahoo.com/ps/sb/

-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php



Re: [PHP] Googlebot

2004-01-27 Thread Ben Ramsey
Perhaps this link will help:
http://www.google.com/webmasters/
Frank Tudor wrote:

That's not very helpful.

I am also curious about this.

Anyone have a more thought provoking answer?

Frank

--- Raditha Dissanayake [EMAIL PROTECTED] wrote:

If you read the pages on google you will know the answer.

Hartley, Matt wrote:


Is Googlebot (or any other bot) able to follow links that are
php?

e.g.
a href=Contact.phpContact Us/a
a href=Product.phpimg src=Images/product.jpg
width=110 height=111

border=0/a

Is there a way to invite bots to your site?

Thanks
Matt




--
Raditha Dissanayake.


http://www.radinks.com/sftp/ |
http://www.raditha.com/megaupload
Lean and mean Secure FTP applet with | Mega Upload - PHP file
uploader
Graphical User Inteface. Just 150 KB | with progress bar.
--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


__
Do you Yahoo!?
Yahoo! SiteBuilder - Free web site building tool. Try it!
http://webhosting.yahoo.com/ps/sb/
--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


Re: [PHP] Googlebot

2004-01-27 Thread Stuart
Frank Tudor wrote:
Anyone have a more thought provoking answer?
Do some minimal research (Google is good for this) and you would find 
the answers yourself.

Hartley, Matt wrote:
Is Googlebot (or any other bot) able to follow links that are
php?
Yes. Most spiders will crawl over most types of URL. As Raditha said, 
Google has several pages of information on this for it's crawling 
activities, and most other legit spiders have similar resources.

Is there a way to invite bots to your site?
Google has a page where you can submit your site for crawling. Some 
other spiders also have this, but it depends on the purpose of the 
spider and the philosophy it follows.

There are many, many, many, many, many, ..., many, many resources on 
search engine optimisation all over the web. Use them. Find them with... 
you guessed it... Google!

--
Stuart
--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


Re: [PHP] Googlebot

2004-01-27 Thread Raditha Dissanayake
As others have already pointed out to you, what better place to find 
information about google than google?

Frank Tudor wrote:

That's not very helpful.

I am also curious about this.

Anyone have a more thought provoking answer?

Frank

--- Raditha Dissanayake [EMAIL PROTECTED] wrote:
 

If you read the pages on google you will know the answer.

Hartley, Matt wrote:

   

Is Googlebot (or any other bot) able to follow links that are
 

php?
   

e.g.
a href=Contact.phpContact Us/a
a href=Product.phpimg src=Images/product.jpg
 

width=110 height=111
   

border=0/a

Is there a way to invite bots to your site?

Thanks
Matt


 

--
Raditha Dissanayake.
   


 

http://www.radinks.com/sftp/ |
http://www.raditha.com/megaupload
Lean and mean Secure FTP applet with | Mega Upload - PHP file
uploader
Graphical User Inteface. Just 150 KB | with progress bar.
--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php
   



__
Do you Yahoo!?
Yahoo! SiteBuilder - Free web site building tool. Try it!
http://webhosting.yahoo.com/ps/sb/
 



--
Raditha Dissanayake.

http://www.radinks.com/sftp/ | http://www.raditha.com/megaupload
Lean and mean Secure FTP applet with | Mega Upload - PHP file uploader
Graphical User Inteface. Just 150 KB | with progress bar.
--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


Re: [PHP] Googlebot

2004-01-27 Thread Mike Migurski
Is Googlebot (or any other bot) able to follow links that are php?

As others have already pointed out to you, what better place to find
information about google than google?

That, or your server logs - googlebot is great about identifying itself.

RTFM's aside, a little understanding of how HTTP request-response loop
goes a long way. There's no reason why google (or any other bot) shouldn't
be capable of making a GET request for a *.php resource. My understanding
is that it tends to avoid or downgrade URI's with parameters tacked onto
the end.

-
michal migurski- contact info and pgp key:
sf/cahttp://mike.teczno.com/contact.html

-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php