RE: SEO and servlets

2002-12-07 Thread neal
Those are both very good points I hadn't considered - thank you!

:)

Neal

-Original Message-
From: Craig R. McClanahan [mailto:[EMAIL PROTECTED]]
Sent: Friday, December 06, 2002 6:35 PM
To: Tomcat Users List
Subject: RE: SEO and servlets




On Fri, 6 Dec 2002, Cox, Charlie wrote:

 Date: Fri, 6 Dec 2002 08:13:20 -0500
 From: Cox, Charlie [EMAIL PROTECTED]
 Reply-To: Tomcat Users List [EMAIL PROTECTED]
 To: 'Tomcat Users List' [EMAIL PROTECTED]
 Subject: RE: SEO  and servlets

 I do a silimar thing where all my content is in a database as XML, so that
 the users can update their content when *they* need to(not when I get
around
 to it).  Then the html is created using xsl when requested.

 The way that I set this up is that I map a directory to each servlet (or
 sometimes multiple directories to a servlet for different functionality
for
 another set of pages) and then use real page names. These appear to be
 static html pages. This allows the pagename to be a prarm for the servlet.


A popular technique for supporting this sort of thing is to use path
mapping with the wild card pattern.  Consider a servlet that is mapped to
/display/*, and a URL like this:

  http://www.mycompany.com/myapp/display/foo.html

The /foo.html part of that URL shows up in your servlet as the extra
path info, and can be used to go look up the database information and
perform the translation.

Your application understands that you're executing a servlet that is
creating (potentially) dynamic content.  Yet, to users and search engines,
this appears to be a URL of a static HTML page in the display
subdirectory.

Is it live or is it Memorex? was a marketing slogan for a brand of
audiotapes a few years ago.  Only your app server knows for sure ... :-)

 I also do not have many forms(nor do I use struts,etc) so that is not a
 problem for me.


This kind of dynamic content (where it changes occasionally but not often)
is reasonable to index with a search engine.  One thing you'll want to
make sure you do is include a Date header (containing some
representation of when the underlying database was last updated) in the
response -- that way, browsers can also cache the rendered page and only
update when the underlying data changes.

You'll also want to implement the getLastModified() method in your
servlet for this to work.

 Charlie


Craig



--
To unsubscribe, e-mail:
mailto:[EMAIL PROTECTED]
For additional commands, e-mail:
mailto:[EMAIL PROTECTED]


--
To unsubscribe, e-mail:   mailto:[EMAIL PROTECTED]
For additional commands, e-mail: mailto:[EMAIL PROTECTED]




RE: SEO and servlets

2002-12-06 Thread neal
Craig,

Thanks for the reponse.  I know what you're saying that you wouldn't want to
submit most web app pages, presuming that most of it is form data ... but in
this case I have a bit of content that was placed into dynamic pages because
(a) There is a common naivgation framework that is essentially included, (b)
in the case of the main page there is some 'weekly specials' data that gets
inserted into the content.  The framework I developed presumes XsLT for the
template rather than jsp, which means that I must use a servlet (in this
case with my own extension), and thus the concern.

So that's why I was wondering. I knew about the other issues such s have
params beyond the question mark (they won't be picked up)... again another
reason to use a servlet wherein *.abc would invoke the abc serlet and *
could introspectively obtained as a param .. I thought that might be a way
around it ... but I'm beginning to think perhaps not.  :(

Anyway, thanks for your thoughts.


-Original Message-
From: Craig R. McClanahan [mailto:[EMAIL PROTECTED]]
Sent: Thursday, December 05, 2002 6:55 PM
To: Tomcat Users List
Subject: Re: SEO and servlets




On Wed, 4 Dec 2002, neal wrote:

 Date: Wed, 4 Dec 2002 22:06:53 -0800
 From: neal [EMAIL PROTECTED]
 Reply-To: Tomcat Users List [EMAIL PROTECTED]
 To: Tomcat Users List [EMAIL PROTECTED]
 Subject: SEO  and servlets

 I was reading on Google the other day that it supports many of the common
 file types including JSP ... but this led me to wonder if they index
 servlets without file extensions, or how about common frameworks such as
 Struts with the DO extensions.

 Does anyone know how these file types index with Google and other major
 engines?


Different search engines follow different policies -- you'd have to ask
them how their spiders are programmed.  Most of them, probably, omit at
least some of the following types of URLs:
* URLs protected by an authentication constraint
* URLs matching patterns listed in the deny list of your
  robots.txt file
* URLs that have query parameters in them
* URLs that have no-cache headers in the returned content

Personally, I think it's somewhere between misleading and silly to index
pages from a web *applications* (as opposed to web sites)  based on an MVC
framework (like Struts).  Why?  Because the URL that a search engine
spider would submit doesn't necessarily have *anything* to do with the
output that gets rendered.

Consider a very common case where you have a form submit that goes to a
URL ending in .../saveCustomer.do in a Struts app.  If you've made any
errors that get caught by the validation rules, the original input form is
redisplayed.  On the other hand, if you did everything correctly, the next
page in your app's user interaction is displayed (probably a menu or
something).  But the URL is the same!

Which kind of output should a search engine index?

Web applications != Web sites

 Thanks.
 Neal


Craig



--
To unsubscribe, e-mail:
mailto:[EMAIL PROTECTED]
For additional commands, e-mail:
mailto:[EMAIL PROTECTED]


--
To unsubscribe, e-mail:   mailto:[EMAIL PROTECTED]
For additional commands, e-mail: mailto:[EMAIL PROTECTED]




RE: SEO and servlets

2002-12-06 Thread Cox, Charlie
I do a silimar thing where all my content is in a database as XML, so that
the users can update their content when *they* need to(not when I get around
to it).  Then the html is created using xsl when requested.

The way that I set this up is that I map a directory to each servlet (or
sometimes multiple directories to a servlet for different functionality for
another set of pages) and then use real page names. These appear to be
static html pages. This allows the pagename to be a prarm for the servlet. 

I also do not have many forms(nor do I use struts,etc) so that is not a
problem for me.

Charlie

 -Original Message-
 From: neal [mailto:[EMAIL PROTECTED]]
 Sent: Friday, December 06, 2002 3:49 AM
 To: Tomcat Users List
 Subject: RE: SEO and servlets
 
 
 Craig,
 
 Thanks for the reponse.  I know what you're saying that you 
 wouldn't want to
 submit most web app pages, presuming that most of it is form 
 data ... but in
 this case I have a bit of content that was placed into 
 dynamic pages because
 (a) There is a common naivgation framework that is 
 essentially included, (b)
 in the case of the main page there is some 'weekly specials' 
 data that gets
 inserted into the content.  The framework I developed 
 presumes XsLT for the
 template rather than jsp, which means that I must use a 
 servlet (in this
 case with my own extension), and thus the concern.
 
 So that's why I was wondering. I knew about the other issues 
 such s have
 params beyond the question mark (they won't be picked up)... 
 again another
 reason to use a servlet wherein *.abc would invoke the abc 
 serlet and *
 could introspectively obtained as a param .. I thought that 
 might be a way
 around it ... but I'm beginning to think perhaps not.  :(
 
 Anyway, thanks for your thoughts.
 
 
 -Original Message-
 From: Craig R. McClanahan [mailto:[EMAIL PROTECTED]]
 Sent: Thursday, December 05, 2002 6:55 PM
 To: Tomcat Users List
 Subject: Re: SEO and servlets
 
 
 
 
 On Wed, 4 Dec 2002, neal wrote:
 
  Date: Wed, 4 Dec 2002 22:06:53 -0800
  From: neal [EMAIL PROTECTED]
  Reply-To: Tomcat Users List [EMAIL PROTECTED]
  To: Tomcat Users List [EMAIL PROTECTED]
  Subject: SEO  and servlets
 
  I was reading on Google the other day that it supports 
 many of the common
  file types including JSP ... but this led me to wonder if 
 they index
  servlets without file extensions, or how about common 
 frameworks such as
  Struts with the DO extensions.
 
  Does anyone know how these file types index with Google and 
 other major
  engines?
 
 
 Different search engines follow different policies -- you'd 
 have to ask
 them how their spiders are programmed.  Most of them, 
 probably, omit at
 least some of the following types of URLs:
 * URLs protected by an authentication constraint
 * URLs matching patterns listed in the deny list of your
   robots.txt file
 * URLs that have query parameters in them
 * URLs that have no-cache headers in the returned content
 
 Personally, I think it's somewhere between misleading and 
 silly to index
 pages from a web *applications* (as opposed to web sites)  
 based on an MVC
 framework (like Struts).  Why?  Because the URL that a search engine
 spider would submit doesn't necessarily have *anything* to do with the
 output that gets rendered.
 
 Consider a very common case where you have a form submit that 
 goes to a
 URL ending in .../saveCustomer.do in a Struts app.  If 
 you've made any
 errors that get caught by the validation rules, the original 
 input form is
 redisplayed.  On the other hand, if you did everything 
 correctly, the next
 page in your app's user interaction is displayed (probably a menu or
 something).  But the URL is the same!
 
 Which kind of output should a search engine index?
 
 Web applications != Web sites
 
  Thanks.
  Neal
 
 
 Craig
 
 
 
 --
 To unsubscribe, e-mail:
 mailto:[EMAIL PROTECTED]
 For additional commands, e-mail:
 mailto:[EMAIL PROTECTED]
 
 
 --
 To unsubscribe, e-mail:   
mailto:[EMAIL PROTECTED]
For additional commands, e-mail:
mailto:[EMAIL PROTECTED]

--
To unsubscribe, e-mail:   mailto:[EMAIL PROTECTED]
For additional commands, e-mail: mailto:[EMAIL PROTECTED]




RE: SEO and servlets

2002-12-06 Thread Craig R. McClanahan


On Fri, 6 Dec 2002, Cox, Charlie wrote:

 Date: Fri, 6 Dec 2002 08:13:20 -0500
 From: Cox, Charlie [EMAIL PROTECTED]
 Reply-To: Tomcat Users List [EMAIL PROTECTED]
 To: 'Tomcat Users List' [EMAIL PROTECTED]
 Subject: RE: SEO  and servlets

 I do a silimar thing where all my content is in a database as XML, so that
 the users can update their content when *they* need to(not when I get around
 to it).  Then the html is created using xsl when requested.

 The way that I set this up is that I map a directory to each servlet (or
 sometimes multiple directories to a servlet for different functionality for
 another set of pages) and then use real page names. These appear to be
 static html pages. This allows the pagename to be a prarm for the servlet.


A popular technique for supporting this sort of thing is to use path
mapping with the wild card pattern.  Consider a servlet that is mapped to
/display/*, and a URL like this:

  http://www.mycompany.com/myapp/display/foo.html

The /foo.html part of that URL shows up in your servlet as the extra
path info, and can be used to go look up the database information and
perform the translation.

Your application understands that you're executing a servlet that is
creating (potentially) dynamic content.  Yet, to users and search engines,
this appears to be a URL of a static HTML page in the display
subdirectory.

Is it live or is it Memorex? was a marketing slogan for a brand of
audiotapes a few years ago.  Only your app server knows for sure ... :-)

 I also do not have many forms(nor do I use struts,etc) so that is not a
 problem for me.


This kind of dynamic content (where it changes occasionally but not often)
is reasonable to index with a search engine.  One thing you'll want to
make sure you do is include a Date header (containing some
representation of when the underlying database was last updated) in the
response -- that way, browsers can also cache the rendered page and only
update when the underlying data changes.

You'll also want to implement the getLastModified() method in your
servlet for this to work.

 Charlie


Craig



--
To unsubscribe, e-mail:   mailto:[EMAIL PROTECTED]
For additional commands, e-mail: mailto:[EMAIL PROTECTED]




Re: SEO and servlets

2002-12-05 Thread Craig R. McClanahan


On Wed, 4 Dec 2002, neal wrote:

 Date: Wed, 4 Dec 2002 22:06:53 -0800
 From: neal [EMAIL PROTECTED]
 Reply-To: Tomcat Users List [EMAIL PROTECTED]
 To: Tomcat Users List [EMAIL PROTECTED]
 Subject: SEO  and servlets

 I was reading on Google the other day that it supports many of the common
 file types including JSP ... but this led me to wonder if they index
 servlets without file extensions, or how about common frameworks such as
 Struts with the DO extensions.

 Does anyone know how these file types index with Google and other major
 engines?


Different search engines follow different policies -- you'd have to ask
them how their spiders are programmed.  Most of them, probably, omit at
least some of the following types of URLs:
* URLs protected by an authentication constraint
* URLs matching patterns listed in the deny list of your
  robots.txt file
* URLs that have query parameters in them
* URLs that have no-cache headers in the returned content

Personally, I think it's somewhere between misleading and silly to index
pages from a web *applications* (as opposed to web sites)  based on an MVC
framework (like Struts).  Why?  Because the URL that a search engine
spider would submit doesn't necessarily have *anything* to do with the
output that gets rendered.

Consider a very common case where you have a form submit that goes to a
URL ending in .../saveCustomer.do in a Struts app.  If you've made any
errors that get caught by the validation rules, the original input form is
redisplayed.  On the other hand, if you did everything correctly, the next
page in your app's user interaction is displayed (probably a menu or
something).  But the URL is the same!

Which kind of output should a search engine index?

Web applications != Web sites

 Thanks.
 Neal


Craig



--
To unsubscribe, e-mail:   mailto:[EMAIL PROTECTED]
For additional commands, e-mail: mailto:[EMAIL PROTECTED]




SEO and servlets

2002-12-04 Thread neal
I was reading on Google the other day that it supports many of the common
file types including JSP ... but this led me to wonder if they index
servlets without file extensions, or how about common frameworks such as
Struts with the DO extensions.

Does anyone know how these file types index with Google and other major
engines?

Thanks.
Neal


--
To unsubscribe, e-mail:   mailto:[EMAIL PROTECTED]
For additional commands, e-mail: mailto:[EMAIL PROTECTED]