Please advise on my approach to letting Googlebot index my GWT site:
http://www.SudokuComplete.com/

First, I needed to create a static, flattened, HTML-only view of my
website.  To do this, I wrote a simple .NET program that uses the IE
WebBrowser control to navigate to my GWT site.  The program takes a
URL with a history token, lets the browser run through the GWT
JavaScript, and then saves a copy of the resulting DOM HTML to a file
whose name is based on the history token.  This file is essentially an
HTML-only view of the site as of that history token.  I use the program
to save static versions of each of the major history tokens on the
site.  These files are the ones I want Googlebot to index.  To
provide navigation for Googlebot, I append links to all of the other
files at the end of each one.
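
To make the naming and linking step concrete, here is a minimal sketch
in Java (not my actual .NET code).  The class name SnapshotFiles, the
rule for turning a token into a file name, and the list markup are all
assumptions made for illustration.  Note that this naming rule can
map two different tokens to the same file name, so a real version
would need to check for collisions.

```java
import java.util.List;

// Hypothetical helper: maps history tokens to snapshot file names and
// builds the block of plain links appended to the end of every snapshot.
final class SnapshotFiles {

    // Assumed rule: keep letters, digits, '-' and '_'; replace anything
    // else with '_'. An empty token maps to "index.html".
    static String fileNameFor(String historyToken) {
        String safe = historyToken.replaceAll("[^A-Za-z0-9_-]", "_");
        return (safe.isEmpty() ? "index" : safe) + ".html";
    }

    // One plain <a href> per snapshot, so a crawler can reach every page
    // without running any JavaScript.
    static String navigationLinks(List<String> historyTokens) {
        StringBuilder html = new StringBuilder("<ul>\n");
        for (String token : historyTokens) {
            html.append("  <li><a href=\"")
                .append(fileNameFor(token))
                .append("\">")
                .append(escapeHtml(token))
                .append("</a></li>\n");
        }
        return html.append("</ul>\n").toString();
    }

    // Minimal escaping for text placed inside an HTML element.
    static String escapeHtml(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }
}
```

The output of navigationLinks would be pasted just before the closing
body tag of each saved file.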

Next, I created my "UserBotRouter", a .NET HttpModule (analogous to a
J2EE Web Filter) that analyzes the incoming requests to the website.
It checks the User-Agent header to see whether the request is being
made by a bot or a standard browser.  If it is a bot, the module routes
the request to the appropriate static HTML page (created in the first
step).  If, however, a standard (non-bot) browser requests one of the
static pages, the module sends an HTTP redirect that takes the user's
browser to the corresponding GWT page, including the respective history
token.  In this manner, I am able to route users to the GWT pages, and
bots to the static HTML pages.
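
The routing decision itself can be sketched independently of the
web framework.  The Java below is an illustration only, not my
HttpModule: the "/static/" prefix, the "index.html" default page, the
bot marker strings, and all class and method names are assumptions.
Because a browser never sends the "#token" part of a URL to the
server, the sketch can only map a bot's request for the host page to
one default snapshot; the other snapshots are reached through the
links inside the static files.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.Optional;

// Hypothetical routing rules for a bot/browser splitter.
final class UserBotRouting {
    static final String STATIC_PREFIX = "/static/"; // assumed snapshot folder
    static final String GWT_HOST_PAGE = "/";        // assumed GWT host page path

    // Assumed, incomplete list of lower-case User-Agent substrings.
    static final List<String> BOT_MARKERS =
            Arrays.asList("googlebot", "bingbot", "slurp");

    static boolean isBot(String userAgent) {
        if (userAgent == null) {
            return false;
        }
        String ua = userAgent.toLowerCase(Locale.ROOT);
        return BOT_MARKERS.stream().anyMatch(ua::contains);
    }

    // Bots asking for the GWT host page are served the default snapshot;
    // every other request is served from the path it asked for.
    static String pathToServe(String userAgent, String path) {
        if (isBot(userAgent) && path.equals(GWT_HOST_PAGE)) {
            return STATIC_PREFIX + "index.html";
        }
        return path;
    }

    // Browsers asking for a snapshot get a redirect to the GWT page with
    // the history token recovered from the file name; bots get none.
    static Optional<String> redirectFor(String userAgent, String path) {
        boolean isSnapshot = path.startsWith(STATIC_PREFIX) && path.endsWith(".html");
        if (isBot(userAgent) || !isSnapshot) {
            return Optional.empty();
        }
        String token = path.substring(
                STATIC_PREFIX.length(), path.length() - ".html".length());
        return Optional.of(GWT_HOST_PAGE + "#" + token);
    }
}
```

In a servlet filter or HttpModule, redirectFor would be checked first,
and pathToServe used only when it returns no redirect.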

Am I missing anything?  Does this sound like a workable approach?

Could GWT build something like this into its compiler?  That is, since
GWT already compiles different versions for the different browsers, why
not also create a set of standard "bot" pages?  In the module XML file,
the developer could specify the tokens for which GWT should create HTML
pages.  GWT would use an approach similar to mine above to create
static HTML files for those tokens.  Then, in these static HTML files,
some JavaScript could redirect the browser to the corresponding GWT
URL.  This JavaScript redirect would affect real users' browsers,
whereas bots would continue to read the page as-is, following links to
the other static pages.
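
As a rough illustration of that last idea, the generator could emit a
small script tag into each static page.  The Java sketch below is
hypothetical and is not an existing GWT feature; the class and method
names are made up, and it assumes the GWT URL and history token are
known when the page is generated.

```java
// Hypothetical generator for the redirect snippet placed in each static page.
final class BotPageRedirect {

    // Builds a script tag that sends a JavaScript-capable browser to the
    // GWT URL for the given history token. location.replace is used so the
    // static page does not stay in the browser's back-button history.
    static String redirectScript(String gwtUrl, String historyToken) {
        String target = gwtUrl + "#" + historyToken;
        return "<script type=\"text/javascript\">"
                + "window.location.replace(\"" + jsEscape(target) + "\");"
                + "</script>";
    }

    // Minimal escaping for a double-quoted JavaScript string inside HTML.
    static String jsEscape(String s) {
        return s.replace("\\", "\\\\")
                .replace("\"", "\\\"")
                .replace("<", "\\x3C");
    }
}
```

For example, redirectScript("http://www.SudokuComplete.com/", "help")
yields a script that navigates to http://www.SudokuComplete.com/#help.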

Please let me know what you think,

john...

