Simon:

A worthy project!

Would the solution you propose serve other search indexers as well, or just
Google?

I was also wondering what the effect of the solution you propose would be
on the cached versions of pages Google stores that are available via Google
search, e.g.:
http://webcache.googleusercontent.com/search?q=cache:ELwAKCibsysJ:www.sakaiproject.org/+&cd=1&hl=en&ct=clnk&gl=us

= nate

On Thu, Aug 2, 2012 at 11:03 AM, Simon Gaeremynck <[email protected]> wrote:

> Hi all,
>
> I've been working on KERN-3084 [1] which tries to add support for Google's
> AJAX crawler [2].
> When Google notices you're using AJAX/JavaScript to display content on
> your page, it sends a request to the server asking for a completely rendered
> page. The idea is that we then run the page through a headless browser and
> send that response back to Google.
>
> I've created an implementation [3] [4] that does this but I'd like some
> feedback before I send a PR.
> This commit would, much like the preview processor, bring in yet another
> dependency. I'm using PhantomJS [5] as it fires up a headless WebKit browser
> and exposes a nice little nodejs API that you can (ab)use.
> I tried using the same toolset as the preview processor (wkhtmltopdf) but
> that just seems to generate PDFs and doesn't allow access to the generated
> DOM?
> (PhantomJS supports PDF creation too, but it's nowhere near as good as
> wkhtmltopdf.)
>
>
> What's the feeling about this? Does anyone have a recommendation for a
> better tool/approach?
>
> Regards,
>
> Simon
>
>
>
> [1] https://jira.sakaiproject.org/browse/KERN-3084
> [2]
> https://developers.google.com/webmasters/ajax-crawling/docs/getting-started
> [3]
> https://github.com/simong/nakamura/commit/83212d6fe814ee32be7dd3d9cd771c40dff6f69f
> [4]
> https://confluence.sakaiproject.org/display/KERNDOC/KERN-3084+Making+OAE+indexable+by+Google
> [5] http://phantomjs.org/
>
> _______________________________________________
> oae-dev mailing list
> [email protected]
> http://collab.sakaiproject.org/mailman/listinfo/oae-dev
>
>
