Hi,

I'm running a fairly large website (10,000 news items). Initially it
was made in ASP with MSSQL, then I took the project over and ported it
to PHP and MYSQL. Finally, 6 months ago I ported it to Django and
MySQL.

Now, ever since the site has been running on Django, I've received
about a dozen error emails every couple of days (once I even received
400 overnight!). Those errors are systematically caused by web
crawlers (yahoo slurp, googlebot, msn, yeti, etc.). It systematically
chokes on the same line of code, which is loading some data from file
caching. The traceback is pretty much always as follows:

 File "/MYPATH/apps/news/templatetags/news_tags.py", line 13, in
show_sidebar
   cached_sidebar = cache.get('the_sidebar')

 File "/MYPATH/django/core/cache/backends/filebased.py", line 50, in
get
   return pickle.load(f)

UnicodeDecodeError: 'utf8' codec can't decode byte 0x8a in position
5999: unexpected code byte

The actual utf-8 character varies each time.

I spent a lot of time cleaning up, reorganising and improving the
code... in vain. Now I strongly suspect it might be because of the
data being corrupted in some way.

Now, what puzzles me is that all the URLs which fail with web
crawlers, actually work perfectly well when I simply open them in a
browser.

The code has always followed a recent trunk of Django and now runs on
1.0. I have already raised that issue in this mailing list a couple of
times in the past, but I didn't get much help. I haven't opened a
ticket because I cannot reproduce the error myself (it only happens
with web crawlers) and because I suspect it might be because of my
setup (no other site that I have and use file caching have this
problem).

I could also say that the site was originally running with mod_python,
then with mod_wsgi, and has even moved servers. After all these
changes the problem is still there, to it seems to be independent from
server configuration.

So, any hint to debug this would be very much appreciated. If you
think this is worth filing a ticket, I'd also appreciate any hint on
how to phrase this problem correctly.

Thanks a lot for your help.

Julien
--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"Django users" group.
To post to this group, send email to django-users@googlegroups.com
To unsubscribe from this group, send email to [EMAIL PROTECTED]
For more options, visit this group at 
http://groups.google.com/group/django-users?hl=en
-~----------~----~----~----~------~----~------~--~---

Reply via email to