Dear Readers,

CPS_RSS-1.0.0
CPS-3.4.6
feedparser.py  3.2 (default) & 4.1 (latest)

Refreshing the following Japanese feed in the RSS Tool:

http://blogs.dion.ne.jp/sanskrit/index.rdf

results in:

UnicodeEncodeError: 'latin-1' codec can't encode characters ...

[see the full error log attached as cps-latin-error.log]
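
As a rough illustration of what the error message appears to mean, the
sketch below encodes a short page containing Japanese text with the
ISO-8859-1 (Latin-1) codec and reports which characters fail. It is
written in Python 3 syntax (the site itself runs Python 2.4.4), the
sample string is made up rather than taken from the feed, and
find_unencodable is a hypothetical helper, not part of Zope or CPS.

```python
# Python 3 sketch. The sample string is invented for illustration; it is
# not taken from the feed itself.
page = "<title>" + "サンスクリット" + "</title>"


def find_unencodable(text, charset):
    """Return (start, end, chars) for the first run of characters that
    cannot be encoded in charset, or None if the whole text encodes."""
    try:
        text.encode(charset)
    except UnicodeEncodeError as err:
        # err.end is exclusive, whereas the error message prints end - 1,
        # so "position 24927-24933" would describe a run of 7 characters.
        return err.start, err.end, text[err.start:err.end]
    return None


latin1_problem = find_unencodable(page, "iso-8859-1")
utf8_problem = find_unencodable(page, "utf-8")
print(latin1_problem)
print(utf8_problem)
```

The same text encodes without complaint as UTF-8, which suggests the
failure lies in the charset chosen for the response rather than in the
feed content.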

Subsequently the RSS Tool becomes completely unusable. (The only way I
could manage to get the RSS Tool running again was to reinstall the
whole CPS site from backup.)

The problematic feed should render as follows (using SPIP):

Indica et Buddhica - Tabulae :: Kataoka, Kei
http://tabulae.indica-et-buddhica.org/rubrique.php3?id_rubrique=261

I'm not sure whether this issue with Japanese characters is related to
the incorrect rendering of Latin diacritics in the following feed. Many
of them are commonly used in Romanised Sanskrit transliteration, e.g.,
a, u and i macron, S acute, n under-dot, &c.:

http://www.informaworld.com/ampp/rss~content=t713405669

Incorrect (using CPS RSS):

Indica et Buddhica - Recently Published issues of Asian Philosophy
http://indica-et-buddhica.org/sections/tabulae/periodica/a/asian-philosophy/asp-recently-published

Correct (using SPIP):

Indica et Buddhica - Tabulae :: Asian Philosophy - Recently Published
http://tabulae.indica-et-buddhica.org/rubrique.php3?id_rubrique=238
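
A small check, again only a sketch in Python 3 syntax, of whether the
diacritics listed above can be represented in ISO-8859-1, the charset
that the attached log shows as management_page_charset. The choice of
code points is an assumption based on that list, and in_latin1 is a
hypothetical helper. The result does not prove that the two problems
share a cause; it only shows that these characters, like the Japanese
ones, fall outside Latin-1.

```python
import unicodedata

# Assumed code points for: a, u and i macron, S acute, n under-dot.
samples = ["\u0101", "\u016b", "\u012b", "\u015a", "\u1e47"]


def in_latin1(ch):
    """Return True if the character can be encoded as ISO-8859-1."""
    try:
        ch.encode("iso-8859-1")
        return True
    except UnicodeEncodeError:
        return False


# Pair each character's Unicode name with the result of the check.
report = [(unicodedata.name(ch), in_latin1(ch)) for ch in samples]
for name, ok in report:
    print(name, ok)
```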


I'd be very happy to receive any thoughts on how these issues might be
resolved.


Kind regards,

 Richard MAHONEY



-- 
Richard MAHONEY | internet: http://indica-et-buddhica.org/
Littledene      | telephone/telefax (man.): +64 3 312 1699
Bay Road        | cellular: +64 27 482 9986
OXFORD, NZ      | email: [EMAIL PROTECTED]
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Indica et Buddhica: Materials for Indology and Buddhology
Scholia: http://scholia.indica-et-buddhica.org/
Tabulae: http://tabulae.indica-et-buddhica.org/
*************************

Traceback (innermost last):
  Module ZPublisher.Publish, line 118, in publish
  Module ZServer.HTTPResponse, line 262, in setBody
  Module ZPublisher.HTTPResponse, line 313, in setBody
  Module ZPublisher.HTTPResponse, line 454, in _encode_unicode
UnicodeEncodeError: 'latin-1' codec can't encode characters in position 24927-24933: ordinal not in range(256)

*************************

Exception traceback

Time 	2008/04/20 22:00:10.795 GMT
User Name (User Id) 	rmahoney (rmahoney)
Request URL 	https://indica-et-buddhica.org/portal_rss/manage_main
Exception Type 	UnicodeEncodeError
Exception Value 	'latin-1' codec can't encode characters in position 24927-24933: ordinal not in range(256)

REQUEST
form
-C	''
cookies
__utmz	'119353809.1208722629.1.1.utmcsr=(direct)|utmccn=(direct)|utmcmd=(none)'
tree-s	'eJzT0MgpMOQKVneEArcKV1t1rgIjrsSSAmMuPQB7tAe3'
__ac_name	'rmahoney'
cpsskins_view_mode	'eyJ0aGVtZXNfcGFuZWwiOiAid3lzaXd5ZyIsICJ0aGVtZSI6ICJkZWZhdWx0IiwgInBvcnRsZXRzX292ZXJyaWRlIjogIjEiLCAicG9ydGxldHNfcGFuZWwiOiAic2l0ZV9zdHJ1Y3R1cmUiLCAiY3VycmVudF91cmwiOiAiaHR0cHM6Ly9pbmRpY2EtZXQtYnVkZGhpY2Eub3JnL2xvZ2luX2Zvcm0ifQ=='
__utma	'119353809.1305615612.1208722629.1208722629.1208722629.1'
__utmb	'119353809.271.10.1208722629273'
__utmc	'119353809'
_ZopeId	'98336331A3VAwZI4jVU'
lazy items
SESSION	<bound method SessionDataManager.getSessionData of <SessionDataManager at /session_data_manager>>
other
VIRTUAL_URL_PARTS	('https://indica-et-buddhica.org', 'portal_rss/manage_main')
n_	9
VIRTUAL_URL	'https://indica-et-buddhica.org/portal_rss/manage_main'
management_page_charset	'iso-8859-1'
URL2	'https://indica-et-buddhica.org'
AcceptCharset	<Products.Localizer.Accept.AcceptCharset instance at 0xe83c66c>
AUTHENTICATION_PATH	'cps/virtual_hosting'
skey	'id'
AUTHENTICATED_USER	<User 'rmahoney'>
USER_PREF_LANGUAGES	<Products.Localizer.Accept.AcceptLanguage instance at 0xc77508c>
SERVER_URL	'https://indica-et-buddhica.org'
traverse_subpath	[]
ACTUAL_URL	'https://indica-et-buddhica.org/portal_rss/manage_main'
a_	0
URL	'https://indica-et-buddhica.org/portal_rss/manage_main'
rkey	''
PUBLISHED	<App.special_dtml.DTMLFile object at 0x8472a0c>
TraversalRequestNameStack	[]
VirtualRootPhysicalPath	('', 'cps')
BASE1	'https://indica-et-buddhica.org'
BASE2	'https://indica-et-buddhica.org/portal_rss'
BASE3	'https://indica-et-buddhica.org/portal_rss/manage_main'
BASEPATH1	''
AcceptLanguage	<Products.Localizer.Accept.AcceptLanguage instance at 0xc77508c>
URL1	'https://indica-et-buddhica.org/portal_rss'
URL0	https://indica-et-buddhica.org/portal_rss/manage_main
URL1	https://indica-et-buddhica.org/portal_rss
URL2	https://indica-et-buddhica.org
BASE0	https://indica-et-buddhica.org
BASE1	https://indica-et-buddhica.org
BASE2	https://indica-et-buddhica.org/portal_rss
BASE3	https://indica-et-buddhica.org/portal_rss/manage_main
environ
HTTP_X_FORWARDED_SERVER	'indica-et-buddhica.org'
HTTP_REFERER	'https://indica-et-buddhica.org/manage_main'
HTTP_ACCEPT_LANGUAGE	'en-us,en;q=0.5'
SERVER_SOFTWARE	'Zope/(Zope 2.9.8-final, python 2.4.4, sunos5) ZServer/1.1 CPS/3.4'
SCRIPT_NAME	''
REQUEST_METHOD	'GET'
PATH_INFO	'/VirtualHostBase/https/indica-et-buddhica.org:443/cps/VirtualHostRoot/portal_rss/manage_main'
SERVER_PROTOCOL	'HTTP/1.1'
channel.creation_time	1208728810
CONNECTION_TYPE	'Keep-Alive'
HTTP_ACCEPT_CHARSET	'ISO-8859-1,utf-8;q=0.7,*;q=0.7'
HTTP_USER_AGENT	'Mozilla/5.0 (X11; U; SunOS i86pc; en-US; rv:1.8.1.12) Gecko/20080210 Firefox/2.0.0.12'
HTTP_COOKIE	'__utma=119353809.1305615612.1208722629.1208722629.1208722629.1; __utmb=119353809.271.10.1208722629273; __utmc=119353809; __utmz=119353809.1208722629.1.1.utmcsr=(direct)|utmccn=(direct)|utmcmd=(none); _ZopeId="98336331A3VAwZI4jVU"; __ac_name="rmahoney"; cpsskins_view_mode="eyJ0aGVtZXNfcGFuZWwiOiAid3lzaXd5ZyIsICJ0aGVtZSI6ICJkZWZhdWx0IiwgInBvcnRsZXRzX292ZXJyaWRlIjogIjEiLCAicG9ydGxldHNfcGFuZWwiOiAic2l0ZV9zdHJ1Y3R1cmUiLCAiY3VycmVudF91cmwiOiAiaHR0cHM6Ly9pbmRpY2EtZXQtYnVkZGhpY2Eub3JnL2xvZ2luX2Zvcm0ifQ=="; __ac="cm1haG9uZXk6c3RldzcxMDY%3D"; tree-s="eJzT0MgpMOQKVneEArcKV1t1rgIjrsSSAmMuPQB7tAe3"'
SERVER_NAME	'localhost'
REMOTE_ADDR	'127.0.0.1'
PATH_TRANSLATED	'/VirtualHostBase/https/indica-et-buddhica.org:443/cps/VirtualHostRoot/portal_rss/manage_main'
SERVER_PORT	'8105'
HTTP_HOST	'localhost:8105'
HTTP_ACCEPT	'text/xml,application/xml,application/xhtml+xml,text/html;q=0.9,text/plain;q=0.8,image/png,*/*;q=0.5'
GATEWAY_INTERFACE	'CGI/1.1'
HTTP_X_FORWARDED_FOR	'210.48.84.26'
HTTP_X_FORWARDED_HOST	'indica-et-buddhica.org'
HTTP_ACCEPT_ENCODING	'gzip,deflate'


*************************
_______________________________________________
cps-users mailing list
[email protected]
http://lists.nuxeo.com/mailman/listinfo/cps-users
