Dear Readers,

CPS_RSS-1.0.0
CPS-3.4.6
feedparser.py 3.2 (default) & 4.1 (latest)

Refreshing the following Japanese feed in the RSS Tool:

http://blogs.dion.ne.jp/sanskrit/index.rdf

Results in:

UnicodeEncodeError: 'latin-1' codec can't encode characters ...

[see the full error log attached as cps-latin-error.log]

Subsequently the RSS Tool becomes completely unusable. (The only way I could manage to get the RSS Tool running again was to reinstall the whole CPS site from backup.)

The problematic feed should render as follows (using SPIP):

Indica et Buddhica - Tabulae :: Kataoka, Kei
http://tabulae.indica-et-buddhica.org/rubrique.php3?id_rubrique=261

I'm not sure if this issue with Japanese characters is related to the incorrect rendering of Latin diacritics with the following feed -- many commonly used in Romanised Sanskrit transliteration, e.g., a, u and i macron, S acute, n under-dot &c.:

http://www.informaworld.com/ampp/rss~content=t713405669

Incorrect (using CPS RSS):

Indica et Buddhica - Recently Published issues of Asian Philosophy
http://indica-et-buddhica.org/sections/tabulae/periodica/a/asian-philosophy/asp-recently-published

Correct (using SPIP):

Indica et Buddhica - Tabulae :: Asian Philosophy - Recently Published
http://tabulae.indica-et-buddhica.org/rubrique.php3?id_rubrique=238

I'd be very happy to receive any thoughts on how these issues might be resolved.

Kind regards,

Richard MAHONEY

-- 
Richard MAHONEY | internet: http://indica-et-buddhica.org/
Littledene      | telephone/telefax (man.): +64 3 312 1699
Bay Road        | cellular: +64 27 482 9986
OXFORD, NZ      | email: [EMAIL PROTECTED]
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Indica et Buddhica: Materials for Indology and Buddhology
Scholia: http://scholia.indica-et-buddhica.org/
Tabulae: http://tabulae.indica-et-buddhica.org/

*************************
Traceback (innermost last):
  Module ZPublisher.Publish, line 118, in publish
  Module ZServer.HTTPResponse, line 262, in setBody
  Module ZPublisher.HTTPResponse, line 313, in setBody
  Module ZPublisher.HTTPResponse, line 454, in _encode_unicode
UnicodeEncodeError: 'latin-1' codec can't encode characters in position 24927-24933: ordinal not in range(256)
*************************
Exception traceback

Time: 2008/04/20 22:00:10.795 GMT
User Name (User Id): rmahoney (rmahoney)
Request URL: https://indica-et-buddhica.org/portal_rss/manage_main
Exception Type: UnicodeEncodeError
Exception Value: 'latin-1' codec can't encode characters in position 24927-24933: ordinal not in range(256)

REQUEST

form
  -C ''

cookies
  __utmz '119353809.1208722629.1.1.utmcsr=(direct)|utmccn=(direct)|utmcmd=(none)'
  tree-s 'eJzT0MgpMOQKVneEArcKV1t1rgIjrsSSAmMuPQB7tAe3'
  __ac_name 'rmahoney'
  cpsskins_view_mode 'eyJ0aGVtZXNfcGFuZWwiOiAid3lzaXd5ZyIsICJ0aGVtZSI6ICJkZWZhdWx0IiwgInBvcnRsZXRzX292ZXJyaWRlIjogIjEiLCAicG9ydGxldHNfcGFuZWwiOiAic2l0ZV9zdHJ1Y3R1cmUiLCAiY3VycmVudF91cmwiOiAiaHR0cHM6Ly9pbmRpY2EtZXQtYnVkZGhpY2Eub3JnL2xvZ2luX2Zvcm0ifQ=='
  __utma '119353809.1305615612.1208722629.1208722629.1208722629.1'
  __utmb '119353809.271.10.1208722629273'
  __utmc '119353809'
  _ZopeId '98336331A3VAwZI4jVU'

lazy items
  SESSION <bound method SessionDataManager.getSessionData of <SessionDataManager at /session_data_manager>>

other
  VIRTUAL_URL_PARTS ('https://indica-et-buddhica.org', 'portal_rss/manage_main')
  n_ 9
  VIRTUAL_URL 'https://indica-et-buddhica.org/portal_rss/manage_main'
  management_page_charset 'iso-8859-1'
  URL2 'https://indica-et-buddhica.org'
  AcceptCharset <Products.Localizer.Accept.AcceptCharset instance at 0xe83c66c>
  AUTHENTICATION_PATH 'cps/virtual_hosting'
  skey 'id'
  AUTHENTICATED_USER <User 'rmahoney'>
  USER_PREF_LANGUAGES <Products.Localizer.Accept.AcceptLanguage instance at 0xc77508c>
  SERVER_URL 'https://indica-et-buddhica.org'
  traverse_subpath []
  ACTUAL_URL 'https://indica-et-buddhica.org/portal_rss/manage_main'
  a_ 0
  URL 'https://indica-et-buddhica.org/portal_rss/manage_main'
  rkey ''
  PUBLISHED <App.special_dtml.DTMLFile object at 0x8472a0c>
  TraversalRequestNameStack []
  VirtualRootPhysicalPath ('', 'cps')
  BASE1 'https://indica-et-buddhica.org'
  BASE2 'https://indica-et-buddhica.org/portal_rss'
  BASE3 'https://indica-et-buddhica.org/portal_rss/manage_main'
  BASEPATH1 ''
  AcceptLanguage <Products.Localizer.Accept.AcceptLanguage instance at 0xc77508c>
  URL1 'https://indica-et-buddhica.org/portal_rss'

  URL0 https://indica-et-buddhica.org/portal_rss/manage_main
  URL1 https://indica-et-buddhica.org/portal_rss
  URL2 https://indica-et-buddhica.org
  BASE0 https://indica-et-buddhica.org
  BASE1 https://indica-et-buddhica.org
  BASE2 https://indica-et-buddhica.org/portal_rss
  BASE3 https://indica-et-buddhica.org/portal_rss/manage_main

environ
  HTTP_X_FORWARDED_SERVER 'indica-et-buddhica.org'
  HTTP_REFERER 'https://indica-et-buddhica.org/manage_main'
  HTTP_ACCEPT_LANGUAGE 'en-us,en;q=0.5'
  SERVER_SOFTWARE 'Zope/(Zope 2.9.8-final, python 2.4.4, sunos5) ZServer/1.1 CPS/3.4'
  SCRIPT_NAME ''
  REQUEST_METHOD 'GET'
  PATH_INFO '/VirtualHostBase/https/indica-et-buddhica.org:443/cps/VirtualHostRoot/portal_rss/manage_main'
  SERVER_PROTOCOL 'HTTP/1.1'
  channel.creation_time 1208728810
  CONNECTION_TYPE 'Keep-Alive'
  HTTP_ACCEPT_CHARSET 'ISO-8859-1,utf-8;q=0.7,*;q=0.7'
  HTTP_USER_AGENT 'Mozilla/5.0 (X11; U; SunOS i86pc; en-US; rv:1.8.1.12) Gecko/20080210 Firefox/2.0.0.12'
  HTTP_COOKIE '__utma=119353809.1305615612.1208722629.1208722629.1208722629.1; __utmb=119353809.271.10.1208722629273; __utmc=119353809; __utmz=119353809.1208722629.1.1.utmcsr=(direct)|utmccn=(direct)|utmcmd=(none); _ZopeId="98336331A3VAwZI4jVU"; __ac_name="rmahoney"; cpsskins_view_mode="eyJ0aGVtZXNfcGFuZWwiOiAid3lzaXd5ZyIsICJ0aGVtZSI6ICJkZWZhdWx0IiwgInBvcnRsZXRzX292ZXJyaWRlIjogIjEiLCAicG9ydGxldHNfcGFuZWwiOiAic2l0ZV9zdHJ1Y3R1cmUiLCAiY3VycmVudF91cmwiOiAiaHR0cHM6Ly9pbmRpY2EtZXQtYnVkZGhpY2Eub3JnL2xvZ2luX2Zvcm0ifQ=="; __ac="cm1haG9uZXk6c3RldzcxMDY%3D"; tree-s="eJzT0MgpMOQKVneEArcKV1t1rgIjrsSSAmMuPQB7tAe3"'
  SERVER_NAME 'localhost'
  REMOTE_ADDR '127.0.0.1'
  PATH_TRANSLATED '/VirtualHostBase/https/indica-et-buddhica.org:443/cps/VirtualHostRoot/portal_rss/manage_main'
  SERVER_PORT '8105'
  HTTP_HOST 'localhost:8105'
  HTTP_ACCEPT 'text/xml,application/xml,application/xhtml+xml,text/html;q=0.9,text/plain;q=0.8,image/png,*/*;q=0.5'
  GATEWAY_INTERFACE 'CGI/1.1'
  HTTP_X_FORWARDED_FOR '210.48.84.26'
  HTTP_X_FORWARDED_HOST 'indica-et-buddhica.org'
  HTTP_ACCEPT_ENCODING 'gzip,deflate'
*************************
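
For anyone wishing to reproduce the failure outside Zope, the following is a minimal sketch, not the CPS or ZPublisher code itself. It assumes a current Python 3 interpreter (the site above runs Python 2.4.4), and the names try_encode and to_latin1_html as well as the sample strings are hypothetical, chosen only for illustration. It shows that both kinds of character mentioned in the report (CJK characters and the transliteration diacritics) lie outside Latin-1, which would be consistent with the page charset 'iso-8859-1' shown in the log, while UTF-8 can represent them all.

```python
def try_encode(text, charset):
    """Return the encoded bytes, or None if charset cannot represent text."""
    try:
        return text.encode(charset)
    except UnicodeEncodeError:
        return None


def to_latin1_html(text):
    """Encode to Latin-1, replacing unrepresentable characters with
    numeric character references (one possible fallback for HTML output)."""
    return text.encode("latin-1", "xmlcharrefreplace")


# Hypothetical sample strings, not taken from the feeds themselves.
japanese = "\u7247\u5ca1"                         # two CJK characters
sanskrit = "\u0101 \u016b \u012b \u015a \u1e47"   # a/u/i macron, S acute, n under-dot
french = "caf\u00e9"                              # e acute is inside Latin-1

for label, sample in [("japanese", japanese),
                      ("sanskrit", sanskrit),
                      ("french", french)]:
    latin1_ok = try_encode(sample, "latin-1") is not None
    utf8_ok = try_encode(sample, "utf-8") is not None
    print(label, "latin-1 ok:", latin1_ok, "utf-8 ok:", utf8_ok)
```

If this matches what is happening on the server, the two symptoms may well share a cause: anything above code point 255 cannot be written out as Latin-1. Whether the better remedy is serving the management pages as UTF-8 or escaping the text as in to_latin1_html is a question for the CPS developers.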
_______________________________________________
cps-users mailing list
http://lists.nuxeo.com/mailman/listinfo/cps-users
