Package: rss2email
Version: 1:2.54-2
Severity: normal

I just got the following exception.  The (bzipped) feed is attached.


737:[EMAIL PROTECTED]: ~] r2e run
=== SEND THE FOLLOWING TO [EMAIL PROTECTED] ===
E: could not parse http://transhumanism.org/index.php/th/rss_2.0/
Traceback (most recent call last):
  File "/usr/share/rss2email/rss2email.py", line 350, in run
    id = getID(entry)
  File "/usr/share/rss2email/rss2email.py", line 187, in getID
    content = getContent(entry)
  File "/usr/share/rss2email/rss2email.py", line 176, in getContent
    return html2text(c.value)
  File "/usr/share/rss2email/html2text.py", line 387, in html2text
    return optwrap(html2text_file(html, None))
  File "/usr/share/rss2email/html2text.py", line 382, in html2text_file
    h.feed(html)
  File "/usr/lib/python2.3/sgmllib.py", line 95, in feed
    self.goahead(0)
  File "/usr/lib/python2.3/sgmllib.py", line 120, in goahead
    self.handle_data(rawdata[i:j])
  File "/usr/share/rss2email/html2text.py", line 376, in handle_data
    self.o(data, 1)
  File "/usr/share/rss2email/html2text.py", line 371, in o
    self.out(data)
  File "/usr/share/rss2email/html2text.py", line 154, in outtextf
    if type(s) is type(''): s = codecs.utf_8_decode(s)[0]
UnicodeDecodeError: 'utf8' codec can't decode byte 0x9d in position 58: 
unexpected code byte
rss2email 2.54
feedparser 3.3
html2text 2.2
Python 2.3.5 (#2, May  4 2005, 08:51:39)
[GCC 3.3.5 (Debian 1:3.3.5-12)]
=== END HERE ===



-- System Information:
Debian Release: 3.1
  APT prefers unstable
  APT policy: (500, 'unstable')
Architecture: i386 (i686)
Shell:  /bin/sh linked to /bin/bash
Kernel: Linux 2.6.10-1-686
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)

Versions of packages rss2email depends on:
ii  python                        2.3.5-2    An interactive high-level object-o

-- no debconf information

-- 
Martin Michlmayr
http://www.cyrius.com/

Attachment: index.html.bz2
Description: Binary data

Reply via email to