I am trying to parse an html page. Have following error while doing that

 src = sel.get_html_source()
        links = re.findall(r'<a class="al4"[^<]*</a>', src)
        for link in links:
            print link



======================================================================
ERROR: test_new (__main__.NewTest)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "<stdin>", line 19, in test_new
UnicodeEncode    Error: 'ascii' codec can't encode character u'\xae' in
position 90: ordinal not in range(128)

----------------------------------------------------------------------
Ran 1 test in 6.345s
_______________________________________________
Tutor maillist  -  Tutor@python.org
http://mail.python.org/mailman/listinfo/tutor

Reply via email to