Hi experts,

Thank you very much for your quick reply on this. I encountered another
error when going through
the tutorial noted here:

http://scikit-learn.github.io/scikit-learn-tutorial/working_with_text_data.html

>>> from sklearn.datasets import fetch_20newsgroups
>>> twenty_train = fetch_20newsgroups(subset='train',
categories=categories, shuffle=True, random_state=42)
/Library/Frameworks/Python.framework/Versions/3.3/lib/python3.3/site-packages/sklearn/datasets/twenty_newsgroups.py:79:
DeprecationWarning: The 'warn' method is deprecated, use 'warning' instead
  logger.warn("Downloading dataset from %s (14 MB)", URL)
Downloading dataset from
http://people.csail.mit.edu/jrennie/20Newsgroups/20news-bydate.tar.gz (14
MB)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File
"/Library/Frameworks/Python.framework/Versions/3.3/lib/python3.3/site-packages/sklearn/datasets/twenty_newsgroups.py",
line 144, in fetch_20newsgroups
    cache_path=cache_path)
  File
"/Library/Frameworks/Python.framework/Versions/3.3/lib/python3.3/site-packages/sklearn/datasets/twenty_newsgroups.py",
line 88, in download_20newsgroups
    cache = dict(train=load_files(train_path, charset='latin1'),
  File
"/Library/Frameworks/Python.framework/Versions/3.3/lib/python3.3/site-packages/sklearn/datasets/base.py",
line 181, in load_files
    data = [open(filename).read() for filename in filenames]
  File
"/Library/Frameworks/Python.framework/Versions/3.3/lib/python3.3/site-packages/sklearn/datasets/base.py",
line 181, in <listcomp>
    data = [open(filename).read() for filename in filenames]
  File
"/Library/Frameworks/Python.framework/Versions/3.3/lib/python3.3/codecs.py",
line 300, in decode
    (result, consumed) = self._buffer_decode(data, self.errors, final)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xb5 in position 894:
invalid start byte


On Fri, Jul 26, 2013 at 4:06 PM, Lars Buitinck <[email protected]> wrote:

> 2013/7/27 Vlad Niculae <[email protected]>:
> > Only the current development version, and the upcoming release, has,
> > as of recently, support for Python 3.  Even so, it won't be easy to
> > support 3.2, we just aim for 3.3 at the moment.
> > This being said, I have no idea what causes this specific error. That
> > line seems unchanged in the current master.
>
> Actually, that code just got merged in last week. It's probably not
> being tested because of some test fixture. Anyway, I fixed it, and it
> should work with Python 3.x now.
>
> --
> Lars Buitinck
> Scientific programmer, ILPS
> University of Amsterdam
>
>
> ------------------------------------------------------------------------------
> See everything from the browser to the database with AppDynamics
> Get end-to-end visibility with application monitoring from AppDynamics
> Isolate bottlenecks and diagnose root cause in seconds.
> Start your free trial of AppDynamics Pro today!
> http://pubads.g.doubleclick.net/gampad/clk?id=48808831&iu=/4140/ostg.clktrk
> _______________________________________________
> Scikit-learn-general mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
>
------------------------------------------------------------------------------
See everything from the browser to the database with AppDynamics
Get end-to-end visibility with application monitoring from AppDynamics
Isolate bottlenecks and diagnose root cause in seconds.
Start your free trial of AppDynamics Pro today!
http://pubads.g.doubleclick.net/gampad/clk?id=48808831&iu=/4140/ostg.clktrk
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to