Re: [Scikit-learn-general] fetch_mldata()

2012-06-14 Thread Andreas Mueller
Hi Immanuel. Sorry, I couldn't find my scripts. I'll look at home, though it is possible I deleted them. Have you tried dense data? Not sure how the sparse format works. Have you uploaded something already? I can't really help you today, but maybe tomorrow. If you want, you can send me you hdf5 fi

Re: [Scikit-learn-general] In Beijing next week

2012-06-14 Thread xinfan meng
On Thu, Jun 14, 2012 at 1:43 PM, Gael Varoquaux < [email protected]> wrote: > It seems that this email didn't make it through, so I am resending it: > > On Thu, Jun 14, 2012 at 02:05:17AM +0200, Gael Varoquaux wrote: > > On Mon, Jun 11, 2012 at 10:29:03PM +0800, xinfan meng wrote: > >

Re: [Scikit-learn-general] In Beijing next week

2012-06-14 Thread Gael Varoquaux
On Mon, Jun 11, 2012 at 10:29:03PM +0800, xinfan meng wrote: >Gael, it seems that there are not many restaurants around CNCC. If you >want to try some Chinese food, I recommend the ones around Mudanyuan >station, which is not far way from CNCC. How about we say that we meet at 18:30 ou

[Scikit-learn-general] Customizing the vectorizer classes ... for Asian Languages

2012-06-14 Thread Dinesh B Vadhia
Hi! In the docs under Customizing the vectorizer classes - http://scikit-learn.org/dev/modules/feature_extraction.html#customizing-the-vectorizer-classes - it says, "Customizing the vectorizer can be very useful to handle Asian languages that do not use an explicit word separator such as the whi

Re: [Scikit-learn-general] pickled random forest file size, by design?

2012-06-14 Thread Olivier Grisel
2012/6/13 Emanuele Olivetti : > Hi, > > You can use gzip.open() instead of open() to add compression and to (possibly) > decrease the file size a lot - at least it did to me in a similar example: > > import gzip > pickle.dump(clf, gzip.open("test.pkl", 'wb'), > protocol=pickle.HIGHEST_PROTOCOL) >

[Scikit-learn-general] SciPy2012 conference: Last week for early birds, poster submissions

2012-06-14 Thread Stéfan van der Walt
Hi everyone We're rapidly approaching SciPy2012 , which takes place in Austin, Texas from July 16th to 21st. This is a reminder that the *discounted early bird registration* closes on the 18th of this month. Also, we decided to keep the queue for *poster su

Re: [Scikit-learn-general] Possibility to do a sprint in Paris, 13-14 September

2012-06-14 Thread Alexandre Gramfort
I should be there too Alex On Thu, Jun 14, 2012 at 6:50 PM, Olivier Grisel wrote: > 2012/6/14 Nelle Varoquaux : >> Hi everyone, >> >> PyconFR offers this year to host sprints during two days, at la villette in >> Paris, mid >> september. Considering there is a big french machine learning communi

Re: [Scikit-learn-general] Possibility to do a sprint in Paris, 13-14 September

2012-06-14 Thread Olivier Grisel
2012/6/14 Nelle Varoquaux : > Hi everyone, > > PyconFR offers this year to host sprints during two days, at la villette in > Paris, mid > september. Considering there is a big french machine learning community, I > think this would be a great opportunity for us to do a sprint ! > > The sprint would

Re: [Scikit-learn-general] Installing scikit-learn on Cygwin failed

2012-06-14 Thread matt_bali
A more verbose output log: == compile options: '-DNO_ATLAS_INFO=1 -Isklearn/src/cblas -I/usr/lib/python2.6/site-packages/numpy/core/include -I/usr/lib/python2.6/site-packages/numpy/core/include -I/usr/include/python2.6 -c' gcc: sklearn/utils/arra

[Scikit-learn-general] Installing scikit-learn on Cygwin failed

2012-06-14 Thread matt_bali
In current Cygwin with builtin Numpy (1.6.2) and lapack, I managed to install Scipy, Matplotlib and some other modules. Unfortunately scikit-learn fails with this error: == error: Command "gcc -fno-strict-aliasing -g -O2 -pipe -DNDEBUG -g -fwrapv -

Re: [Scikit-learn-general] fetch_mldata()

2012-06-14 Thread iBayer
Hey Andreas, I don't get the hdf5 files though the mldata.org parser and the documentation is somewhat sparse... Do you have some old code laying around for setting up the hdf5 files? Currently I'm using: ( and failing... ) - f = h5py.File('InternetAd.h5', 'w') # HDF5 attributes on the root

[Scikit-learn-general] Possibility to do a sprint in Paris, 13-14 September

2012-06-14 Thread Nelle Varoquaux
Hi everyone, PyconFR offers this year to host sprints during two days, at la villette in Paris, mid september. Considering there is a big french machine learning community, I think this would be a great opportunity for us to do a sprint ! The sprint would take place the 13 and 14 of September, ju