On 14 January 2013 19:43, Andreas Mueller wrote:
> Hi Robert.
> Not sure if you saw my mail:
> In current master, this is fixed!
> See
> https://github.com/scikit-learn/scikit-learn/issues/1137
> and
> https://github.com/scikit-learn/scikit-learn/pull/1443
>
> Best,
> Andy
>
>
> -
Hi Robert.
Not sure if you saw my mail:
In current master, this is fixed!
See
https://github.com/scikit-learn/scikit-learn/issues/1137
and
https://github.com/scikit-learn/scikit-learn/pull/1443
Best,
Andy
--
Master Visual
On 14 January 2013 17:42, Gael Varoquaux wrote:
> > I've been having a lot of trouble loading as a numpy array. I know
> > generally how to do it, but I must be doing it wrong since the numpy
> > array can't fit in memory, whle the "list of strings" representation
> > does
>
> I believe that i
there is a fix for that in current master. check arrays now has 'allow lists'.
andy
Robert Layton schrieb:
>When using cross_validation.X, all arrays are checked in the normal way
>--
>using check_arrays.
>I am developing code that uses string documents as input, so I have a
>list
>of strings
> I've been having a lot of trouble loading as a numpy array. I know
> generally how to do it, but I must be doing it wrong since the numpy
> array can't fit in memory, whle the "list of strings" representation
> does
I believe that it's because the string are store in a 'string
representation
On 14 January 2013 16:10, Kenneth C. Arnold wrote:
> Why not use numpy arrays of strings all along? Their importance here is
> fancy indexing... Or use X=np.arange(N) and do the fancy indexing yourself
> on demand?
>
> -Ken
> On Jan 13, 2013 11:04 PM, "Robert Layton" wrote:
>
>> When using cross_
Why not use numpy arrays of strings all along? Their importance here is
fancy indexing... Or use X=np.arange(N) and do the fancy indexing yourself
on demand?
-Ken
On Jan 13, 2013 11:04 PM, "Robert Layton" wrote:
> When using cross_validation.X, all arrays are checked in the normal way --
> using
When using cross_validation.X, all arrays are checked in the normal way --
using check_arrays.
I am developing code that uses string documents as input, so I have a list
of strings as the "data" and a numpy array as classes as normal.
(In case anyone doesn't know, my research area is authorship ana