Sayak Paul | sayak.dev

---------- Forwarded message ---------
From: <scikit-learn-requ...@python.org>
Date: Fri, Sep 13, 2019 at 10:46 AM
Subject: scikit-learn Digest, Vol 42, Issue 15
To: <scikit-learn@python.org>


Send scikit-learn mailing list submissions to
        scikit-learn@python.org

To subscribe or unsubscribe via the World Wide Web, visit
        https://mail.python.org/mailman/listinfo/scikit-learn
or, via email, send a message with subject or body 'help' to
        scikit-learn-requ...@python.org

You can reach the person managing the list at
        scikit-learn-ow...@python.org

When replying, please edit your Subject line so it is more specific
than "Re: Contents of scikit-learn digest..."


Today's Topics:

   1. Re: scikit-learn Digest, Vol 42, Issue 14 (Sayak Paul)


----------------------------------------------------------------------

Message: 1
Date: Fri, 13 Sep 2019 10:46:09 +0530
From: Sayak Paul <spsayakp...@gmail.com>
To: scikit-learn@python.org
Subject: Re: [scikit-learn] scikit-learn Digest, Vol 42, Issue 14
Message-ID:
        <CAGa_XGHgifW7W81-dVv3R3e80Nz2BVx=ewikv52orf-bs+r...@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

I was able to solve the problem using -

mlb = MultiLabelBinarizer()
mlb.fit([y_train])

Thanks for the suggestions. The output of mlb.classes_ now looks like the
following (first ten classes):
[image: image.png]

However, when I transform it using mlb.transform([y_train]), another
problem arises -

[image: image.png]

Kindly suggest :)
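
The screenshots did not survive in the archive, so the exact problem is
not known. One possible cause, sketched below under assumptions, is that
wrapping y_train in an extra list makes MultiLabelBinarizer treat the whole
column as a single sample. The contents of y_train here are hypothetical
(one comma-separated label string per sample), not the poster's actual data.

```python
from sklearn.preprocessing import MultiLabelBinarizer

# Hypothetical stand-in for y_train: one raw label string per sample.
y_train = ["stat.ML, cs.LG", "cs.CV, cs.RO"]

# Wrapping in an extra list makes the whole column a single sample,
# so each raw string becomes one "class" and transform returns one row.
wrapped = MultiLabelBinarizer().fit([y_train])
one_row = wrapped.transform([y_train])
print(one_row.shape)

# Splitting each string gives one list of labels per sample instead.
per_sample = [s.split(", ") for s in y_train]
mlb = MultiLabelBinarizer().fit(per_sample)
Y = mlb.transform(per_sample)
print(mlb.classes_, Y.shape)
```

With this toy data the wrapped version yields a single row, while the
per-sample version yields one row per sample and one column per label.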


Sayak Paul | sayak.dev


On Thu, Sep 12, 2019 at 9:33 PM <scikit-learn-requ...@python.org> wrote:

> Today's Topics:
>
>    1. Re: MultiLabelBinarizer gives individual characters instead
>       of the classes (Loïc Estève)
>    2. Re: Vote on SLEP009: keyword only arguments (Guillaume Lemaître)
>    3. How can I enable line tracing for cython modules.
>       (Alejandro Javier Peralta Frias)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Thu, 12 Sep 2019 07:24:48 +0200
> From: Loïc Estève <loic.est...@ymail.com>
> To: Scikit-learn mailing list <scikit-learn@python.org>
> Subject: Re: [scikit-learn] MultiLabelBinarizer gives individual
>         characters instead of the classes
> Message-ID: <vnokwoeeozmn....@ymail.com>
> Content-Type: text/plain; charset=utf-8
>
> I think this caveat has been added in the dev doc (not yet in the stable
> doc). You may want to read:
>
> https://scikit-learn.org/dev/modules/generated/sklearn.preprocessing.MultiLabelBinarizer.html
> and in particular the part that starts with "A common mistake is to pass
> in a list".
>
> Cheers,
> Loïc
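
The caveat mentioned above can be illustrated with a minimal sketch. It
assumes each row's labels are stored as one string that looks like a Python
list (a hypothetical form, since the dataset is not shown here), and it uses
ast.literal_eval to turn each string into a real list before fitting.

```python
import ast

from sklearn.preprocessing import MultiLabelBinarizer

# Assumed raw form (hypothetical data): each row's labels as one string.
raw_labels = ["['stat.ML', 'cs.LG']", "['cs.CV', 'cs.RO']"]

# Fitting on strings directly: each string is iterated character by
# character, so the learned "classes" are single characters.
char_classes = MultiLabelBinarizer().fit(raw_labels).classes_
print(list(char_classes))

# Parse each string into a real list of labels first, then fit.
parsed_labels = [ast.literal_eval(s) for s in raw_labels]
mlb = MultiLabelBinarizer().fit(parsed_labels)
print(list(mlb.classes_))
```

If the labels are stored in another form, for example comma-separated text,
str.split would play the role that ast.literal_eval plays here.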
>
> > Hi.
> >
> > I am working on a multi-label text classification problem. In order to
> > encode the labels, I am using MultiLabelBinarizer. The labels of the
> > dataset look like -
> >
> > image
> >
> > When I am using
> >
> > mlb = MultiLabelBinarizer()
> > mlb.fit(labels)
> > print(mlb.classes_)
> >
> > I am getting -
> >
> > image
> >
> > Whereas, the output (sample output) I want is -
> >
> > image
> >
> > I got the above output by -
> >
> > mlb = MultiLabelBinarizer()
> > sample_labels = [
> >     ['stat.ML', 'cs.LG'],
> >     ['cs.CV', 'cs.RO']
> > ]
> > mlb.fit(sample_labels)
> > print(mlb.classes_)
> >
> > Help would be very much appreciated here.
> >
> > Here's the dataset I had prepared:
> > arXivdata.csv.zip
> >
> > I stripped away the double quotes in the labels after loading it in a
> > pandas DataFrame by -
> >
> > import re
> >
> > arxiv_data['labels'] = arxiv_data['labels'].str.replace(r"[\"]", '')
> >
> > scikit-learn version: '0.21.3'
> >
> > Sayak Paul | sayak.dev
>
>
>
> ------------------------------
>
> Message: 2
> Date: Thu, 12 Sep 2019 10:06:30 +0200
> From: Guillaume Lemaître <g.lemaitr...@gmail.com>
> To: Scikit-learn mailing list <scikit-learn@python.org>
> Subject: Re: [scikit-learn] Vote on SLEP009: keyword only arguments
> Message-ID:
>         <
> cacdxx9jcke5gajrnj3tkinbuywzqvxmrrchbbqn6q_fxydp...@mail.gmail.com>
> Content-Type: text/plain; charset="utf-8"
>
> To the question: do we want to utilise Python 3's force-keyword-argument
> syntax and to change existing APIs which support arguments positionally to
> use this syntax, via a deprecation period?
>
> I am +1.
>
> IMO, even if the syntax might be unknown, it will remain unknown as long as
> projects from the ecosystem are not using it.
>
> To the question: which methods should be impacted?
>
> I think we should be as gentle as possible at first. I am a little
> concerned about breaking code that was working fine before.
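
For readers unfamiliar with the syntax under discussion, here is a minimal
sketch. The function names (strict_scale, lenient_scale) and the decorator
(warn_extra_positional) are hypothetical illustrations and are not
scikit-learn APIs; the decorator only shows one way a deprecation period
could warn about positional use before it becomes an error.

```python
import functools
import warnings


def strict_scale(X, *, with_mean=True):
    """Everything after the bare `*` must be passed by keyword."""
    mean = sum(X) / len(X) if with_mean else 0.0
    return [x - mean for x in X]


def warn_extra_positional(n_allowed):
    """Hypothetical shim: warn about, but still accept, extra positional args."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            if len(args) > n_allowed:
                warnings.warn(
                    f"Pass arguments after the first {n_allowed} to "
                    f"{func.__name__} by keyword; positional use is deprecated.",
                    FutureWarning,
                    stacklevel=2,
                )
            return func(*args, **kwargs)
        return wrapper
    return decorator


@warn_extra_positional(1)
def lenient_scale(X, with_mean=True):
    """Same behaviour as strict_scale, kept callable positionally for now."""
    return strict_scale(X, with_mean=with_mean)
```

Calling strict_scale(X, False) raises a TypeError, whereas
lenient_scale(X, False) still works and emits a FutureWarning, which is the
kind of gentle transition asked for above.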
>
> On Thu, 12 Sep 2019 at 04:43, Joel Nothman <joel.noth...@gmail.com> wrote:
>
> > There are details of specific API changes to be decided:
> >
> > The question being put, as per the SLEP, is:
> > do we want to utilise Python 3's force-keyword-argument syntax
> > and to change existing APIs which support arguments positionally to use
> > this syntax, via a deprecation period?
> > _______________________________________________
> > scikit-learn mailing list
> > scikit-learn@python.org
> > https://mail.python.org/mailman/listinfo/scikit-learn
> >
>
>
> --
> Guillaume Lemaitre
> INRIA Saclay - Parietal team
> Center for Data Science Paris-Saclay
> https://glemaitre.github.io/
>
> ------------------------------
>
> Message: 3
> Date: Thu, 12 Sep 2019 09:23:03 -0300
> From: Alejandro Javier Peralta Frias
>         <alejandro.pera...@mercadolibre.com>
> To: scikit-learn@python.org
> Subject: [scikit-learn] How can I enable line tracing for cython
>         modules.
> Message-ID:
>         <CAL+ZpG6ccwnnJm1Q2CQM4qt+sfiMtHV5Tr=
> mgsgfpcmaszu...@mail.gmail.com>
> Content-Type: text/plain; charset="utf-8"
>
> Hello all,
>
> To enable Cython tracing (in particular, I want to line-trace the neighbors
> module), I understand that I have to recompile the Cython modules with
> CYTHON_TRACE=1, but I'm not sure where I should set this.
>
> Should I use:
>
> # distutils: define_macros=CYTHON_TRACE_NOGIL=1
>
>
> In the files I want to trace?
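
As a generic sketch only, and not scikit-learn's actual build configuration,
the same settings can also be applied from a build script: the C macros go on
the extension and the linetrace directive goes to cythonize. The module name
and source path below are hypothetical. In a .pyx file, the per-file
equivalent would be the distutils comment quoted above, together with a
"# cython: linetrace=True" comment.

```python
# Hypothetical module name and source path, for illustration only.
MODULE_NAME = "mypkg.neighbors_fast"
SOURCES = ["mypkg/neighbors_fast.pyx"]

# C macros that compile the tracing hooks into the generated C code.
TRACE_MACROS = [("CYTHON_TRACE", "1"), ("CYTHON_TRACE_NOGIL", "1")]

# Cython compiler directive that makes Cython emit line-tracing calls.
TRACE_DIRECTIVES = {"linetrace": True}


def build_traced_extensions():
    """Return extension modules built with line tracing enabled."""
    # Imported lazily so the settings above can be read without Cython.
    from Cython.Build import cythonize
    from setuptools import Extension

    ext = Extension(MODULE_NAME, SOURCES, define_macros=TRACE_MACROS)
    return cythonize([ext], compiler_directives=TRACE_DIRECTIVES)
```

The result of build_traced_extensions() would be passed as ext_modules to
setup(); the affected modules have to be recompiled for the change to take
effect.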
>
> Regards,
> --
> Ale
>
> ------------------------------
>
> End of scikit-learn Digest, Vol 42, Issue 14
> ********************************************
>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image.png
Type: image/png
Size: 16117 bytes
Desc: not available
URL: <
http://mail.python.org/pipermail/scikit-learn/attachments/20190913/921c80cd/attachment.png
>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image.png
Type: image/png
Size: 7675 bytes
Desc: not available
URL: <
http://mail.python.org/pipermail/scikit-learn/attachments/20190913/921c80cd/attachment-0001.png
>

------------------------------

End of scikit-learn Digest, Vol 42, Issue 15
********************************************