Re: [scikit-learn] Long term roadmap and moonshot goals

2019-07-23 Thread Piotr Szymański
If I could pitch in, it would be lovely, very lovely indeed, if
scikit-learn models could:

- operate on sparse data, both input and output by default
- implement some kind of sparse vector representation (as in
https://github.com/scikit-learn/scikit-learn/issues/8908 )
- perhaps have a unifiying numpy.array / scipy.sparse_matrix interface to
give people some slack on jumping betwen [] operator conventions

We would benefit from that strongly in scikit-multilearn, as when a
multi-output problem is transformed to a single-output problem based on
unique combinations, this representation has to be dense for scikit-learn
at the moment. We end up losing some speed there. I'm sure other libraries
like ex. imbalanced-learn, or scikit-multiflow would also see these as a
huge thing.

Best,
Piotr



On Sun, Jul 14, 2019 at 8:44 PM Andreas Mueller  wrote:

> Hi all.
> At SciPy, Brian Granger raised a good point about their planning for the
> Jupyter Project, which is the importance of long-term goals.
>
> I think it's great that we now have a detailed short-term roadmap
> (https://scikit-learn.org/dev/roadmap.html).
> Given that we now have about 6(!) full time people (Oliver, Jeremy,
> Guillaume, Nicolas, Thomas, Adrin) on scikit-learn (GO TEAM!!), I think
> it's realistic
> to achieve most of these within a year or two. We have actually made
> some significant progress already.
>
> I think now would be a good time to start thinking about a longer-term
> roadmap, say 3-5 years out.
> What do we want to achieve? What are realistic goals, and what are
> moonshot goals?
> Having a common vision and shared goals might help us with funding, but
> might also help us with prioritization and motivation.
>
> What do you think? Do you think this is important and worth-while?
> And what should our goals be?
>
> Best,
> Andy
> ___
> scikit-learn mailing list
> scikit-learn@python.org
> https://mail.python.org/mailman/listinfo/scikit-learn
>


-- 
Piotr Szymański
nied...@gmail.com
___
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn


Re: [scikit-learn] Inclusion of an LSTM Classifier

2019-02-17 Thread Piotr Szymański
I've created a couple of scikit-learn compatible wrappers and model
generators for scikit-multilearn: http://scikit.ml/multilabeldnn.html

Depends on what library you prefer, here's some examples on how to use
LSTMs via:
- Keras:
https://medium.com/@dclengacher/keras-lstm-recurrent-neural-networks-c1f5febde03d
- pyTorch:
https://pytorch.org/tutorials/beginner/nlp/sequence_models_tutorial.html

Just create a relevant model generating function, take a wrapper from
scikit-multilearn, and put it into the scikit pipeline anyway you want.

Best,
Piotr Szymanski
Scikit-multilearn Maintainer

On Sun, Feb 17, 2019 at 7:55 PM David Burns 
wrote:

> There is an sklearn wrapper for Keras models in the Keras library. That's
> an easy way to use LSTM in sklearn. Also the sklearn estimator API is
> pretty easy to figure out if you want to roll your own wrapper for any
> model really.
> ___
> scikit-learn mailing list
> scikit-learn@python.org
> https://mail.python.org/mailman/listinfo/scikit-learn
>


-- 
Piotr Szymański
nied...@gmail.com
___
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn


Re: [scikit-learn] How does the random state influence the decision tree splits?

2018-10-28 Thread Piotr Szymański
> >>>
> >>> If someone knows more about this, where the random_state is used, I'd
> be happy to hear it :)
> >>>
> >>> Also, we could then maybe add the info to the DecisionTreeClassifier's
> docstring, which is currently a bit too generic to be useful, I think:
> >>>
> >>>
> https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/tree/tree.py
> >>>
> >>>
> >>>   random_state : int, RandomState instance or None, optional
> (default=None)
> >>>   If int, random_state is the seed used by the random number
> generator;
> >>>   If RandomState instance, random_state is the random number
> generator;
> >>>   If None, the random number generator is the RandomState instance
> used
> >>>   by `np.random`.
> >>>
> >>>
> >>> Best,
> >>> Sebastian
> >>> ___
> >>> scikit-learn mailing list
> >>> scikit-learn@python.org
> >>> https://mail.python.org/mailman/listinfo/scikit-learn
> >>> ___
> >>> scikit-learn mailing list
> >>> scikit-learn@python.org
> >>> https://mail.python.org/mailman/listinfo/scikit-learn
> >>
> >> ___
> >> scikit-learn mailing list
> >> scikit-learn@python.org
> >> https://mail.python.org/mailman/listinfo/scikit-learn
> > ___
> > scikit-learn mailing list
> > scikit-learn@python.org
> > https://mail.python.org/mailman/listinfo/scikit-learn
>
> ___
> scikit-learn mailing list
> scikit-learn@python.org
> https://mail.python.org/mailman/listinfo/scikit-learn
>


-- 
Piotr Szymański
nied...@gmail.com
___
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn


Re: [scikit-learn] Scikit Multi learn error.

2018-06-26 Thread Piotr Szymański
Scikit-multilearn features a larger variety of models, many of which are
still not above the selectiveness threshold of scikit-learn. In general,
scikit-learn implements only three multi-label classifiers -
BinaryRelevance, OneVsRest and ClassifierChains. And generally
sklearn.multioutput is a very recent addition (2016), added 3 years after
the scikit-multilearn library was started.

(responding again, after joining list, i'm sorry if anyone got this twice)


wt., 26 cze 2018 o 19:59 użytkownik Piotr Szymański 
napisał:

> Scikit-multilearn features a larger variety of models, many of which are
> still not above the selectiveness threshold of scikit-learn. In general,
> scikit-learn implements only three multi-label classifiers -
> BinaryRelevance, OneVsRest and ClassifierChains. And generally
> sklearn.multioutput is a very recent addition (2016), added 3 years after
> the scikit-multilearn library was started.
>
> wt., 26 cze 2018 o 19:40 użytkownik Fernando Marcos Wittmann <
> fernando.wittm...@gmail.com> napisał:
>
>> Why there's a library based on Sklearn for multi classification? Sklearn
>> itself can handle this (
>> http://scikit-learn.org/stable/modules/multiclass.html)
>>
>> On Tue, Jun 26, 2018 at 5:08 PM, Vlad Niculae  wrote:
>>
>>> Hi Aijaz,
>>>
>>> You're writing to the wrong mailing list. This is the mailing list for
>>> scikit-learn, not scikit-multilearn, which is a different and unrelated
>>> project. You're unlikely to get an answer here;
>>> I recommend following the contact information on the scikit-multilearn
>>> website.
>>>
>>> Best of luck,
>>>
>>> Yours
>>> Vlad
>>>
>>> On Tue, Jun 26, 2018, 11:04 aijaz qazi  wrote:
>>>
>>>> Dear developer ,
>>>>
>>>> I am working on web page categorization  with http://scikit.ml/ .
>>>>
>>>>
>>>> *Question*: I am not able to execute MLkNN code on the link
>>>> http://scikit.ml/api/classify.html. I have installed py 3.6.
>>>>
>>>> I found scipy versions not compatible with scikit.ml 0.0.5.
>>>>
>>>> Which version of scipy would work with scikit.ml 0.0.5.
>>>>
>>>> Kindly let me know. I will be grateful.
>>>>
>>>>
>>>> *Regards,*
>>>> *Aijaz A.Qazi *
>>>> ___
>>>> scikit-learn mailing list
>>>> scikit-learn@python.org
>>>> https://mail.python.org/mailman/listinfo/scikit-learn
>>>>
>>>
>>> ___
>>> scikit-learn mailing list
>>> scikit-learn@python.org
>>> https://mail.python.org/mailman/listinfo/scikit-learn
>>>
>>>
>>
>>
>> --
>>
>> Fernando Marcos Wittmann
>> MS Student - Energy Systems Dept.
>> School of Electrical and Computer Engineering, FEEC
>> University of Campinas, UNICAMP, Brazil
>> +55 (19) 987-211302
>>
>>
___
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn