Re: [scikit-learn] Is there any official position on PEP484/mypy?

Andreas Mueller Wed, 27 Jul 2016 14:10:17 -0700

Hi Daniel.
This hasn't been brought up before so there is no "official position".
I am generally in favor, though I'm not sure how doable it is.

We are generally pretty generous in accepting all kinds of inputs, and many of our options can have different types: (None, int, float, string, nd-array) is relatively common as a type for an option.

As we still support 2.6, we would need to do comments or external files.
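To make the "comments" route concrete, here is a minimal sketch of a PEP 484 type comment on a function whose option accepts several types. The names `MaxFeatures` and `resolve_max_features` are hypothetical and not taken from scikit-learn, the nd-array member of the union is left out so the example needs only the standard library, and it assumes the `typing` module is importable on the target interpreter.

```python
from typing import Union

# Hypothetical alias for an option that accepts several input types.
# An nd-array member is omitted to keep the sketch standard-library only.
MaxFeatures = Union[None, int, float, str]


def resolve_max_features(max_features, n_features):
    # type: (MaxFeatures, int) -> int
    """Turn a flexible option value into a concrete feature count."""
    if max_features is None:
        # No limit requested: use every feature.
        return n_features
    if isinstance(max_features, str):
        if max_features == "sqrt":
            return max(1, int(n_features ** 0.5))
        raise ValueError("unknown option: %r" % (max_features,))
    if isinstance(max_features, float):
        # A float is read as a fraction of the available features.
        return max(1, int(max_features * n_features))
    # Remaining case: a plain integer count.
    return max_features
```

The signature itself stays free of annotation syntax, so the interpreter only sees an ordinary comment, while a checker such as mypy can read the `# type:` line.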

As a user, you are probably most interested in the outputs, right? The types returned by scikit-learn could probably be auto-generated.


I'm curious to see what others think.

I'd be surprised if anyone is willing to invest a large amount of time on this, though if you guys want to contribute, we might be able to work something out.

Andy


On 07/27/2016 03:17 PM, Daniel Moisset wrote:

Hi,
[If you're also on the numpy mailing list and get a similar version of the message, I apologise for that]
I work at Machinalis, where we use a lot of scikit-learn (and the pydata stack in general). Recently we've also been getting involved with mypy, which is a tool to type check (not at runtime; think of it as a linter) annotated Python code (the way of annotating Python types has recently been standardized in PEP 484).
As part of that involvement we've started creating type annotations for the Python libraries we use most, which include both numpy and scikit-learn. Mypy provides a way to specify types with annotations in separate files in case you don't have control over a library, so we have created an initial proof of concept for numpy at [1], and we are actively improving it. You can find some additional information about it and some problems we've found on the way at this blog post [2]. We were planning to also start some work on scikit-learn (which has a much larger surface area than numpy, so probably focusing on small parts for now); we had to start with numpy anyway given that SKL depends on it.
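For readers who have not seen one, a separate annotation file (a "stub") might look roughly like the sketch below. The module path and the `Scaler` class are hypothetical and not taken from the repository at [1]; plain nested lists stand in for arrays so that no numpy types have to be assumed.

```python
# Contents of a hypothetical stub file, e.g. mylib/scaler.pyi
from typing import List, Optional


class Scaler:
    # Bodies and default values are written as "..." because a stub
    # only describes signatures; the real code lives in the library.
    def __init__(self, with_mean: bool = ...) -> None: ...
    def fit(self, X: List[List[float]],
            y: Optional[List[float]] = ...) -> 'Scaler': ...
    def transform(self, X: List[List[float]]) -> List[List[float]]: ...
```

The library's own source is left untouched; the type checker reads the stub in its place.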
What I wanted to ask is whether the people involved in the SKL project are aware of PEP 484 annotations and whether you have some interest in starting to use them. The main benefit is that annotations serve as clear (and automatically testable) documentation for users; secondary benefits are that users discover bugs more quickly and that some IDEs (like PyCharm) are starting to use this information for smart editor features (autocompletion, online checking, refactoring tools); eventually tools like Jupyter could take advantage of these annotations too. And the cost of writing and including these is relatively low.
We're doing the work anyway, but contributing our typespecs back could make it easier for users to benefit from this, and for us to maintain it and keep it in sync with future releases.
If you've never heard about PEP 484 or mypy (it happens a lot) I'll be happy to clarify anything about it that might help in understanding this situation.
Thanks!

D.


[1] https://github.com/machinalis/mypy-data
[2] http://www.machinalis.com/blog/writing-type-stubs-for-numpy/

--
Daniel F. Moisset - UK Country Manager
www.machinalis.com <http://www.machinalis.com>
Skype: @dmoisset


_______________________________________________
scikit-learn mailing list
[email protected]
https://mail.python.org/mailman/listinfo/scikit-learn

