[ 
https://issues.apache.org/jira/browse/STATISTICS-7?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16812326#comment-16812326
 ] 

Gilles commented on STATISTICS-7:
---------------------------------

{quote}making a formal proposal on GSOC19 portal
{quote}
Do you know that the deadline for submitting applications is tomorrow?

That said, nothing prevents you from making concrete suggestions about the 
refactoring, but we won't have much time to exchange about it.

In any case, you are welcome to contribute, GSoC or not.
 If you are interested, please [subscribe to the "dev" mailing 
list|http://commons.apache.org/mail-lists.html].

> Stream-based Java statistical processing
> ----------------------------------------
>
>                 Key: STATISTICS-7
>                 URL: https://issues.apache.org/jira/browse/STATISTICS-7
>             Project: Apache Commons Statistics
>          Issue Type: New Feature
>            Reporter: Eric Barnhill
>            Priority: Major
>              Labels: GSoC2019, gsoc2019, statistics, streams
>
> The new component aims to be a library of commons statistics functions 
> synchronized with the latest developments in the Java language, in particular 
> Java's functional programming syntax.
> The library will make commonly used statistical functions available to an end 
> user through a simple grammar comparable to commons-math-statistics or 
> scikit-learn, while under the hood will implement Java's mapping, streaming, 
> and other producer and consumer functions to ensure the statistical methods 
> run optimally in new Java implementations.
> As functional programming grows increasingly central to big data applications 
> we believe these libraries will play an important function in the data 
> engineering ecosystem. In particular, data engineering is widely done with 
> Java, then passed to other languages for data-scientific analyses; however, 
> the common availability of functionally implemented statistical mapping and 
> reductions in Java could prove very useful at the interface of data science 
> and engineering, by enabling teams to more easily perform reductions on the 
> engineering side before handing off to the analysis side.
> Developers working on the project will have the opportunity to demonstrate 
> Java programming, functional programming, algorithm design, and data science 
> skills and receive authorship on a commons project that is likely to be 
> widely used.
> The ideal contributor will also be able to help with important architectural 
> decision making. The old source of these libraries, commons-math, grew too 
> large, hierarchically complex and interdependent for the commons mission. The 
> developers on this project need to make architectural choices that will 
> enable the statiscal code to be lightweight and reusable, with a minimum of 
> outside dependencies while avoiding redundancy.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to