Hi Jay, I am not on the Commons PMC, so feel free to give preference to other suggestions, but I have one suggestion.
I contributed a Principle Component Analysis algorithm to the Hipparchus project[1][2][3] (it's a Commons Math fork). I have been meaning to propose the same class to Commons Math but haven't found the cycles yet. You'd use this algorithm after doing linear regression, clustering, or classification of n-dimensional data and then want to visualize it in 2 or 3 dimensions. It relies on some statistics functionality. See [4] for some examples of use (both Commons Math, with PCA done by hand, and Hipparchus examples are shown) Why it could be an easy first task? Assuming the PMC were happy with this addition, it could be as simple as placing a single existing source file under (suggestion): https://github.com/apache/commons-math/tree/master/commons-math-legacy/src/main/java/org/apache/commons/math4/legacy/stat/projection/PCA.java And a corresponding test file (also existing, see [3]), plus some doco fixes. So, it might be an easy win that gets you introduced to the codebase and ASF processes. Why it might not be? Commons math has been undergoing a process of splitting into fine-grained functionality. The "legacy" part of the codebase isn't really where we want to be putting new stuff. It isn't clear though what the fine-grained prerequisite pieces might be for this class (or the appetite for supporting such pieces). If there was appetite for creating a medium granularity commons-ml (consisting of linear, ml, parts of stat packages), something new wouldn't be too hard but it seems quite a bit more work to make it finer grained than that. If the project has some guidance, I'd be happy to help. Cheers, Paul. [1] https://github.com/Hipparchus-Math/hipparchus/blob/develop/hipparchus-stat/src/main/java/org/hipparchus/stat/projection/PCA.java [2] https://github.com/Hipparchus-Math/hipparchus/issues/221 [3] https://github.com/Hipparchus-Math/hipparchus/pull/222 [4] https://github.com/paulk-asert/groovy-data-science/tree/master/subprojects/Whiskey On Tue, Apr 8, 2025 at 8:07 AM Jay Sharma <sharmajay2...@gmail.com> wrote: > > Hello Apache Commons Team, > > I’m Jay Rakesh Sharma, a GSoC 2024 aspirant with a background in B.Sc. > (specializing in Mathematics) and currently pursuing an M.Sc. in Computer > Applications. I have a strong foundation in core mathematics and Java > programming, and I’m highly enthusiastic about contributing to Apache > Commons Math. > > I’ve gone through the GitHub repo and project ideas and I’m very interested > in contributing to algorithmic improvements, optimization, or statistical > modules. I’d love guidance on selecting a good beginner issue to get > started and also on aligning my efforts with the most impactful areas for > GSoC. > > I’m committed to consistent contributions and learning, and I would be > grateful for any suggestions or mentorship. > > Looking forward to working with you all! > > Thank you, > Your Sincerely > Jay Sharma > GitHub: github.com/jaysharmagithub --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org For additional commands, e-mail: dev-h...@commons.apache.org