Name: A binding to R
Level: Advanced
Possible mentor: ?
Possible second mentor: ?
Description
With the increased popularity of high-level statistical
programming language and environments for data analysis like R, a
way to interface this package is becoming a must have for
Smalltalk, since it implements complex and unrivalled statistical
techniques for data manipulation and presentation, like Analysis
of Variance, Covariance, Time Series, Generalized Linear Models,
Additive Models, Non-linear Regressions, Tree Models,
Multivariate Statistics, etc. besides the many mathematical
functions, which are used in fields from economics to medicine
and engineering. It is estimated that R posseses about 2 million
of users worldwide and more than 2000 add-ons and increasing
everyday through repositories like CRAN and Crantastic.org
Technical Details
The student should know or be motivated to learn statistics with
the R environment and language, and its fundamental workflow:
importing and preparing the data, and finally running the
analysis, and presenting the results. Dealing with R sessions and
presentation of results (like vectors and plots) will be
challenging too.
Benefits to the Student
The student will gain invaluable experience from two
complementary environments, and his experience with the interface
technology choosed will be useful for the many projects where
Smalltalk needs help from external systems.
Benefits to the Community
The goal of this project is to build a wrapper to interface R, an
open source statistical programming language, providing a whole
range of missing functionality to Smalltalk. This binding could
complement the R environment where a general programming
environment is needed, attracting many statisticians, and will
open Smalltalk to domain-specific areas as diverse as Clinical
Trials, Finance and Machine Learning.
2012/3/9 Janko Mivšek <[email protected]
<mailto:[email protected]>>
Dear Smalltalkers,
We just submitted the application to this year GSoC, now let
we wait
with fingers crossed for Google until next Friday to decide.
I think we
have quite some chance, specially because we collected and
very well
described 22 project ideas:
http://gsoc2012.esug.org/ideas
There is still a time for ideas, so if someone has still half
finished
one, please finish and publish it. Also authors of ideas
please check if
your idea is put correctly on above page.
Best regards
Janko and Carla
--
Carla F. Griggio, Janko Mivšek
Smalltalk GSoC Admin Team
http://gsoc2012.esug.org
--
Hernán Morales
Information Technology Manager,
Institute of Veterinary Genetics.
National Scientific and Technical Research Council (CONICET).
La Plata (1900), Buenos Aires, Argentina.
Telephone: +54 (0221) 421-1799.
Internal: 422
Fax: 425-7980 or 421-1799.