On 03/24/2015 11:24 AM, Pat Ferrel wrote:
Lots of non-math in math-scala: Reader/Writer traits, options, option parsing,
driver base classes, IndexedDataset (mini-Dataframes), and there will always be
more because there is no other engine-neutral module. So far the rule, as I
understand it, is: if it’s engine-neutral, put it in math-scala. The rule
doesn’t match the name and since this will be the first release to go to the
Maven repos as an artifact it seems like a good time to name it for what it is.
Either this or we create a new module for non-math?
Vote:
math-scala
scala-core
scala-base
other?
I would consider IndexedDataset to be math, along with its
corresponding reader/writer traits (similar to drm.dfsWrite(...)). Would
it be too confusing to keep it as is for 0.10.0 and then move options,
option parsing, and driver base classes, along with any other new non-math
stuff, over to something like scala-core for a later release?
If so, I'd say "math-scala" or "math-base" and "scala-core" are good names.
On Mar 23, 2015, at 11:02 AM, Dmitriy Lyubimov <[email protected]> wrote:
I like math-*. And it is math only there. Or was, last time i checked. it will be what R
calls "R-base", and I would welcome no other scope there. all environment
things are math. all ML things are math. quasi-newton, bayesian optimizers, linear search
are all math. Stats are math. als, (d)ssvd, (d)spca, (d)als are all math.
non-math is perhaps an app server like R Shiny, if we ever get there; that
definitely deserves a module. But other than that, what else are we talking
about here?
renaming artifacts is confusing to hands-on people (me included). it is the
reason why i lost my way in modern hadoop dependencies.
i am good with all the "Oneiric Ocelot" and the rest of the fancy animal kingdom
names
On Sun, Mar 22, 2015 at 9:34 AM, Pat Ferrel <[email protected]> wrote:
The primary artifacts needed/created for the new scala stuff are math,
math-scala, spark, spark-shell, h2o
Might it be better to rename math-scala to core-scala (or just core), since it
does now and will increasingly include non-math? This is where engine-neutral
stuff goes and so is a core dependency.
Also thinking about a name for the Mahout Scala DSL, Shell, optimized linear algebra, bayesian
ops, stats, yada yada - environment. How about (sticking with the Sanskrit theme) "him
tendua" or maybe just "tendua", which is Sanskrit for mountain leopard or panther. A
nice word, easily found in searches, that evokes good connotations. Besides, the kits are
seriously cute.