In downstream stages the labels and features are generally expected to be
doubles, so it's easier to produce the index as a double.
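
To make that concrete, here is a rough plain-Python sketch of the idea. It is not Spark code or Spark's implementation; it only mimics the documented behavior that the most frequent label gets index 0.0. The helper name fit_string_index and the alphabetical tie-break are assumptions made for illustration.

```python
from collections import Counter


def fit_string_index(values):
    """Map each distinct string to a float index, most frequent first.

    Hypothetical helper for illustration only, not part of Spark.
    """
    counts = Counter(values)
    # Sort by descending frequency; break ties alphabetically (an assumption)
    # so the mapping is deterministic.
    ordered = sorted(counts, key=lambda v: (-counts[v], v))
    return {value: float(i) for i, value in enumerate(ordered)}


categories = ["a", "b", "c", "a", "a", "c"]
index = fit_string_index(categories)

# Float labels can sit next to other numeric features without conversion.
labels = [index[v] for v in categories]

# Cast only at the point where an integer is really required.
int_labels = [int(x) for x in labels]
```

If an int is needed in Spark itself, the analogous step would be casting the indexed column afterwards (for example with Column.cast) rather than changing what the indexer emits.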
On Sat, Jan 21, 2017 at 5:32 PM Shiyuan wrote:
> Hi Spark,
> StringIndex uses double instead of int for indexing
> http://spark.apache.org/docs/latest/ml-features.html#stringindexe
Hi Spark,
StringIndexer uses double instead of int for indexing
http://spark.apache.org/docs/latest/ml-features.html#stringindexer. What's
the rationale for using double to index? Would it be more appropriate to
use int to index (which is consistent with other places like Vectors.sparse)?
Shiyuan