Github user thvasilo commented on the pull request:
https://github.com/apache/flink/pull/861#issuecomment-114881148
So in order to make this a feature that we can merge, we should add some
functionality that goes beyond the decision tree use case.
My thoughts were that we use the continuous histogram code in order to add
a pimp-your-class style histogram(numBuckets: Int) function to a
DataSet[Double].
This will cover in part
[FLINK-1127](https://issues.apache.org/jira/browse/FLINK-1127). The design
should be similar to
[this](https://github.com/apache/flink/pull/832/files#diff-04a913384132b325b844f41b9545b2cf)
pending PR, where the histogram function would be added to DataSetUtils, for
DataSets[Double]. How does that sound?
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---