GitHub user wangmiao1981 commented on the issue:
https://github.com/apache/spark/pull/16784
@jkbradley I simplified the tests and modified the data generation API to
use the toSparse method, which eliminates the index variable.
"Is this multivariate online summarizer issue really a bug? Or is it from
passing in sparse vectors which are 10x longer than the dense vectors?"
Based on my understanding, it is not a bug. The sparse vectors are larger in
size than the dense vectors, so we can't mix the two together.
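
To illustrate the point above, here is a minimal sketch in plain Python, not Spark's actual code. It assumes an online summarizer fixes its dimension from the first sample and rejects any later sample of a different size. The names SparseVec, to_sparse, and OnlineMean are hypothetical and exist only for this example.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Union


@dataclass(frozen=True)
class SparseVec:
    """Hypothetical sparse vector: a logical size plus its non-zero entries."""
    size: int
    entries: Dict[int, float]


Vec = Union[List[float], SparseVec]


def vec_size(v: Vec) -> int:
    """Logical dimension of a dense list or a SparseVec."""
    return v.size if isinstance(v, SparseVec) else len(v)


def to_sparse(dense: List[float]) -> SparseVec:
    """Convert a dense list to a SparseVec of the SAME logical size."""
    return SparseVec(len(dense), {i: x for i, x in enumerate(dense) if x != 0.0})


class OnlineMean:
    """Assumed behavior: the first sample fixes the dimension for all later ones."""

    def __init__(self) -> None:
        self.n: Optional[int] = None
        self.count = 0
        self.sums: List[float] = []

    def add(self, v: Vec) -> "OnlineMean":
        size = vec_size(v)
        if self.n is None:
            # First sample decides the expected dimension.
            self.n = size
            self.sums = [0.0] * size
        elif size != self.n:
            # Dense and sparse inputs can be mixed only if their sizes agree.
            raise ValueError(
                f"Dimensions mismatch: expecting {self.n} but got {size}."
            )
        items = v.entries.items() if isinstance(v, SparseVec) else enumerate(v)
        for i, x in items:
            self.sums[i] += x
        self.count += 1
        return self

    def mean(self) -> List[float]:
        return [s / self.count for s in self.sums]
```

In this sketch, a sparse vector built with to_sparse from a dense one keeps the dense size, so the two can be added to the same summarizer. A sparse vector declared with a size 10x larger raises the mismatch error instead.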