Andrew MacKinlay created SPARK-13610:
----------------------------------------
Summary: Create a Transformer to disassemble vectors in DataFrames
Key: SPARK-13610
URL: https://issues.apache.org/jira/browse/SPARK-13610
Project: Spark
Issue Type: New Feature
Components: ML, SQL
Affects Versions: 1.6.0
Reporter: Andrew MacKinlay
Priority: Minor
It is possible to convert a standalone numeric field into a single-item Vector,
using VectorAssembler. However the inverse operation of retrieving a single
item from a vector and translating it into a field doesn't appear to be
possible. The workaround I've found is to leave the raw field value in the DF,
but I have found no other ways to get a field out of a vector (eg to perform
arithmetic on it). Happy to be proved wrong though. Creating a user-defined
function doesn't work (in Python at least; it gets a pickleexception).This
seems like a simple operation which should be supported for various use cases.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]