[
https://issues.apache.org/jira/browse/SPARK-7400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14530783#comment-14530783
]
Eron Wright commented on SPARK-7400:
-------------------------------------
I am thinking about ML scenarios here, with the ultimate goal of using a
DataFrame with PortableDataStream-type columns in an ML pipeline. The scenario
is to process image data. Perhaps a transformer would map PortableDataStream
to Vector (decoding and vectorizing each image).
> PortableDataStream UDT
> ----------------------
>
> Key: SPARK-7400
> URL: https://issues.apache.org/jira/browse/SPARK-7400
> Project: Spark
> Issue Type: Sub-task
> Components: SQL
> Reporter: Eron Wright
>
> Improve support for PortableDataStream in a DataFrame by implementing
> PortableDataStreamUDT.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]