Alexander Ulanov created SPARK-8449:
---------------------------------------
Summary: HDF5 read/write support for Spark MLlib
Key: SPARK-8449
URL: https://issues.apache.org/jira/browse/SPARK-8449
Project: Spark
Issue Type: Improvement
Components: MLlib
Affects Versions: 1.4.0
Reporter: Alexander Ulanov
Fix For: 1.4.1
Add support for reading and writing HDF5 file format to/from LabeledPoint. HDFS
and local file system have to be supported. Other Spark formats to be
discussed.
Interface proposal:
/* path - directory path in any Hadoop-supported file system URI */
MLUtils.saveAsHDF5(sc: SparkContext, path: String, RDD[LabeledPoint]): Unit
/* path - file or directory path in any Hadoop-supported file system URI */
MLUtils.loadHDF5(sc: SparkContext, path: String): RDD[LabeledPoint]
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]