Wojciech Indyk created PIO-38:
---------------------------------
Summary: add Apache Parquet as a data source
Key: PIO-38
URL: https://issues.apache.org/jira/browse/PIO-38
Project: PredictionIO
Issue Type: New Feature
Reporter: Wojciech Indyk
Apache Parquet (https://parquet.apache.org/) is a columnar data store, native
for Apache Spark and very well suited to storing batch data (as an input) for
PredictionIO Engine.
Parquet is very popular to archive clickstream, so it would enable to use
PredictionIO without additional import of data (and duplication) to HBase.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)