Leonidas Fegaras created MRQL-28:
------------------------------------
Summary: Fixed dumping of HDFS sequence file in Spark mode
Key: MRQL-28
URL: https://issues.apache.org/jira/browse/MRQL-28
Project: MRQL
Issue Type: Bug
Components: Run-Time/Spark
Reporter: Leonidas Fegaras
Assignee: Leonidas Fegaras
Priority: Minor
The attached patch fixes the following bug in Spark mode:
When using the MRQL "store file from e" in Spark distributed mode and the file
is already in HDFS, it reports an error instead of replacing the old file with
the new.
This is because the Spark method saveAsHadoopFile creates a file but cannot
replace an existing HDFS file (this may be a Spark bug). The patch simply
deletes the file before it calls the saveAsHadoopFile method.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)