GitHub user HyukjinKwon opened a pull request:
https://github.com/apache/spark/pull/20727
[SPARK-23577][SQL] Supports custom line separator for text datasource
## What changes were proposed in this pull request?
This PR proposes adding a `lineSep` option to the text datasource so that
the line separator is configurable.
Note that this PR follows Hive's default behaviour for `\n` for now - cover
other newline variants. See this discussion -
https://github.com/apache/spark/pull/18581#discussion_r134814393.
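For illustration only, here is a minimal sketch in plain Python of what a configurable line separator means for a line-oriented reader. It is not the code in this PR. `split_records` and its `line_sep` parameter are hypothetical names, and the default branch (treating `\r\n`, `\r` and `\n` all as terminators when no separator is given) is an assumption about the intended default, not something this description confirms.

```python
import re
from typing import List, Optional


def split_records(text: str, line_sep: Optional[str] = None) -> List[str]:
    """Split text into records the way a line-oriented reader might.

    Hypothetical helper for illustration; not part of Spark.
    """
    if line_sep is None:
        # Assumed default: any of \r\n, \r or \n ends a record.
        records = re.split(r"\r\n|\r|\n", text)
    else:
        if not line_sep:
            raise ValueError("line_sep must be a non-empty string")
        # Custom separator: only this exact string ends a record,
        # so ordinary newlines stay inside the record.
        records = text.split(line_sep)
    # A trailing separator terminates the last record; it does not
    # start an extra empty one.
    if records and records[-1] == "":
        records.pop()
    return records
```

With a custom separator such as `"|"`, the input `"a\nb|c\n"` yields two records, `"a\nb"` and `"c\n"`, because newlines are no longer treated as record boundaries. On the Spark side the equivalent choice would be made by setting the `lineSep` option on the text reader.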
## How was this patch tested?
Manual tests and unit tests were added.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/HyukjinKwon/spark linesep-text
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/20727.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #20727
----
commit 93652b349ca8107035465096370464053b558ef0
Author: hyukjinkwon <gurwls223@...>
Date: 2018-03-03T11:58:03Z
Supports custom line separator for text datasource
----