[
https://issues.apache.org/jira/browse/DRILL-6096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Arina Ielchiieva updated DRILL-6096:
------------------------------------
Description:
Currently, there is no way for a user to specify the field delimiter for the
writing records as a text output. Further more, if the fields contain the
delimiter, we have no mechanism of specifying quotes.
By default, quotes should be used to enclose non-numeric fields being written.
Description of implemented changes:
2 options are added to control text writer output:
{{store.text.writer.add_header}} - indicates if header should be added in
created text file. Default is true.
{{store.text.writer.force_quotes}} - indicates if all value should be quoted.
Default is false. It means only values that contain special characters (line /
field separators) will be quoted.
Line / field separators, quote / escape characters can be configured using text
format configuration using Web UI. User can create special format only for
writing data and then use it when creating files.
{noformat}
"formats": {
"write_text": {
"type": "text",
"extensions": [
"txt"
],
"delimiter": "|",
}
},
...
{noformat}
was:
Currently, there is no way for a user to specify the field delimiter for the
writing records as a text output. Further more, if the fields contain the
delimiter, we have no mechanism of specifying quotes.
By default, quotes should be used to enclose non-numeric fields being written.
> Provide mechanisms to specify field delimiters and quoted text for
> TextRecordWriter
> -----------------------------------------------------------------------------------
>
> Key: DRILL-6096
> URL: https://issues.apache.org/jira/browse/DRILL-6096
> Project: Apache Drill
> Issue Type: Improvement
> Components: Storage - Text & CSV
> Affects Versions: 1.12.0
> Reporter: Kunal Khatua
> Assignee: Arina Ielchiieva
> Priority: Major
> Labels: doc-impacting
> Fix For: 1.17.0
>
>
> Currently, there is no way for a user to specify the field delimiter for the
> writing records as a text output. Further more, if the fields contain the
> delimiter, we have no mechanism of specifying quotes.
> By default, quotes should be used to enclose non-numeric fields being written.
> Description of implemented changes:
> 2 options are added to control text writer output:
> {{store.text.writer.add_header}} - indicates if header should be added in
> created text file. Default is true.
> {{store.text.writer.force_quotes}} - indicates if all value should be quoted.
> Default is false. It means only values that contain special characters (line
> / field separators) will be quoted.
> Line / field separators, quote / escape characters can be configured using
> text format configuration using Web UI. User can create special format only
> for writing data and then use it when creating files.
> {noformat}
> "formats": {
> "write_text": {
> "type": "text",
> "extensions": [
> "txt"
> ],
> "delimiter": "|",
> }
> },
> ...
> {noformat}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)