[ 
https://issues.apache.org/jira/browse/DRILL-6096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arina Ielchiieva updated DRILL-6096:
------------------------------------
    Description: 
Currently, there is no way for a user to specify the field delimiter for the 
writing records as a text output. Further more, if the fields contain the 
delimiter, we have no mechanism of specifying quotes.

By default, quotes should be used to enclose non-numeric fields being written.

Description of implemented changes:

2 options are added to control text writer output:
{{store.text.writer.add_header}} - indicates if header should be added in 
created text file. Default is true.
{{store.text.writer.force_quotes}} - indicates if all value should be quoted. 
Default is false. It means only values that contain special characters (line / 
field separators) will be quoted.

Line / field separators, quote / escape characters can be configured using text 
format configuration using Web UI. User can create special format only for 
writing data and then use it when creating files.

{noformat}
  "formats": {
    "write_text": {
      "type": "text",
      "extensions": [
        "txt"
      ],
      "delimiter": "|",

    }
   },
...
{noformat}



  was:
Currently, there is no way for a user to specify the field delimiter for the 
writing records as a text output. Further more, if the fields contain the 
delimiter, we have no mechanism of specifying quotes.

By default, quotes should be used to enclose non-numeric fields being written.


> Provide mechanisms to specify field delimiters and quoted text for 
> TextRecordWriter
> -----------------------------------------------------------------------------------
>
>                 Key: DRILL-6096
>                 URL: https://issues.apache.org/jira/browse/DRILL-6096
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Storage - Text & CSV
>    Affects Versions: 1.12.0
>            Reporter: Kunal Khatua
>            Assignee: Arina Ielchiieva
>            Priority: Major
>              Labels: doc-impacting
>             Fix For: 1.17.0
>
>
> Currently, there is no way for a user to specify the field delimiter for the 
> writing records as a text output. Further more, if the fields contain the 
> delimiter, we have no mechanism of specifying quotes.
> By default, quotes should be used to enclose non-numeric fields being written.
> Description of implemented changes:
> 2 options are added to control text writer output:
> {{store.text.writer.add_header}} - indicates if header should be added in 
> created text file. Default is true.
> {{store.text.writer.force_quotes}} - indicates if all value should be quoted. 
> Default is false. It means only values that contain special characters (line 
> / field separators) will be quoted.
> Line / field separators, quote / escape characters can be configured using 
> text format configuration using Web UI. User can create special format only 
> for writing data and then use it when creating files.
> {noformat}
>   "formats": {
>     "write_text": {
>       "type": "text",
>       "extensions": [
>         "txt"
>       ],
>       "delimiter": "|",
>     }
>    },
> ...
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to