[
https://issues.apache.org/jira/browse/PHOENIX-5258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Viraj Jasani updated PHOENIX-5258:
----------------------------------
Fix Version/s: 5.4.0
(was: 5.3.0)
> Add support to parse header from the input CSV file as input columns for
> CsvBulkLoadTool
> ----------------------------------------------------------------------------------------
>
> Key: PHOENIX-5258
> URL: https://issues.apache.org/jira/browse/PHOENIX-5258
> Project: Phoenix
> Issue Type: Improvement
> Reporter: Prashant Vithani
> Assignee: Prashant Vithani
> Priority: Minor
> Fix For: 5.4.0
>
> Attachments: PHOENIX-5258-4.x-HBase-1.4.001.patch,
> PHOENIX-5258-4.x-HBase-1.4.patch, PHOENIX-5258-master.001.patch,
> PHOENIX-5258-master.patch
>
> Time Spent: 40m
> Remaining Estimate: 0h
>
> Currently, CsvBulkLoadTool does not support reading a header from the input
> CSV and expects the content of the CSV to match the table schema. Header
> support can be added so that the input columns are mapped to the schema
> dynamically.
> The proposed solution is to introduce a new option for the tool,
> `--parse-header`. If this option is passed, the list of input columns is
> constructed by reading the first line of the input CSV file.
> * If there is only one file, read the header from the first line and
> generate the `ColumnInfo` list.
> * If there are multiple files, read the header from each file and throw an
> error if the headers do not match across files.
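
The two cases above can be sketched roughly as follows. This is a minimal,
language-neutral illustration in Python, not the code from the attached
patches: `parse_header` and `resolve_input_columns` are hypothetical names,
the first line of each file is assumed to have been read already, and the
real tool would turn the resulting names into a `ColumnInfo` list rather than
plain strings.

```python
import csv
import io


def parse_header(first_line, delimiter=","):
    """Split one CSV header line into column names (hypothetical helper)."""
    rows = list(csv.reader(io.StringIO(first_line), delimiter=delimiter))
    if not rows:
        raise ValueError("header line is empty")
    # Trim surrounding whitespace so "ID, NAME" and "ID,NAME" compare equal.
    return [name.strip() for name in rows[0]]


def resolve_input_columns(first_lines_by_file, delimiter=","):
    """Return the header shared by all files, or raise on any mismatch.

    first_lines_by_file maps a file path to the first line of that file.
    With a single file this simply returns that file's header.
    """
    if not first_lines_by_file:
        raise ValueError("no input files given")
    expected = None
    expected_file = None
    for path, line in first_lines_by_file.items():
        header = parse_header(line, delimiter)
        if expected is None:
            # The first file seen defines the reference header.
            expected, expected_file = header, path
        elif header != expected:
            raise ValueError(
                f"header mismatch: {path} has {header}, "
                f"but {expected_file} has {expected}"
            )
    return expected
```

The comparison here is order-sensitive, so two files listing the same columns
in a different order count as a mismatch. Whether the actual patch treats
that case as an error is not stated in the description.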
--
This message was sent by Atlassian Jira
(v8.20.10#820010)