macdoor615 created NIFI-11989:
---------------------------------
Summary: ConvertExcelToCSVProcessor conversion will lose data if
header line of xlsx file contains empty merged cells
Key: NIFI-11989
URL: https://issues.apache.org/jira/browse/NIFI-11989
Project: Apache NiFi
Issue Type: Bug
Components: Extensions
Affects Versions: 1.23.2
Environment: Linux hb3-prod-hadoop-000 4.18.0-477.13.1.el8_8.x86_64 #1
SMP Tue May 30 14:53:41 EDT 2023 x86_64 x86_64 x86_64 GNU/Linux
Reporter: macdoor615
Attachments: header line with empty merged cell-1.xlsx, header line
with empty merged cell.csv, header line with empty merged cell.xlsx, header
line without empty merged cell.csv
In the "header line with empty merged cell-1.xlsx" file, above the cells with
contents "ca" and "cb" is an empty merged cell.
Convert the "header line with empty merged cell-1.xlsx" file with
ConvertExcelToCSVProcessor.
The output csv file "header line with empty merged cell.csv" loses the content
of the column after the empty merged cell, the content of the "dd" and ''ff"
cells and the following cells are gone.
As a comparison of the xlsx file, the merged cell has content, and the content
is "c", so the subsequent columns will not be lost.
The output is in the "header line without empty merged cell.csv" file
--
This message was sent by Atlassian Jira
(v8.20.10#820010)