[
https://issues.apache.org/jira/browse/NIFI-11167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17791238#comment-17791238
]
Philipp Korniets edited comment on NIFI-11167 at 11/29/23 6:31 PM:
-------------------------------------------------------------------
To tell you the truth, i'd rather keep using ConvertExcelToCSV - much cleaner
and understandable for other users.
For us, keeping names of columns as they are is essential because this is how
we talk to data providers and business owners. When I construct a formula in
QueryRecords I clearly operate with column names from the file which everyone
can see and check themselves in the raw file.
P.S. still working on test file for
{noformat}
java.lang.IllegalStateException: This cell has a shared formula and it seems
setReadSharedFormulas has been set to false or the formula can't be evaluated
{noformat}
was (Author: iiojj2):
To tell you the truth, i'd rather keep using ConvertExcelToCSV - much cleaner
and understandable for other users.
For us, keeping names of columns as they are is essential because this is how
we talk to data providers and business owners. When I construct a formula in
QueryRecords I clearly operate with column names from the file which every see
and check themselves in the raw file.
P.S. still working on test file for
{noformat}
java.lang.IllegalStateException: This cell has a shared formula and it seems
setReadSharedFormulas has been set to false or the formula can't be evaluated
{noformat}
> Add Excel Record Reader
> -----------------------
>
> Key: NIFI-11167
> URL: https://issues.apache.org/jira/browse/NIFI-11167
> Project: Apache NiFi
> Issue Type: New Feature
> Components: Extensions
> Reporter: David Handermann
> Assignee: Daniel Stieglitz
> Priority: Minor
> Fix For: 2.0.0-M1, 1.23.0
>
> Attachments: CSVRecordSetWriter_configuration.png,
> ExcelReaderConfiguration.png, QueryRecord_configuration.png, Test
> ExcelReader.xlsx, image-2023-11-28-18-22-07-446.png,
> image-2023-11-29-15-51-08-386.png, resulting.csv, screenshot-1.png
>
> Time Spent: 10h 10m
> Remaining Estimate: 0h
>
> A new Excel Record Reader should be implemented to support reading XSLX
> spreadsheet rows as NiFi Records. This Reader will enable integration with
> various record-oriented components, obviating the need for the narrowly
> focused ConvertExcelToCSVProcessor. The initial version of the Excel Reader
> should not support the legacy binary XLS format.
> The ExcelReader should use a library that supports reading from a stream of
> rows to avoid consuming large amounts of heap memory during processing.
> The ExcelReader should support configurable properties to read selected
> sheets. With Excel supporting typed field values, some amount of field type
> mapping will be required. Additional input filtering properties should not be
> implemented as existing Processors like QueryRecord support a wide variety of
> filtering and projection use cases.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)