[
https://issues.apache.org/jira/browse/NIFI-11167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17790782#comment-17790782
]
Daniel Stieglitz edited comment on NIFI-11167 at 11/28/23 10:20 PM:
--------------------------------------------------------------------
[~iiojj2] I am not able to recreate the
java.lang.IllegalStateException
you are seeing. I was able to configure QueryRecord with an ExcelReader and a
CSVRecordSetWriter and filter out the second column. (NOTE: the FROM flowfile
is cut off in the QueryRecord configuration screenshot.) I ran this on
CentOS 7. Attached are screenshots of the configuration for each component
and the resulting CSV.
I did realize, though, that the ExcelReader was missing the 'additional
details' link, which has now been fixed with NIFI-12416. The 'additional
details' page specifies the column names given to the resulting data when
using the schema inference strategy. As you can see, those names are used in
the SQL query in QueryRecord and in the resulting CSV.
Are you running NiFi on Windows?
was (Author: JIRAUSER294662):
[~iiojj2] I am not able to recreate the
java.lang.IllegalStateException
you are seeing. I was able to configure QueryRecord with an ExcelReader and a
CSVRecordSetWriter and filter out the second column. Attached are the
configurations for each. I ran this on CentOS 7. Attached are screenshots of my
configuration and the resulting CSV.
I did realize, though, that the ExcelReader was missing the additional details
link, which has now been fixed with NIFI-12416. The additional details page
specifies the column names given to the resulting data when using the
schema inference strategy. As you can see, those names are used in the SQL
query in QueryRecord and in the resulting CSV.
Are you running NiFi on Windows?
> Add Excel Record Reader
> -----------------------
>
> Key: NIFI-11167
> URL: https://issues.apache.org/jira/browse/NIFI-11167
> Project: Apache NiFi
> Issue Type: New Feature
> Components: Extensions
> Reporter: David Handermann
> Assignee: Daniel Stieglitz
> Priority: Minor
> Fix For: 2.0.0-M1, 1.23.0
>
> Attachments: CSVRecordSetWriter_configuration.png,
> ExcelReaderConfiguration.png, QueryRecord_configuration.png, Test
> ExcelReader.xlsx, image-2023-11-28-18-22-07-446.png, resulting.csv,
> screenshot-1.png
>
> Time Spent: 10h 10m
> Remaining Estimate: 0h
>
> A new Excel Record Reader should be implemented to support reading XLSX
> spreadsheet rows as NiFi Records. This Reader will enable integration with
> various record-oriented components, obviating the need for the narrowly
> focused ConvertExcelToCSVProcessor. The initial version of the Excel Reader
> should not support the legacy binary XLS format.
> The ExcelReader should use a library that supports reading from a stream of
> rows to avoid consuming large amounts of heap memory during processing.
> The ExcelReader should support configurable properties to read selected
> sheets. With Excel supporting typed field values, some amount of field type
> mapping will be required. Additional input filtering properties should not be
> implemented as existing Processors like QueryRecord support a wide variety of
> filtering and projection use cases.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)