[ https://issues.apache.org/jira/browse/NIFI-11167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17702828#comment-17702828 ]
Daniel Stieglitz commented on NIFI-11167: ----------------------------------------- [~exceptionfactory] Thanks for the clarification on that. Another I question I had is there any issue of the records from each line returned from the Excel Reader varying in size (i.e. in terms of number of columns and values)? Since I am streaming I really do not know where the largest line with data is hence there is no way for me to know how much to pad each record with null column values. > Add Excel Record Reader > ----------------------- > > Key: NIFI-11167 > URL: https://issues.apache.org/jira/browse/NIFI-11167 > Project: Apache NiFi > Issue Type: New Feature > Components: Extensions > Reporter: David Handermann > Assignee: Daniel Stieglitz > Priority: Minor > > A new Excel Record Reader should be implemented to support reading XSLX > spreadsheet rows as NiFi Records. This Reader will enable integration with > various record-oriented components, obviating the need for the narrowly > focused ConvertExcelToCSVProcessor. The initial version of the Excel Reader > should not support the legacy binary XLS format. > The ExcelReader should use a library that supports reading from a stream of > rows to avoid consuming large amounts of heap memory during processing. > The ExcelReader should support configurable properties to read selected > sheets. With Excel supporting typed field values, some amount of field type > mapping will be required. Additional input filtering properties should not be > implemented as existing Processors like QueryRecord support a wide variety of > filtering and projection use cases. -- This message was sent by Atlassian Jira (v8.20.10#820010)