[ 
https://issues.apache.org/jira/browse/NIFI-11167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17702685#comment-17702685
 ] 

David Handermann commented on NIFI-11167:
-----------------------------------------

Thanks [~dstiegli1]. The Sheets to Extract property makes sense, given that it 
is specific to Excel.

The Rows to Skip property seems reasonable, given that some spreadsheets 
contain headers and some do not, as well as other prefix material.

The Columns to Skip property does not seem necessary, as similar behavior 
can be achieved using the QueryRecord Processor, where undesired column 
names can be filtered out in a select statement.
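
As a rough illustration of the select-statement approach, the sketch below 
uses Python's sqlite3 module as a stand-in for the SQL engine behind 
QueryRecord. The column names (id, name, notes, quantity) and the sample rows 
are hypothetical, and the table is named FLOWFILE on the assumption that the 
query would be written against that name in QueryRecord.

```python
import sqlite3

# Hypothetical spreadsheet rows; column names and values are illustrative only.
rows = [
    ("A-100", "widget", "internal note", 3),
    ("A-101", "gadget", "draft", 7),
]

conn = sqlite3.connect(":memory:")
# sqlite3 stands in for QueryRecord here; FLOWFILE is the assumed table name.
conn.execute(
    "CREATE TABLE FLOWFILE (id TEXT, name TEXT, notes TEXT, quantity INTEGER)"
)
conn.executemany("INSERT INTO FLOWFILE VALUES (?, ?, ?, ?)", rows)

# Listing only the wanted columns drops "notes", so no reader-level
# Columns to Skip property is needed for this case.
query = "SELECT id, name, quantity FROM FLOWFILE ORDER BY id"
cursor = conn.execute(query)
selected_columns = [description[0] for description in cursor.description]
selected_rows = cursor.fetchall()
conn.close()
```

The same select statement, adjusted for the actual column names, is what 
would go into a QueryRecord query property.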

The Format Cell Values property looks like it could be renamed, as it 
translates to Read Styles for parsing files. Perhaps just calling it Read 
Styles, with an explanatory description, could work.



> Add Excel Record Reader
> -----------------------
>
>                 Key: NIFI-11167
>                 URL: https://issues.apache.org/jira/browse/NIFI-11167
>             Project: Apache NiFi
>          Issue Type: New Feature
>          Components: Extensions
>            Reporter: David Handermann
>            Assignee: Daniel Stieglitz
>            Priority: Minor
>
> A new Excel Record Reader should be implemented to support reading XLSX 
> spreadsheet rows as NiFi Records. This Reader will enable integration with 
> various record-oriented components, obviating the need for the narrowly 
> focused ConvertExcelToCSVProcessor. The initial version of the Excel Reader 
> should not support the legacy binary XLS format.
> The ExcelReader should use a library that supports reading from a stream of 
> rows to avoid consuming large amounts of heap memory during processing.
> The ExcelReader should support configurable properties to read selected 
> sheets. With Excel supporting typed field values, some amount of field type 
> mapping will be required. Additional input filtering properties should not be 
> implemented as existing Processors like QueryRecord support a wide variety of 
> filtering and projection use cases.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
