[ 
https://issues.apache.org/jira/browse/NIFI-14426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17940728#comment-17940728
 ] 

Piotr Zalas commented on NIFI-14426:
------------------------------------

Hi [~exceptionfactory], this is also my concern that legacy xls format isn't 
streamable, which doesn't go in line with current implementation. Looking at 
the code, into org.apache.nifi.excel.RowIterator class, it seems that access 
interface to files in both new and legacy format is encapsulated inĀ 
Workbook interface. So it seems that it wouldn't be hard to use StreamingReader 
for new format, while using HSSFWorkbook class for files in legacy format. I 
wonder if creating separate reader wouldn't lead to unnecessary code 
duplication. On the other hand end user might be surprised that depending on 
the file format file reading can consume surprising amount of resources, so 
maybe it's better to separate. WDYT?

> Add support for HSSF format in ExcelReader processor
> ----------------------------------------------------
>
>                 Key: NIFI-14426
>                 URL: https://issues.apache.org/jira/browse/NIFI-14426
>             Project: Apache NiFi
>          Issue Type: Improvement
>          Components: Extensions
>            Reporter: Piotr Zalas
>            Priority: Major
>
> Currently ExcelReader processor supports only files in new XSSF (.xlsx) 
> format. Add support for legacy HSSF (.xls) format.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to