[ 
https://issues.apache.org/jira/browse/NIFI-13988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17896683#comment-17896683
 ] 

Philipp Korniets edited comment on NIFI-13988 at 11/8/24 2:50 PM:
------------------------------------------------------------------

We have just started testing 2.0 after the official release and haven't migrated yet.
Currently we are on 1.23 and will switch to 1.28 first before moving to 2.0. We use
ExcelReader in a few places, but not everywhere in our flows.
So expect more testing with the upcoming migration ;)

1. The documentation says: "When using the "Infer Schema" strategy, the Reader
will determine the schema by first parsing all data in the FlowFile". Why
not do the same for "Use Starting Row", or at least offer it as an option?
2. I will check how ConvertExcelToCSV handles that.

Current workaround: Infer Schema plus RouteText with ${lineNo:gt(1)} to remove
the extra columns.
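
For illustration, here is a minimal sketch of the difference behind point 1. It is a simplified model and not NiFi's actual inference code: the class `StartingRowInference`, the method `inferType`, and the sample values are all hypothetical. It only shows how sampling the first 10 rows can yield a numeric type that a later empty string contradicts, while scanning all rows does not.

```java
import java.util.List;

class StartingRowInference {

    // Hypothetical rule (an assumption, not NiFi's logic): a column is LONG
    // only if every sampled value parses as a long, otherwise STRING.
    static String inferType(List<String> values, int sampleSize) {
        int limit = Math.min(sampleSize, values.size());
        for (int i = 0; i < limit; i++) {
            try {
                Long.parseLong(values.get(i));
            } catch (NumberFormatException e) {
                return "STRING";
            }
        }
        return "LONG";
    }

    public static void main(String[] args) {
        // Made-up column: ten numeric cells, then an empty string in row 11.
        List<String> column = List.of(
                "1", "2", "3", "4", "5", "6", "7", "8", "9", "10", "", "12");

        // Sampling only the first 10 rows never sees the empty string.
        System.out.println(inferType(column, 10));

        // Scanning every row does see it.
        System.out.println(inferType(column, column.size()));
    }
}
```

Under this model the sampled schema says LONG, and the reader later has to convert "" to a long, which is where the conversion error in the issue would come from.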


was (Author: iiojj2):
We have just started testing 2.0 after the official release and haven't migrated yet.
Currently we are on 1.23 and will switch to 1.28 first before moving to 2.0. We use
ExcelReader in a few places, but not everywhere in our flows.
So expect more testing with the upcoming migration ;)

1. The documentation says: "When using the "Infer Schema" strategy, the Reader
will determine the schema by first parsing all data in the FlowFile". Why
not do the same for "Use Starting Row", or at least offer it as an option?
2. I will check how ConvertExcelToCSV handles that.

> ExcelReader - Use Starting Row schema strategy and string empty values
> ----------------------------------------------------------------------
>
>                 Key: NIFI-13988
>                 URL: https://issues.apache.org/jira/browse/NIFI-13988
>             Project: Apache NiFi
>          Issue Type: Bug
>          Components: Core Framework
>    Affects Versions: 2.0.0
>            Reporter: Philipp Korniets
>            Assignee: Daniel Stieglitz
>            Priority: Major
>         Attachments: Test workbook NiFi2_0.xlsx
>
>
> When Use Starting Row is the schema strategy in ExcelReader, it analyses the
> first 10 rows. The problem appears with empty cells of *Numerical* type, which
> can appear anywhere after the first 10 rows. Such a cell *looks like* NULL, but
> is actually an empty string.
> An example data file is attached.
> The affected field is Exercise Price.
> Use Starting Row throws an error:
> {code:java}
> Caused by: java.lang.NumberFormatException: For input string: ""
>     at java.base/java.lang.NumberFormatException.forInputString(NumberFormatException.java:67)
>     at java.base/java.lang.Long.parseLong(Long.java:719)
>     at java.base/java.lang.Long.parseLong(Long.java:832)
>     at org.apache.nifi.serialization.record.util.DataTypeUtils.toLong(DataTypeUtils.java:1391)
>     at org.apache.nifi.serialization.record.util.DataTypeUtils.convertType(DataTypeUtils.java:213)
>     at org.apache.nifi.serialization.record.util.DataTypeUtils.convertType(DataTypeUtils.java:174)
>     at org.apache.nifi.excel.ExcelRecordReader.convert(ExcelRecordReader.java:170)
>     at org.apache.nifi.excel.ExcelRecordReader.lambda$getCurrentRowValues$0(ExcelRecordReader.java:127)
>     at java.base/java.util.stream.Streams$RangeIntSpliterator.forEachRemaining(Streams.java:104)
>     at java.base/java.util.stream.IntPipeline$Head.forEach(IntPipeline.java:617)
>     at org.apache.nifi.excel.ExcelRecordReader.getCurrentRowValues(ExcelRecordReader.java:114)
>     at org.apache.nifi.excel.ExcelRecordReader.nextRecord(ExcelRecordReader.java:84)
>     ... 28 common frames omitted{code}
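
The top frames of the trace can be reproduced in isolation: `Long.parseLong` rejects an empty string. The sketch below shows that, along with one possible way to treat blank cell text as a missing value. The class `EmptyCellParsing` and the helper `parseLongOrNull` are hypothetical names for illustration only; they are not part of NiFi and do not claim to be how the issue is or should be fixed.

```java
class EmptyCellParsing {

    // Hypothetical helper (not a NiFi API): map null or blank cell text to
    // null instead of handing it to Long.parseLong.
    static Long parseLongOrNull(String cellText) {
        if (cellText == null || cellText.isBlank()) {
            return null;
        }
        return Long.parseLong(cellText.trim());
    }

    public static void main(String[] args) {
        // Same call as the Long.parseLong frames in the stack trace above.
        try {
            Long.parseLong("");
        } catch (NumberFormatException e) {
            System.out.println("parseLong failed: " + e.getMessage());
        }

        // With the guard, an empty cell becomes a missing value.
        System.out.println(parseLongOrNull(""));
        System.out.println(parseLongOrNull("42"));
    }
}
```

Whether an empty string in a numeric column should become null or be rejected is a design decision for the reader; the sketch only demonstrates the first option.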



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
