zhtk commented on PR #9874: URL: https://github.com/apache/nifi/pull/9874#issuecomment-2858250013
After reading whole discussion I decided to remove type detection and to add `Input File Type` property. There are 2 reasons for that: 1. In case of passworded Excel file, to determine file type the whole Flow File stream must be read. At the moment the stream is resettable only up to 1024 * 1024 mark (i.e. in `createRecordReader` there is a call `in.mark(1024 * 1024);`). Reseting stream on files larger than 1 MB could be problematic. Working around this problem would require contributions to both POI and StreamingReader libraries. 2. Performance characteristics of reading XLS files is different than XLSX. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
