Thanks a lot for the comments, Dominik. To answer your questions..
* How would you ensure feature-parity compared to HSSF/XSSF implementation? There are a large number of things that are possible in a workbook, do you plan to support all those or only a subset? Well.. I would like to start it with the XSSF implimentation , as I am not much familiar with the HSSF one. I am not looking to support a subset, coz no one is going to use it unless it supports some basic functionalities. * The text seems to indicate that there is already some code already available. Can we take a look? You can start a fork of Apache POI fromhttps://github.com/apache/poi/ easily and do the changes there so others can take a look and suggest improvements/changes. Or is it a standalone piece of code? No plans for a stand alone code as long as we can incorporate it with exising functionality. Since we already have a class (org.apache.poi.xssf.streaming.SXSSFWorkbook) that is dedicated to reduce memory consumption, I would like to start with it and see if this can be added as a feature to it. I will also take a look at the code to see if we can leverage any exisitng functionality. * How would you ensure that the code is maintained over time? As this sounds like quite a large chunk of code, are you planning to continue to invest some time in the long run? We had some cases where code was "donated", but never looked at afterwards, which is bad as it increases the code-base, but also increases number of bug-reports and areas that are not well covered by tests. :). I am not looking for a 'code donation' here. I'll be around for a long time. On Sun, Dec 25, 2016 at 4:19 PM, Renjith R <[email protected]> wrote: > I don't know if you are able to see the screenshot in my previous mail. > Following was your comment. > I would start working on it if you think it worths adding. > > *From: *Dominik Stadler <[email protected]> > *Subject: *Re: Suggestion on how to read huge excel files. > *Date: *2015-06-20 15:24 (+0530) > *List: *[email protected] > <https://lists.apache.org/[email protected]> > > It seems not that many people need similar functionality currently, > however it looks useful for handling very large documents. > > I looked at it and it looks good, some comments: > > * The finalize() in the Beans looks strange and should not be needed, > these members are freed anyway and having to implement finalize() > always looks fishy! > > Thanks... Dominik. > > > > On Sun, Dec 25, 2016 at 4:12 PM, Renjith R <[email protected]> > wrote: > >> Ok. I recall that. It was you who did the code review that time. >> >> >> >> >> On Sun, Dec 25, 2016 at 4:04 PM, Renjith R <[email protected]> >> wrote: >> >>> Thanks, Dominik. I'll try to resend it. >>> Let me know if you can see the attachments. >>> >>> On Fri, Dec 23, 2016 at 6:57 AM, Renjith R <[email protected]> >>> wrote: >>> >>>> Hi Developers, >>>> >>>> Couple of years back I suggested an enhancement to read very large >>>> excel files using StAX api. Attached the document. Unfortunately, I did not >>>> get a chance to work on it. Do you think it will make sense if I start >>>> working on it?. Kindly let me know your suggestions. >>>> >>>> regards, >>>> Renjith >>>> >>> >>> >> >
