Re: formatting info in Header/FooterRecords in xls(x)

2016-01-13 Thread Nick Burch
On Mon, 4 Jan 2016, Allison, Timothy B. wrote: Over on TIKA-1730 [0], we have a request to hide formatting info from header/footer records for both xls and xlsx during text extraction. When I look at the text from FooterCell's getText(), it looks like we may want to add some parsing of the

RE: formatting info in Header/FooterRecords in xls(x)

2016-01-13 Thread Allison, Timothy B.
Thank you, Nick. Will take a look. -Original Message- From: Nick Burch [mailto:apa...@gagravarr.org] Sent: Wednesday, January 13, 2016 6:06 AM To: POI Developers List <dev@poi.apache.org> Subject: Re: formatting info in Header/FooterRecords in xls(x) On Mon, 4 Jan 2016, A

formatting info in Header/FooterRecords in xls(x)

2016-01-04 Thread Allison, Timothy B.
All, Over on TIKA-1730 [0], we have a request to hide formatting info from header/footer records for both xls and xlsx during text extraction. When I look at the text from FooterCell's getText(), it looks like we may want to add some parsing of the string to subcomponents for a