On Tue, 23 Aug 2011, Julien Nioche wrote:
We definitely don't get them in Tika. See docs attached (saved with OpenOffice )
It's probably worth putting these sample files on a tika issue so they don't get lost, and can be used in a future unit test
The next thing to check is probably to unit the .docx file, and see where the watermark text lives. If it's in the main document part then it should be farily easy to get for Tika. If it's in a different part, then a little bit of support will likely be needed on the POI side to allow easier access to it
Nick --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
