Hi Tika-dev community,

I'm new to Tika. We are using AutoDetectParser (from Tika 0.9) to parse files and send the parsed contents to Solr. We are facing severe performance issues when some large .xlsx, .docx and .pptx files are parsed. We have therefore decided to parse files only partially, for example the first 10 paragraphs of a document, the first 1000 words, or the first 2MB of content.
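To make the kind of cutoff we have in mind concrete, here is a rough plain-Java sketch. It does not use any Tika API: `firstChars` and `firstWords` are hypothetical helper names, and the code works on text that has already been extracted, so it only illustrates the limits and not how to make Tika itself stop parsing early.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Arrays;

class PartialTextSketch {

    // Hypothetical helper (not part of Tika): returns at most maxChars
    // characters from the reader and stops reading once the limit is hit.
    static String firstChars(Reader in, int maxChars) {
        StringBuilder out = new StringBuilder();
        char[] buf = new char[4096];
        try {
            while (out.length() < maxChars) {
                int want = Math.min(buf.length, maxChars - out.length());
                int n = in.read(buf, 0, want);
                if (n < 0) {
                    break; // end of input reached before the limit
                }
                out.append(buf, 0, n);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out.toString();
    }

    // Hypothetical helper (not part of Tika): keeps only the first maxWords
    // whitespace-separated words, joined by single spaces.
    static String firstWords(String text, int maxWords) {
        String[] words = text.trim().split("\\s+");
        int count = Math.max(0, Math.min(words.length, maxWords));
        return String.join(" ", Arrays.copyOf(words, count));
    }

    public static void main(String[] args) {
        String sample = "one two three four five";
        System.out.println(firstChars(new StringReader(sample), 7)); // one two
        System.out.println(firstWords(sample, 3));                   // one two three
    }
}
```

What we would ideally like is for the same kind of limit to apply during parsing, so that the rest of a large file is never processed at all.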
Please let me know whether there is any way to tell Tika to parse only part of a file.

Regards,
Baranee
