Dear Wiki user, You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.
The "MockParser" page has been changed by TimothyAllison:
https://wiki.apache.org/tika/MockParser?action=diff&rev1=2&rev2=3

  = MockParser =
  == Background ==
- So, you've tried Tika on a couple of files and all works well. Problem solved!
+ So, you've integrated Tika into your framework, tried it on a couple of thousand files and all works well. Problem solved!

  No.

- In very rare cases, Tika can do some really bad things. We try to fix these problems when we can, but if history is any indication (e.g. [[https://issues.apache.org/jira/browse/TIKA-1132|TIKA-1132]]), if you are processing millions/billions of files from the wild, you'll need to defend against:
+ In very, very rare cases, Tika can do some really bad things. We try to fix these problems when we can, but if history is any indication (e.g. [[https://issues.apache.org/jira/browse/TIKA-1132|TIKA-1132]] and [[https://issues.apache.org/jira/browse/TIKA-2040|TIKA-2040]] to name a few), if you are processing millions/billions of files from the wild, you'll need to defend against:

   1. Regular catchable exceptions
   2. !OutOfMemory errors which can put the JVM in an unreliable state
