Baldwin, David
Tue, 05 Jan 2010 08:48:41 -0800
I need to get a handle on how much memory Tika needs to token-ize different file types. In other words, I need to find information on required overhead (including copies of buffers made if applicable) so that I can produce some kind of guidelines for memory possibly needed by users of the product I am working on which uses Lucene/Tika.
Now I realize that there is a lot of context that can be provided, I want to find out first, if anyone knows of already existing data/metrics on this. Much thanks in advance!