On 8/22/07, Thilo Goetz <[EMAIL PROTECTED]> wrote:
> All the data that your analysis generates (with a few exceptions) lives
> on the heap. So depending on how many annotations you create, the heap
> may grow very large. It is usually several times the size of the input
> document. I've personally had applications where the CAS (most of which
> is the heap) would on average be about 50 times the size of the input
> document.

Just to clarify: a notable exception is that Strings aren't stored on the
heap. So storing long strings (including the document text) will not
increase the heap size. However, using large arrays *will* increase the
heap size.

-Adam
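
For a rough feel of how the heap can reach a multiple like the 50x mentioned above, here is a back-of-envelope sketch. It is not UIMA code, and every constant in it is an assumption made for illustration: a heap built from 4-byte cells, about 4 cells per annotation, and array contents counted cell by cell. String text is deliberately left out, since strings are not stored on the heap. The function names are hypothetical.

```python
# Back-of-envelope model of heap growth. All constants are illustrative
# assumptions, not values taken from the UIMA documentation.

BYTES_PER_CELL = 4  # assumption: the heap is made of 32-bit cells


def estimate_heap_bytes(num_annotations, cells_per_annotation=4, array_cells=0):
    """Estimate heap size from annotation count and array contents.

    String text is not counted, because strings live outside the heap.
    cells_per_annotation=4 is an assumed figure (e.g. type, sofa, begin, end).
    """
    if num_annotations < 0 or cells_per_annotation < 0 or array_cells < 0:
        raise ValueError("counts must be non-negative")
    total_cells = num_annotations * cells_per_annotation + array_cells
    return total_cells * BYTES_PER_CELL


def heap_to_document_ratio(heap_bytes, document_bytes):
    """Return how many times larger the heap is than the input document."""
    if document_bytes <= 0:
        raise ValueError("document_bytes must be positive")
    return heap_bytes / document_bytes


if __name__ == "__main__":
    document_bytes = 10_000  # hypothetical input document size
    heap = estimate_heap_bytes(num_annotations=25_000, array_cells=25_000)
    print(heap, heap_to_document_ratio(heap, document_bytes))
```

Under these assumed numbers, 25,000 annotations plus 25,000 array cells on a 10,000-byte document give a heap about 50 times the document size, while making the document text or other strings longer would not change the estimate at all.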
