> > Still I'm
> > not sure when exactly Magnolia will index this or that document in DMS.
>
> It should be at save time, but I'm not 100% sure.
Oh, so there are things you actually do not know? Tss, we will have to
revisit this one-t-shirt-per-mail policy *LOL*
The indexing happens after the content is saved into the repository.
Whether it happens immediately or not depends on the SearchIndex
configuration in the workspace.xml.
By default Magnolia is configured to use 3 indexing threads:
<param name="extractorPoolSize" value="3" />
So the indexing happens asynchronously, shortly after the save (I
believe after the volatileIdleTime period is over). There is also a
timeout set for the extractor in case it is not finished with the
document in the given time, and there is a maximum backlog size that
defines how many documents can be waiting for extraction. This is
relatively well documented on the Jackrabbit website, and you can look
up all the parameters and their meaning there.
To finish off, if you set the extractorPoolSize to 0, then extraction
happens on the main thread and the save operation will not finish until
the extraction is done as well.
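
For illustration, the relevant part of workspace.xml might look roughly
like the sketch below. The parameter names are the Jackrabbit
SearchIndex ones mentioned above; apart from extractorPoolSize, the
values (and the index path) are placeholders I am assuming for the
example, not necessarily what your Magnolia installation ships with, so
check your own file and the Jackrabbit docs before changing anything.

```xml
<!-- Sketch only: every value except extractorPoolSize is an assumed placeholder -->
<SearchIndex class="org.apache.jackrabbit.core.query.lucene.SearchIndex">
  <!-- location of the index files (assumed path) -->
  <param name="path" value="${wsp.home}/index" />
  <!-- background threads used for text extraction; 0 means extract on the saving thread -->
  <param name="extractorPoolSize" value="3" />
  <!-- milliseconds granted to extraction before it is deferred to a background thread -->
  <param name="extractorTimeout" value="100" />
  <!-- how many documents may wait in the extraction queue -->
  <param name="extractorBackLogSize" value="100" />
  <!-- seconds of idle time before the in-memory (volatile) index is persisted -->
  <param name="volatileIdleTime" value="3" />
</SearchIndex>
```

If you want to rule out the asynchronous part while debugging, setting
extractorPoolSize to 0 in a test instance should make the save block
until extraction has finished.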
HTH,
Jan
>
> Sorry, I don't have much experience with PDF indexing, but are you sure
> that your PDFs are indexable? You can try to wrap PDFIndexer and log
> something, but it is not a quick debugging option...
>