[
https://issues.apache.org/jira/browse/OAK-11131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nuno Santos resolved OAK-11131.
-------------------------------
Fix Version/s: 1.72.0
Resolution: Done
> indexing-job: AOT Blob downloader may download blobs that are not needed for
> the indexes
> ----------------------------------------------------------------------------------------
>
> Key: OAK-11131
> URL: https://issues.apache.org/jira/browse/OAK-11131
> Project: Jackrabbit Oak
> Issue Type: Bug
> Components: indexing
> Reporter: Nuno Santos
> Priority: Minor
> Fix For: 1.72.0
>
>
> This is the current logic to decide to download a blob:
> - AOT Blob downloader must be enabled for one of the indexes being indexed
> - The node name must end with the suffix configured in AOT blob downloader
> - The node must be indexed by one of the indexers
> This will result in downloading too many blobs. Consider that we are
> reindexing fooBar-1, which includes path {{/content/dam}} and some other
> index fooBaz-1 that covers the full repository, not only {{/content/dam}}.
> AOT blob download is enabled for fooBar-1. Then any node appearing outside
> {{/content/dam}} that matches the suffix, will also pass all the tests and
> its blob will be downloaded, even though fooBar does not need the blob.
> The fix is to check if the node being considered for blob download is indexed
> by the indexes for which AOT download is enabled, instead of checking against
> all the indexes.
> This issue should not cause any failures. It makes the indexing job download
> more than it should and may slow down the indexing phase, but likely not
> slower than it would be without AOT Blob download. So It is not a blocker to
> use this feature, but should be fixed for better performance and avoid
> useless work.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)