Hi Lukas, I am not entirely sure what you want to achieve (or what exactly you mean with “dealing with multi language content”), but trying to answer a bit:
Let’s say you have distinct content trees for different languages, like e.g. /content/en /content/jp Etc. You can choose to index all these trees in one (Lucene) index for full text search and filter the results in your query, i.e. Put the burden on the query engine. This is a simple setup which leads to a large index (although I personally have not seen this to be a problem) Alternatively, you can create different index definitions for each subtree (see [1]), e.g. Using the “includedPaths” property. This would lead to smaller indexes at the downside that you would have to create an index definition if you add a new language tree. This approach has the additional benefit that you can define language-specific Lucene analyzers for each sub tree, so that e.g. In the example above the Japanese index would have ist own analyzer. HTH Michael [1] http://jackrabbit.apache.org/oak/docs/query/lucene.html On 12/07/16 10:15, "Lukas Kahwe Smith" <[email protected]> wrote: >Aloha, > >I did a bit of search but didn’t find anything specific on any plans to >dealing with multi language content in any specific way inside Oak. >Specifically I am wondering as indexing all content from different languages >together can lead to suboptimal sorting and needless overhead. So are there >any plans to deal with this specifically? > >If not inside Oak, are there any projects on top of Oak (or inside AEM) that >deal with this? > >Or is this basically considered to be a case where one needs to plugin a >custom indexer and figure it out on your own? > >regards, >Lukas Kahwe Smith >[email protected] > > >
