Hi Lukas,

I am not entirely sure what you want to achieve (or what exactly you mean with 
“dealing with multi language content”), but trying to answer a bit:

Let’s say you have distinct content trees for different languages, like e.g.
/content/en
/content/jp
Etc.

You can choose to index all these trees in one (Lucene) index for full text 
search and filter the results in your query, i.e. Put the burden on the query 
engine.
This is a simple setup which leads to a large index (although I personally have 
not seen this to be a problem)

Alternatively, you can create different index definitions for each subtree (see 
[1]), e.g. Using the “includedPaths” property. This would lead to smaller 
indexes at the downside that you would have to create an index definition if 
you add a new language tree.
This approach has the additional benefit that you can define language-specific 
Lucene analyzers for each sub tree, so that e.g. In the example above the 
Japanese index would have ist own analyzer.

HTH
Michael

[1] http://jackrabbit.apache.org/oak/docs/query/lucene.html



On 12/07/16 10:15, "Lukas Kahwe Smith" <[email protected]> wrote:

>Aloha,
>
>I did a bit of search but didn’t find anything specific on any plans to 
>dealing with multi language content in any specific way inside Oak. 
>Specifically I am wondering as indexing all content from different languages 
>together can lead to suboptimal sorting and needless overhead. So are there 
>any plans to deal with this specifically?
>
>If not inside Oak, are there any projects on top of Oak (or inside AEM) that 
>deal with this?
>
>Or is this basically considered to be a case where one needs to plugin a 
>custom indexer and figure it out on your own?
>
>regards,
>Lukas Kahwe Smith
>[email protected]
>
>
>

Reply via email to