nfsantos commented on code in PR #955:
URL: https://github.com/apache/jackrabbit-oak/pull/955#discussion_r1205054631


##########
oak-doc/src/site/markdown/query/lucene.md:
##########
@@ -845,7 +850,17 @@ all the other components (e.g. `charFilters`, `Synonym`) are optional.
 
 #### Examples
 
-Adding stemming support
+To convert umlauts using ASCII folding, use:

Review Comment:
   Converting umlauts is only one example of what ASCII folding does, but the sentence above reads as if the example below applies to umlauts alone. Maybe rephrase to say that umlaut conversion is one case among several and show the effect: `Ü` -> `u`. Another example worth including: `açaí` -> `acai` (taken from the Elasticsearch documentation, https://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-asciifolding-tokenfilter.html).
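
The folding described in this comment can be approximated with the JDK alone, which may help when picking examples for the doc. The sketch below is not Lucene's `ASCIIFoldingFilter`; it assumes that Unicode NFD decomposition followed by removal of combining marks is close enough for accented Latin letters. The class name `FoldingSketch` and the method `fold` are hypothetical. Letters without a canonical decomposition, such as `ß`, pass through unchanged here. The inputs are written as Unicode escapes so the file compiles under any source encoding.

```java
import java.text.Normalizer;
import java.util.Locale;

// Hypothetical helper: approximates ASCII folding using only the JDK.
// This is NOT Lucene's ASCIIFoldingFilter; it is an illustration.
final class FoldingSketch {

    // Decompose each character (NFD), then drop the combining marks,
    // so that "c with cedilla" becomes "c" and "U with diaeresis" becomes "U".
    static String fold(String input) {
        String decomposed = Normalizer.normalize(input, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "");
    }

    public static void main(String[] args) {
        // \u00e7 = c with cedilla, \u00ed = i with acute, \u00dc = U with diaeresis
        String[] terms = {"a\u00e7a\u00ed", "\u00dc"};
        for (String term : terms) {
            String folded = fold(term);
            // Lower-casing is a separate step, done here only to mirror an
            // analyzer chain that also lower-cases its tokens.
            System.out.println(folded + " / " + folded.toLowerCase(Locale.ROOT));
        }
    }
}
```

Folding alone turns `Ü` into `U`; the lower-case `u` appears only once a lower-casing step is also applied, which is worth keeping in mind when wording the example.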



##########
oak-doc/src/site/markdown/query/lucene.md:
##########
@@ -762,10 +762,15 @@ defaults to 5
 
 #### <a name="analyzers"></a>Analyzers
 
+If no analyzer is specified, then `OakAnalyzer` is used, which uses the
+Apache Lucene `StandardTokenizer`, the `LowerCaseFilter`,
+and the `WordDelimiterFilter` with the following options:
+`GENERATE_WORD_PARTS`, `STEM_ENGLISH_POSSESSIVE`, and `GENERATE_NUMBER_PARTS`.
+
 `@since Oak 1.5.5, 1.4.7, 1.2.19`
-Unless custom analyzer is configured (as documented below), in-built analyzer
-can be configured to include original term as well to be indexed. This is
-controlled by setting boolean property `indexOriginalTerm` on analyzers node.
+Unless custom analyzer is explicitly configured (as documented below), the built-in analyzer
+can be configured to include the original term as well (`PRESERVE_ORIGINAL`). This is
+controlled by setting boolean property `indexOriginalTerm` on analyzers node:

Review Comment:
   ```suggestion
   controlled by setting boolean property `indexOriginalTerm` on the `analyzers` node:
   ```
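
For context, the property named in this suggestion would sit on an index definition roughly as sketched below. Only `indexOriginalTerm` and the `analyzers` node come from the diff. The index name `myIndex` is hypothetical, and the remaining properties are assumed to be the usual ones for a Lucene index definition.

```
/oak:index/myIndex
  - jcr:primaryType = "oak:QueryIndexDefinition"
  - compatVersion = 2
  - type = "lucene"
  - async = "async"
  + analyzers
    - indexOriginalTerm = true
```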


